Senior Manager, Hybrid Services & Reliability (SRE)

General Motors

Location:
United States, Austin, Texas

Contract Type:
Not provided

Salary:

201600.00 - 302000.00 USD / Year

Job Description:

As the Senior Engineering Manager for Hybrid Services & Reliability (HSR) within AV Core Infrastructure (ACI) at GM, you are the architect of our system trust. You will lead a newly seeded team responsible for the measurable availability of the hybrid cloud systems that underlie all autonomous vehicle development and operations. We need a leader who views reliability not as an afterthought, but as an inherent property of the platform, ensuring that all teams have a stable and ready-state engineering environment. You are comfortable operating systems at scale, not just designing them.

Job Responsibility:

  • Reliability Engineering: Define, measure, and enforce strict SLOs/SLIs for critical hybrid cloud services, including network connectivity and compute readiness
  • Foundational Utilities: Own and manage core on-prem utilities, such as DHCP, PXE, and CDN, to ensure seamless server auto-provisioning across the global fleet
  • Environment Integrity: Manage the entire data flow path, from initial ingestion at the test bench through the secure cloud network into production staging
  • HIL Readiness: Guarantee the 99%+ availability and stability of remote CI-based Hardware-in-the-Loop (HIL) benches required for AV safety validation
  • Organization Growth: Actively lead the recruitment and technical mentorship of Senior and Staff ICs as part of the team's expansion

Requirements:

  • Extensive background in Site Reliability Engineering (SRE) and defining SLO/SLI frameworks for hybrid cloud environments
  • Technical proficiency in managing on-prem Linux utilities (DHCP/PXE/NTP) and core development services
  • Opinionated view on automated observability, incident response, and MTTR reduction
  • Proven leadership experience

Nice to have:

  • Experience with configuration management tools (e.g., Chef, Ansible) for large-scale, remote hardware fleets

What we offer:
  • medical, dental, vision, Health Savings Account, Flexible Spending Accounts, retirement savings plan, sickness and accident benefits, life insurance, paid vacation & holidays, tuition assistance programs, employee assistance program, GM vehicle discounts
  • relocation benefits

Additional Information:

Job Posted:
March 03, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Senior Manager, Hybrid Services & Reliability (SRE)

Senior Engineer, Hybrid Cloud Fabric

Become a key player in GEICO's tech transformation! We are seeking a Senior or S...
Location:
United States, Palo Alto, CA; Dallas, TX; Seattle, WA
Salary:
100000.00 - 215000.00 USD / Year
Geico
Expiration Date
Until further notice
Requirements
  • Service mesh expertise (dev): familiar with mesh architecture, components, and configuration options, including advanced traffic management, security policies, and telemetry customization
  • Service mesh experience (ops): designed, implemented, and managed service mesh solutions at scale, addressing challenges related to performance, security, and observability
  • Programming skills: Experience with Go is a must; Rust is a bonus
  • Linux OS: In-depth knowledge of Linux operating systems, including performance tuning, troubleshooting, and security best practices
  • Networking: Advanced understanding of networking concepts and tools (e.g., iptables, netfilter, traffic shaping) for analyzing and optimizing service mesh performance within the hybrid cloud environment
  • Kubernetes and containerization: Extensive experience with Kubernetes and container orchestration platforms, including networking, security, and service management
  • Microservices architecture: Deep understanding of microservices design patterns, service discovery mechanisms, API gateways, and distributed tracing
  • Observability and monitoring: Expertise in tools like Prometheus, Grafana, Jaeger, and Kiali to monitor service mesh performance and troubleshoot issues
  • Security best practices: Knowledge of zero-trust security principles, authentication and authorization mechanisms, and encryption technologies within the context of service mesh
Job Responsibility
  • Design and implement a robust service mesh architecture, encompassing traffic management, security, observability, and resilience for microservices across public and private clouds within our on-premises data centers
  • Integrate the service mesh with existing infrastructure and applications, ensuring seamless operation and interoperability with various platforms and technologies, including legacy systems
  • Establish and enforce service mesh best practices, including security policies, traffic routing rules, circuit breakers, and access control mechanisms, to maintain a secure and reliable application environment
  • Develop comprehensive monitoring and observability dashboards to provide deep insights into service mesh health, performance, and potential issues, enabling proactive problem identification and resolution
  • Guide and mentor engineers on service mesh principles and best practices, fostering knowledge sharing and expertise development within the team, empowering them to contribute effectively to the service mesh implementation
  • Work closely with networking and security teams to ensure secure and efficient integration of the service mesh with on-premises infrastructure and networks, addressing potential challenges and ensuring smooth operation
  • Partner with SREs to establish service mesh observability, monitoring, and alerting strategies for maintaining high availability and performance, collaborating to define SLOs, SLIs, and error budgets
  • Actively engage with the Istio community, contribute to open-source projects, and represent GEICO's leadership in service mesh adoption
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility: We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Senior Staff Engineer, Hybrid Cloud Fabric

Become a key player in GEICO's tech transformation! We are seeking a Senior or S...
Location:
United States, Palo Alto; Dallas; Chevy Chase; Seattle
Salary:
120000.00 - 260000.00 USD / Year
Geico
Expiration Date
Until further notice
Requirements
  • Service mesh expertise (dev): familiar with mesh architecture, components, and configuration options, including advanced traffic management, security policies, and telemetry customization
  • Service mesh experience (ops): designed, implemented, and managed service mesh solutions at scale, addressing challenges related to performance, security, and observability
  • Programming skills: Experience with Go is a must; Rust is a bonus
  • Linux OS: In-depth knowledge of Linux operating systems, including performance tuning, troubleshooting, and security best practices
  • Networking: Advanced understanding of networking concepts and tools (e.g., iptables, netfilter, traffic shaping) for analyzing and optimizing service mesh performance within the hybrid cloud environment
  • Kubernetes and containerization: Extensive experience with Kubernetes and container orchestration platforms, including networking, security, and service management
  • Microservices architecture: Deep understanding of microservices design patterns, service discovery mechanisms, API gateways, and distributed tracing
  • Observability and monitoring: Expertise in tools like Prometheus, Grafana, Jaeger, and Kiali to monitor service mesh performance and troubleshoot issues
  • Security best practices: Knowledge of zero-trust security principles, authentication and authorization mechanisms, and encryption technologies within the context of service mesh
Job Responsibility
  • Design and implement a robust service mesh architecture, encompassing traffic management, security, observability, and resilience for microservices across public and private clouds within our on-premises data centers
  • Integrate the service mesh with existing infrastructure and applications, ensuring seamless operation and interoperability with various platforms and technologies, including legacy systems
  • Establish and enforce service mesh best practices, including security policies, traffic routing rules, circuit breakers, and access control mechanisms, to maintain a secure and reliable application environment
  • Develop comprehensive monitoring and observability dashboards to provide deep insights into service mesh health, performance, and potential issues, enabling proactive problem identification and resolution
  • Guide and mentor engineers on service mesh principles and best practices, fostering knowledge sharing and expertise development within the team, empowering them to contribute effectively to the service mesh implementation
  • Work closely with networking and security teams to ensure secure and efficient integration of the service mesh with on-premises infrastructure and networks, addressing potential challenges and ensuring smooth operation
  • Partner with SREs to establish service mesh observability, monitoring, and alerting strategies for maintaining high availability and performance, collaborating to define SLOs, SLIs, and error budgets
  • Actively engage with the Istio community, contribute to open-source projects, and represent GEICO's leadership in service mesh adoption
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility: We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Senior Lead Systems Operations Engineer

Wells Fargo is seeking a Senior Lead Systems Operations Engineer.
Location:
India, Bengaluru
Salary:
Not provided
Wells Fargo
Expiration Date
May 24, 2026
Requirements
  • 7+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 7+ years of experience in Systems Operations, SRE, Platform Engineering, or Production Support with deep expertise in at least one platform domain: Database, Cloud, Network, Compute/Storage, Middleware, or Enterprise Application Support
  • Strong hands-on experience applying SRE practices, including SLI/SLO definition, error budgets, and reliability metrics
  • Proven experience troubleshooting and resolving large-scale, distributed production systems
  • Hands-on experience with observability and monitoring tools such as Grafana, Splunk, Prometheus, Cribl, ThousandEyes, AppDynamics, or equivalent, including dashboards, alerting, logs, and metrics
  • Strong scripting and automation skills using Python, Bash, and/or PowerShell to reduce operational toil
  • Experience building automation or reliability tooling using APIs, Git-based workflows, and modern engineering practices
  • Solid understanding of incident, problem, and change management in enterprise production environments
  • Strong communication and influencing skills across engineering teams and senior leadership
  • Experience with capacity management, performance engineering, and resiliency design (HA, fault tolerance, RTO/RPO)
Job Responsibility
  • Act as an advisor to senior leadership to develop or influence platform support solutions for highly complex business and technical needs or technology initiatives
  • Lead highly complex, broad impact initiatives including provision of high-level systems consultation for the technology teams related to large scale planning of computer systems and network infrastructure for Systems Operations functional areas
  • Lead the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas or the enterprise, delivering solutions that are long-term, large-scale and require vision, creativity, innovation, advanced analytical and inductive thinking
  • Translate advanced technology experience, in-depth knowledge of the organization's tactical and strategic business objectives, the enterprise technological environment, the organization structure, and strategic technological opportunities and requirements into technical engineering solutions
  • Provide vision, direction and expertise to senior leadership on implementing innovative and significant business solutions
  • Maintain knowledge of industry best practices and new technologies and recommend innovations that enhance operations or provide a competitive advantage to the organization
  • Strategically engage with all levels of professionals and managers across the enterprise and serve as an expert advisor to leadership
  • Provide training and mentoring to less experienced team members on guidebook changes and lead team to meet technical deliverables, while leveraging solid understanding of technical process controls or standards
  • Act as a Platform Reliability Engineering (PRE) subject matter expert, providing deep technical leadership in one core domain (Database, Cloud, Network, Compute/Storage, Middleware, or Application Support)
  • Lead analysis and resolution of complex, systemic production reliability issues, translating recurring incidents into long-term engineering solutions
Employment Type:
Fulltime

Managing Vice President - Infrastructure Platforms & Operations

The Managing Vice President, Infrastructure Platforms & Operations is a senior t...
Location:
United States, Bethesda
Salary:
215700.00 - 389700.00 USD / Year
Marriott Bonvoy
Expiration Date
May 20, 2026
Requirements
  • Bachelor's degree in Computer Science, Information Systems, Engineering, Business Administration, or related technical field
  • 15+ years of senior leadership experience across cloud engineering, infrastructure platforms, network services, and/or enterprise workplace technologies, preferably in a large global Fortune 500 organization
  • 10+ years of prior hands-on technical engineering or development experience (cloud, infrastructure, networking, automation, or enterprise platforms)
  • Demonstrated success leading large, multi-disciplinary global engineering and operations organizations
  • Deep expertise in multi-cloud platforms, network architecture, DevSecOps, automation, and reliability engineering
  • Strong experience partnering with cybersecurity teams to deliver secure by design platforms
  • Proven ability to influence senior executives and lead transformation in complex, matrixed enterprises
  • Strong financial acumen with experience managing large technology budgets and vendor portfolios
Job Responsibility
  • Lead global teams responsible for cloud foundations, DevOps and CI/CD platforms, automation, container platforms, service mesh, and self-service engineering capabilities
  • Oversee enterprise cloud landing zones across all regions, ensuring secure, scalable, and cost-efficient architecture
  • Drive modernization of hybrid platforms, including datacenter, edge compute, and infrastructure engineering capabilities
  • Oversee SRE, observability, resiliency, and disaster recovery governance
  • Lead global network architecture and operations across datacenter networks, property connectivity, enterprise networks, and cloud network integration
  • Drive transformation of Marriott's global connectivity ecosystem, including SD WAN, wireless, secure network edge, voice, and network automation
  • Ensure network performance, reliability, compliance, and resiliency at global scale
  • Lead workplace technology platforms supporting collaboration, productivity, endpoint, and digital employee experience solutions
  • Partner with business, HR, and IT leaders to deliver intuitive, reliable, and secure workplace tools that enable associate productivity
  • Drive standardization, modernization, and lifecycle management of workplace platforms and services
What we offer
  • 401(k) plan
  • stock purchase plan
  • discounts at Marriott properties
  • commuter benefits
  • employee assistance plan
  • childcare discounts
  • medical
  • dental
  • vision
  • health care flexible spending account
Employment Type:
Fulltime

Senior Staff Engineer – Change Management

GEICO is seeking an experienced Software Engineer who is passionate about buildi...
Location:
United States, Chevy Chase; Austin; New York City; Seattle; Palo Alto
Salary:
110000.00 - 260000.00 USD / Year
Geico
Expiration Date
Until further notice
Requirements
  • Expertise in at least two modern programming languages (Go, Python, Java, C, C++) and object-oriented design
  • Strong ownership and accountability with excellent communication and collaboration skills
  • Hands-on experience in incident response, troubleshooting, and root cause analysis
  • Experience managing distributed systems in public, private, or hybrid cloud environments
  • Experience with monitoring, logging, and observability tools (Prometheus, Grafana, OpenTelemetry, Loki)
  • Passion for automation and reducing manual operations using tools like Terraform and Ansible
  • Familiarity with configuration management and orchestration tools (Helm, Puppet, Spinnaker)
  • Experience with CI/CD pipelines, Infrastructure as Code (IaC), and cloud-based deployments
  • Ability to operate in a fast-paced, high-scale environment with a problem-solving mindset
  • 10+ years of professional experience in software development, platform architecture, and infrastructure management
Job Responsibility
  • Develop and drive the overall strategy for our enterprise Change and Approval Management, aligning it with the organization's business goals and objectives
  • Lead technical initiatives across multiple teams, providing strategic and technical guidance
  • Utilize programming languages like Go, Python, Java, and work with SQL/NoSQL databases
  • Work with container orchestration tools such as Docker, Kubernetes, and OpenStack
  • Architect and develop cloud-native applications using Azure services
  • Collaborate with product managers, engineering teams, and stakeholders to solve complex challenges
  • Ensure the quality, performance, and usability of engineering solutions
  • Serve as a mentor and thought leader, coaching engineers and influencing executives
  • Continuously improve processes, adopt best practices, and drive operational efficiency
  • Support and participate in on-call rotations, respond to incidents, diagnose production issues, and conduct post-incident reviews to improve system reliability
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility: We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment Type:
Fulltime

Principal Engineer-Site Reliability Engineering and AIOps

We are looking for a Principal Engineer to set the enterprise technical directio...
Location:
India, Hyderabad
Salary:
Not provided
Wells Fargo
Expiration Date
May 10, 2026
Requirements
  • 7+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 7+ years of engineering experience, including principal-level technical leadership on large-scale reliability, production operations, or platform programs across complex environments
  • 7+ years of software engineering experience (e.g., Java, C#, Python) with demonstrated expertise in system design and distributed systems
  • Track record of delivering reusable automation and platform capabilities adopted by multiple teams
  • 5+ years operating Linux/Unix and Windows platforms in production, including performance tuning, capacity planning, and reliability hardening for mission-critical services
  • 5+ years designing and operating cloud solutions (public and/or private cloud), including reliability and security architecture, infrastructure-as-code, and cost-aware engineering at scale
  • 5+ years leading reliability and operations practices for enterprise-scale, highly available services, including major incident leadership, problem management, and establishing operational readiness mechanisms
  • 5+ years architecting and scaling full-stack observability solutions, including instrumentation standards, alert strategy, service dashboards, and governance that improves signal quality and reduces noise
  • 5+ years with automation and observability toolsets (e.g., Ansible, Grafana, Elastic, Splunk, Prometheus) and experience building reusable components, templates, and paved paths integrated with CI/CD
  • Exceptional communication and influence skills, including the ability to align senior stakeholders, drive technical decisions across organizations, and clearly articulate risk, tradeoffs, and recommended paths forward
Job Responsibility
  • Act as an advisor to leadership to develop or influence applications, network, information security, database, operating systems, or web technologies for highly complex business and technical needs across multiple groups
  • Lead the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas or the enterprise, delivering solutions that are long-term, large-scale and require vision, creativity, innovation, advanced analytical and inductive thinking
  • Translate advanced technology experience, an in-depth knowledge of the organization's tactical and strategic business objectives, the enterprise technological environment, the organization structure, and strategic technological opportunities and requirements into technical engineering solutions
  • Provide vision, direction and expertise to leadership on implementing innovative and significant business solutions
  • Maintain knowledge of industry best practices and new technologies and recommend innovations that enhance operations or provide a competitive advantage to the organization
  • Strategically engage with all levels of professionals and managers across the enterprise and serve as an expert advisor to leadership
  • Set and evangelize the SRE and AIOps technical strategy for EFT, establishing reference architectures, standards, and guardrails (service tiering, onboarding criteria, SLO/error budget governance) and holding teams accountable through transparent executive-level reporting
  • Act as a principal-level technical advisor and multiplier: mentor senior engineers, contribute to hiring and technical bar-raising, and define reliability patterns and guardrails across applications, networks, databases, operating systems, and web technologies
  • Own the reliability and observability architecture across hybrid/multi-cloud, driving standardization of monitoring, logging, tracing, synthetics, and resilience/chaos testing; define platform patterns that teams can adopt with minimal friction
Employment Type:
Fulltime

Senior IAM Automation Engineer

We’re seeking a Senior IAM Automation Engineer to transform how Apex manages wor...
Location:
United States, Austin
Salary:
108800.00 - 136000.00 USD / Year
Apex Clearing
Expiration Date
Until further notice
Requirements
  • 7-10+ years in DevOps, SRE, or software engineering roles with significant IAM/identity automation focus
  • Demonstrated experience building automation solutions for enterprise IAM platforms using APIs, scripting, and infrastructure-as-code
  • Track record of implementing workflow automation or orchestration platforms in production environments
  • Understanding of both technical IAM implementations and business processes (joiner/mover/leaver, access requests, compliance)
  • Experience working in hybrid on-premises and cloud environments
  • Software development proficiency - 5+ years writing production code (Python, PowerShell, Go, or similar) with strong API and SDK integration experience
  • IAM architecture skills - Deep understanding of SSO protocols (SAML, OIDC), provisioning standards (SCIM), directory services (Active Directory, Entra ID), and enterprise IAM platforms (Okta strongly preferred)
  • Infrastructure-as-Code mastery - Hands-on experience with Terraform, Ansible, or similar tools, plus CI/CD pipelines for automated deployments
  • DevOps/SRE practices - Experience building observable, reliable systems with appropriate monitoring, logging, and incident response capabilities
  • Workflow automation platforms - Demonstrated ability to implement and govern low-code/code-first automation tools (Tines, Workato, n8n, or similar)
Job Responsibility
  • Lead Tines platform implementation and governance - Define technical standards, architect RBAC models, and build workflows that automate employee lifecycle management, access requests, and certification campaigns
  • Build infrastructure-as-code for identity systems - Develop and maintain Terraform, PowerShell, and Python automation across hybrid infrastructure (on-prem AD/Adaxes, Entra ID, Okta, AWS IAM, GCP/GCI) to enable repeatable, version-controlled deployments with proper change management
  • Design API-driven automation and integrations - Architect scalable solutions that orchestrate identity workflows across HRIS (Workday), ticketing (ServiceNow), collaboration platforms (Slack, Teams, M365), and enterprise applications, leveraging APIs and SDKs to eliminate manual processes
  • Implement observability and self-healing capabilities - Build monitoring, alerting, and automated remediation for identity systems to reduce operational toil, improve reliability, and enable proactive issue detection across authentication flows and provisioning processes
  • Enable rapid application onboarding - Create automation frameworks and integration patterns that allow the business to onboard new SaaS applications with minimal manual intervention while maintaining security and compliance standards
  • Pioneer non-human identity (NHI) governance - Partner with SecOps to develop policies, controls, and automation for managing AI agents, LLM API keys, service accounts, bot identities, and machine-to-machine authentication as AI adoption accelerates across the organization
  • Mentor and develop junior team members - Share your hard-won experience and technical expertise to elevate the team’s capabilities. Conduct code reviews, pair programming sessions, and knowledge transfer that builds automation skills, IAM expertise, and engineering judgment across the team
  • Drive technical innovation in the identity space - Evaluate emerging tools and practices, establish CI/CD pipelines for IAM deployments, and leverage AI-powered development tools (LLMs, code generation, AI assistants) responsibly to accelerate automation delivery and stay ahead of business needs
What we offer
  • Healthcare benefits (medical, dental and vision, EAP)
  • competitive PTO
  • 401k match
  • parental leave
  • HSA contribution match
  • paid subscription to the Calm app
  • generous external learning and tuition reimbursement benefits
Employment Type:
Fulltime

Senior Network Engineer

Bumble is seeking a Network Engineer to maintain a stable, predictable, controll...
Location:
United Kingdom, London
Salary:
Not provided
Bumble Inc.
Expiration Date
Until further notice
Requirements
  • 3+ years of hands-on Linux systems engineering experience (preferably rpm-based distributions such as RHEL or CentOS)
  • Strong diagnostic and troubleshooting skills spanning application performance, traffic-delivery issues, and complex multi-layer networking challenges
  • Deep understanding of networking across L1–L4 and L7, including copper/optics, Ethernet, and static/dynamic routing
  • Production experience with IS-IS and BGP (OSPF familiarity beneficial)
  • Extensive hands-on experience with Juniper MX, SRX, and QFX devices
  • Practical experience implementing and supporting EVPN-VXLAN architectures
  • Strong background in load balancing (CARP, IPVS, userspace, or enterprise solutions) and packet filtering
  • Experience building and supporting cloud networking architectures (VPC structures, virtual routing, firewalling, hybrid connectivity, etc.)
  • Proficiency with 802.1X, 802.1Q, and bonding/teaming at both the server and network hardware layers
  • Strong diagnostic capabilities with IPv4, ICMP, TCP, UDP, DHCP, and DNS (IPv6 is a plus)
Job Responsibility
  • Support and evolve Bumble’s global network infrastructure across multiple data centres and offices, including diagnostics of network subsystems within Linux servers (primarily CentOS/RHEL)
  • Improve network reliability and operational efficiency through configuration management, automation, and continuous optimisation of BAU tasks
  • Contribute to the design, implementation, and operation of cloud networking as we migrate a significant portion of our workloads into cloud environments
  • Collaborate closely with Systems Engineering and SRE teams, sharing networking expertise, participating in design reviews, and shaping resilient, secure platform architectures
  • Manage relationships with global service providers, including IP transit operators, to ensure optimal performance, availability, and accountability
  • Own IP address management, including subnet allocation, VLAN design, and maintaining accurate documentation
  • Strengthen Bumble’s security posture by contributing to perimeter defence, segmentation strategy, and proactive threat prevention
  • Participate in the on-call rota to maintain platform availability and support timely incident response