Staff Operations Engineer

Canada; Germany; United States · Employment contract · Job Posted June 14, 2026

Job Description

A Staff Operations Engineer leads the design, reliability, and evolution of hybrid-cloud and workplace infrastructure. This senior role spans teams, drives complex initiatives, sets technical direction, and ensures systems are scalable, secure, and efficient. This role combines technical execution with leadership, shaping architecture both directly and collaboratively.

Job Responsibility

  • Own and evolve architecture within a defined infrastructure domain
  • Design and implement scalable, reliable systems spanning multiple teams or environments
  • Establish and promote best practices, patterns, and standards within the domain
  • Contribute to medium- and long-term technical strategy (typically 6-18 months)
  • Lead delivery of ambiguous, high-impact infrastructure projects
  • Break down complex system problems into implementable solutions
  • Drive migrations, re-architectures, and performance/reliability improvements
  • Remain hands-on with critical systems and implementations
  • Work across teams (IT, SRE, Security, Service Owners) to unify solutions
  • Influence technical decisions through design reviews and collaboration
  • Ensure systems integrate cleanly across infrastructures (office, DC, cloud)
  • Improve system reliability through monitoring, alerting, and operational design
  • Contribute to defining SLIs/SLOs and capacity planning within the domain
  • Participate in and lead root cause analysis for complex incidents
  • Reduce operational toil through automation and system improvements
  • Design and support core infrastructure components (compute, DNS, networking, identity, etc.)
  • Drive improvements in performance, scalability, and dependability
  • Contribute deep expertise in at least one area (e.g., DNS, network architecture, cloud infra)
  • Build and improve automation using scripting and Infrastructure as Code
  • Contribute to internal tooling and platform improvements
  • Promote repeatable, standardized approaches to system management
  • Mentor engineers and guide system design and troubleshooting
  • Raise the technical quality of the team through reviews and shared practices
  • Act as a go-to resource within the domain
  • Maintain clear documentation, diagrams, and runbooks for systems owned
  • Ensure systems are understandable and operable by others
  • Contribute to knowledge sharing across teams

Requirements

  • 6+ years of experience in systems engineering or infrastructure roles
  • Strong experience designing and operating production infrastructure
  • Solid expertise in: VMware; Cisco UCS; application/network load balancers; Linux/Unix operating systems; networking fundamentals (DNS, TCP/IP, routing, firewalls); data center environments
  • Demonstrated ability to lead complex technical work across teams

Nice to have

  • Experience with Infrastructure as Code (Puppet, Ansible, etc.)
  • Python
  • Familiarity with observability tooling and reliability practices
  • Experience with containerization and modern platform tooling
  • Exposure to security best practices in infrastructure design
  • GitHub Enterprise administration: user access management, GitHub org/repo management, SSO/SCIM integration, policy enforcement

What we offer

  • Generous performance-based bonus plans for all eligible employees
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting (regardless of whether you contribute)
  • Quarterly all-company wellness days where everyone takes a pause together
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Other benefits (life/AD&D, disability, EAP, etc. - varies by country)
  • Flexible work environment (majority of Mozillians work remotely)
  • Industry-leading paid parental leave (up to 26 weeks of fully paid leave for childbearing parents and up to 12 weeks for non-childbearing parents)
  • Reimbursement for professional development (up to $3,000/year)
  • A work setup including the latest hardware and software of your choice

Similar Jobs for Staff Operations Engineer

8 matching positions

Sr. Staff Engineer, Operations Engineer

At GEICO, we offer a rewarding career where your ambitions are met with endless ...
Location: United States, Seattle; Palo Alto; Chevy Chase
Salary: 130000.00 - 260000.00 USD / Year
Company: Geico
Expiration Date: Until further notice
Requirements
  • Knowledge of infrastructure technologies in a hybrid cloud environment, including containerization, VMs, CI/CD pipelines, and IaC
  • Extensive experience in engineering and solution delivery in a dynamic service provider environment
  • Strong program and project management skills with proven experience coordinating projects across multiple teams, with successful project/product delivery at scale
  • Working knowledge of security services and their impact on production systems including runtime protection services, detective and protective agents and/or daemon sets, vulnerability and application scanning, etc.
  • Experience in a multi-platform environment with Linux, Mac, Windows
  • Experience communicating with and presenting to senior and junior staff, with the ability to influence stakeholders
  • Detail and deadline oriented with effective organizational and analytic skills
  • Strong critical thinking, problem solving, decision making, and analytical skills
  • Outstanding time management skills and attention to detail
  • Excellent verbal/written communication skills, including the ability to clearly document findings, proposals, issues, and status
Job Responsibility
  • Monitor and track signals of security gaps, initiative delays, compliance risks due to system issues, and drive resolution
  • Create visuals on current state of the union related to security engineering
  • Help to develop standards on reporting tool effectiveness, maturity, resilience and other factors in determining risks as they come up
  • Help drive automation of routine tasks to drive growth in security protection and detection technologies
  • Provide expert guidance, demonstrations and lead discussions on security best practices to stakeholders and leadership
  • Work in lockstep with our CSIRT, GRC, Tech, and partner teams to ensure protection coverage, proper detection event notifications, documentation, and standards we can all use
  • Organize, store and manage operational best practices documentation for security solutions to protect our platforms including endpoint, cloud, collaboration, identity and network
  • Partner with the project sponsors, delivery teams, and stakeholders to deliver quality solutions on time and within budget by coordinating project activities across multiple systems, departments, and teams
  • Create, maintain, and actively manage a detailed project schedule, change control process, and documentation
  • Identify and raise appropriate security risks, in addition to presenting detailed and implementable solutions or alternatives
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation, a 401(k) savings plan vested from day one that offers a 6% match, performance and recognition-based incentives, and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Workplace flexibility, including our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
Employment type: Full-time

Senior Staff Engineer - Operations Research

The mission of the Surge team is to maintain overall marketplace reliability by ...
Location: United States, Sunnyvale; San Francisco; New York
Salary: 267000.00 - 297000.00 USD / Year
Company: Uber
Expiration Date: Until further notice
Requirements
  • PhD in relevant fields (Operations Research, Industrial Engineering, Computer Science) with a focus on optimization modeling
  • 6+ years of industry experience developing algorithms and models for large-scale deployment
  • Experience with optimization packages such as Gurobi, CPLEX, and OR Tools
  • Experience with two-sided marketplace design, pricing optimization, matching/allocation
  • Strong communication skills and ability to work effectively with cross-functional partners
  • Proficiency in one or more coding languages such as Python, Java, Go, or C++
Job Responsibility
  • Work with a mixed team of Engineers, Operations Researchers, and Economists
  • Build new scalable algorithms for real-time pricing of Uber's products across hundreds of global marketplaces
  • Help set the team's technical direction and roadmap
  • Communicate with leadership, identify new opportunities, and help guide the growth of more junior engineers
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible for various benefits
Employment type: Full-time

Staff Engineer | Group Operations

Every parcel we deliver starts with an engineering decision. At Instabee, we don...
Location: Sweden, Stockholm
Salary: Not provided
Company: Instabee
Expiration Date: Until further notice
Requirements
  • A Technical Leader — You have broad experience building distributed systems and can navigate deep technical dives and high-level architectural trade-offs with equal confidence.
  • Strategic & Pragmatic — You know that 'perfect is the enemy of good.' You favor an iterative mindset, delivering value early and often rather than waiting for the ideal solution.
  • Operationally Curious — You're fascinated by the intersection of software and physical operations. Experience in logistics, routing, or hardware integration is a significant plus.
  • A Communicator & Collaborator — You can translate complex technical concepts for non-technical stakeholders and thrive in a cross-functional environment where engineering, product, and business work closely together toward shared goals.
  • A Quality Advocate — You hold a high bar for stability and security — but you know how to achieve it without killing the pace of innovation.
Job Responsibility
  • Drive Technical Strategy & Architecture
  • Tackle Genuinely Hard Problems
  • Champion Developer Experience
  • Bridge Business & Tech
  • Mentor & Build Culture
  • Enable Data-Driven Decisions
What we offer
  • The Best View in Town
  • Dog-Friendly Culture
  • Unwind & Recharge
  • Family First
  • Birthday Gift
  • Work-Life Harmony
Employment type: Full-time

Staff Network Operations Engineer

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’r...
Location: United States, Sunnyvale; San Francisco
Salary: 193000.00 - 234000.00 USD / Year
Company: Crusoe
Expiration Date: Until further notice
Requirements
  • 10+ years of related experience building and operating at scale in a production environment
  • In-depth knowledge of network protocols including TCP/IP, QoS, BGP, OSPF/IS-IS, EVPN, VXLAN, and MPLS-related technologies like RSVP-TE, LDP, etc.
  • Good understanding of network monitoring protocols and tools, such as SNMP, IPFIX, sFlow/NetFlow, and telemetry
  • Experience with tools like Kentik, Arbor, ThousandEyes, Catchpoint, Packet Design, etc.
  • Familiar with data center network architecture, such as Fat Tree architecture, CLOS, BGP-TE, and peering for edge
  • Hands-on experience with major network devices like Mellanox, Cisco, Arista, Juniper, and other mainstream vendors
  • Familiar with mainstream commercial switch/router chipsets, such as Broadcom, Barefoot, etc.
  • In-depth knowledge of public cloud architecture connectivity options to AWS, GCP, Azure, Ali Cloud, OCI, etc.
  • Good understanding of IPv6 and IPv4-IPv6 coexistence technologies
  • Self-motivated, with good communication and writing skills
Job Responsibility
  • Manage and optimize Crusoe Energy Cloud's global network, including edge, backbone, data center, and public cloud connectivity
  • Collaborate with Network Engineering and cross-functional teams, including but not limited to Software Infrastructure and Product, to drive the innovation and evolution of the Crusoe Energy Cloud network
  • Lead operational excellence initiatives—developing monitoring, alerting, and self-healing systems to ensure high network availability
  • Perform advanced troubleshooting and root cause analysis for incidents, guiding post-mortem reviews and improvements
  • Mentor network engineers and establish best practices for incident response, documentation, and operational readiness
  • Be part of 24/7 on-call support for the Crusoe network
What we offer
  • Restricted Stock Units in a fast-growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • Pet-friendly offices
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
Employment type: Full-time

Staff Observability Operations Engineer

We are currently seeking several experienced and highly skilled Staff Observabil...
Location: United States, Hartford
Salary: 130295.00 - 260590.00 USD / Year
Company: CVS Health
Expiration Date: Until further notice
Requirements
  • 7+ Years of experience in IT operations, with significant responsibilities in system monitoring, performance tuning, and troubleshooting enterprise applications
  • 5+ Years in a Site Reliability Engineering (SRE) role deploying and managing modern observability solutions
  • 5+ Years managing and implementing observability and event management platforms (e.g., AppDynamics, Splunk, Prometheus, Grafana)
  • Experience developing and administering ServiceNow ITOM event management solutions
  • Experience deploying and managing service reliability platforms (e.g., xMatters, OpsGenie, PagerDuty)
  • Experience with and deep knowledge of cloud environments, cloud monitoring platforms, and container orchestration tools (e.g., AWS/CloudTrail, Azure/Monitor, GCP/GCM, Kubernetes, OpenShift)
  • Proficiency in Python and other scripting or automation tooling (e.g., Ansible, PowerShell, Bash) for automation and configuration
  • Hands-on experience deploying, managing, and administering observability platforms
  • Hands-on experience leading, coordinating, and performing migration of application, platform, and infrastructure observability solutions
  • Proven ability to troubleshoot and resolve complex technical issues
Job Responsibility
  • Deploy and implement modern observability solutions
  • Manage and administer observability and event management platforms
  • Coordinate and manage release cycles for observability platforms
  • Troubleshoot and resolve incidents related to observability platforms
  • Continuously monitor and enhance platform performance
  • Collaborate with cross-functional stakeholders
  • Provide training and mentoring to junior engineers
  • Ensure compliance and security of observability platforms
  • Maintain documentation of observability platform configurations
  • Generate and analyze reports on platform performance and capacity
What we offer
  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan
  • No-cost programs for all colleagues, including wellness screenings, tobacco cessation and weight management programs, confidential counseling, and financial coaching
  • Paid time off, flexible work schedules, family leave, dependent care resources, and colleague assistance programs
Employment type: Full-time

Staff Operations AI Engineer

We're looking for a Staff Operations AI Engineer who will architect, build, and ...
Location: United States, Denver
Salary: 150000.00 - 176000.00 USD / Year
Company: Checkr
Expiration Date: Until further notice
Requirements
  • 7+ years in systems engineering, automation platforms, integration architecture, or AI-enabled operations
  • Deep expertise in API design, OAuth/token-based authentication, webhooks, and event-driven systems
  • Proven experience building reliable automation workflows with observability, retries, and failure handling
  • Strong JavaScript and/or Python expertise for backend logic, scripting, and internal tooling
  • Solid SQL experience for data transformation, validation, and operational analytics (Snowflake a plus)
  • Advanced comfort with JSON, schemas, and data normalization for LLM and automation use cases
  • Hands-on experience running LLM-powered agents or automations in production, including guardrails and output validation
  • Formal experience in prompt engineering frameworks
  • Familiarity with integrating with CRM and support systems (e.g., Salesforce, Zendesk)
  • Track record designing end-to-end integration architectures across multiple platforms
Job Responsibility
  • Design and own the integration architecture that enables AI Agents to operate safely and reliably across Checkr systems and third-party platforms
  • Build production-grade API integrations with secure authentication flows, webhook and event-driven patterns, and robust automation workflows that coordinate actions across tools like Zendesk, Salesforce, and internal services
  • Ensure AI-driven operations are resilient through strong error handling, retries, observability, and fallback mechanisms
  • Proactively identify integration gaps, system bottlenecks, and failure modes, and lead the technical solutions that improve reliability and scale
  • Build and maintain the technical foundations that power AI-driven workflows, including structured data pipelines, normalized schemas, and predictable JSON inputs for LLMs
  • Write high-quality JavaScript and SQL to support automation logic, internal tooling, and operational insights
  • Establish engineering standards for workflow design, code quality, and system observability
  • Communicate architecture clearly through documentation and diagrams, and serve as a technical authority across teams to ensure consistent, high-quality execution
  • Implement guardrails, validation rules, and safety checks that ensure AI Agents act responsibly and accurately in production
  • Evaluate model output quality and continuously refine prompts, transformations, and logic to improve consistency, reliability, and trust in AI-driven decisions
What we offer
  • A fast-paced and collaborative environment
  • Learning and development allowance
  • Competitive cash and equity compensation, and opportunity for advancement
  • 100% medical, dental, and vision coverage
  • Up to $25K reimbursement for fertility, adoption, and parental planning services
  • Flexible PTO policy
  • Monthly wellness stipend
  • In-office perks are provided, such as lunch five times a week, a commuter stipend, and an abundance of snacks and beverages
  • A relocation stipend may be available for those willing to relocate to a Checkr hub location
Employment type: Full-time

Staff Customer Operations Engineer

At Cloudera, we empower people to transform complex data into clear and actionab...
Location: Israel
Salary: Not provided
Company: Cloudera
Expiration Date: Until further notice
Requirements
  • Valid Security Clearance (Level 3 or 2) is mandatory
  • Ability to pass customer Security Clearance and Vetting as needed
  • Enterprise Technical Support experience or equivalent troubleshooting skills
  • Experience with Unix or Linux environments
  • Applicants are required to read, write and speak the following languages: English, Hebrew
Job Responsibility
  • Providing remote technical support on break-fix issues and operational guidance for our internal and external customers through cases via Zoom, phone, and case updates
  • Practicing active listening to identify and understand customer issues at hand and current business impact, driving resolution with a sense of urgency
  • Learning and giving training on new technology and human skill sets
  • Working cross-functionally as Subject Matter Experts with other COEs, Engineering, Product Management, and Account teams to ensure speedy time to resolution exceeding our Service Level Agreement
  • Setting and influencing customer expectations clearly and concisely with a friendly and collaborative disposition to achieve the highest customer satisfaction
  • Providing proactive support by authoring and approving Knowledge Base articles and Community posts
  • Participating in the weekend and holiday on-call roster
What we offer
  • Generous PTO Policy
  • Unplugged Days to support work-life balance
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
Employment type: Full-time

Staff Security Software Engineer - Security Operations

The Role GM’s Cybersecurity Team safeguards the company’s global information ...
Location: United States, Austin
Salary: Not provided
Company: General Motors
Expiration Date: Until further notice
Requirements
  • 8+ years in software engineering with a focus on distributed systems, security integrations, and data platforms
  • Deep expertise building event-driven, horizontally scalable services and contract-first APIs
  • Track record productizing AI in security workflows (multi-agent patterns, RAG at scale, evaluation harnesses, guardrails, red-teaming)
  • Cloud architecture depth (Azure/AWS/GCP), including networking, Kubernetes, service meshes, observability stacks, and IaC at scale
  • Data platform expertise: streaming (Kafka/Event Hub/PubSub), vector/search (pgvector/FAISS/Pinecone), schema/versioning, governance/lineage
  • Demonstrated org-wide influence: authored standards, drove cross-team adoption, led multi-quarter programs to successful outcomes
  • Exceptional communication with executives; ability to frame risk, ROI, and tradeoffs succinctly
Job Responsibility
  • Set the reference architecture for security data integration and AI orchestration (agents, policy-guardrailed workflows, governance)
  • Lead cross-org programs that unify SIEM/EDR/IAM/SSPM/CSPM/ITSM/cloud data models and establish single sources of truth
  • Operationalize AI at scale with safety, privacy, and governance—including data retention, PII controls, model routing, evaluation, and fallback strategies
  • Drive cost/performance optimization (throughput, latency, storage tiering, vector index strategies) for high-volume security telemetry
  • Influence vendor strategy and negotiate integration roadmaps; guide build-vs-buy decisions and multi-year investments
  • Mentor/coach Staff/Senior engineers; build a culture of design excellence, pragmatic risk management, and measurable outcomes
  • Communicate upward with crisp executive narratives, metrics, and business impact framing
What we offer
  • Relocation benefits
Employment type: Full-time