CrawlJobs Logo

Staff Systems Engineer, Fault Management

kodiak.ai Logo

Kodiak Robotics

Location Icon

Location:
United States , Mountain View

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

200000.00 - 268000.00 USD / Year

Job Description:

The Systems and Safety Engineering team at Kodiak is seeking an experienced Systems Engineer to own the design and execution of Kodiak’s next-generation Autonomy Fault Management System. This individual will lead the effort end-to-end: from product and system requirement definition, through architecture and implementation, to verification and validation, and safety case integration. This leader will ensure that the Kodiak Driver handles onboard system faults with the desired, correct, safe response. This role is central to progressing towards achieving a scalable driverless deployment and will work closely with autonomy hardware, software, and system safety teams. This role directly shapes Kodiak’s ability to operate sustainably at commercial scale. Fault management is not only a safety system—it is a primary lever of fleet availability, utilization, and cost per mile. You will own the technical strategies that determine when the system can continue operating safely, when it must degrade, and when it must exit service.

Job Responsibility:

  • Lead the end-to-end development of the next generation of Autonomy Fault Management System, leading the collaborative effort across hardware, software, system safety, and operations teams
  • Own the systems and safety engineering execution for fault management across the full V-model lifecycle
  • Lead the development of systems engineering artifacts, including requirements, traceability, V&V plans, V&V evidence
  • Define and lead the fault management architecture and concept of operations, including detection, isolation, response, safe-state definition, and minimum risk conditions
  • Generate technical evidence in support of the adequacy, coverage, and sufficiency of the Fault Management System as an element of Kodiak’s Driverless Safety Case
  • Support quantitative and qualitative analyses used to set detection thresholds, prioritize hazards, and evaluate risk associated with fault responses and minimum risk maneuvers
  • Lead and influence system architecture trade studies that impact the fault coverage, system availability, safety risk, and operational continuity
  • Develop the strategy for managing system availability, degraded operation, and operational continuity through the Fault Management System
  • Quantify the commercial and safety impact of false positive and false negative detections
  • Provide analysis to support complex autonomy system design trade-offs to inform system design decisions affecting safety and performance
  • Serve as the technical leader to align cross-functional teams around a unified fault management strategy

Requirements:

  • B.S., M.S., or PhD in engineering or related technical field
  • 5+ years experience within real-time safety-critical applications, preferably in highly automated or autonomous systems (autonomous vehicles, aerospace, nuclear, medical, etc)
  • Experience with fault management, diagnostic development, safe state identification and development
  • Experience working with agile software engineering teams
  • Ability to read C/C++ code
  • Experienced in Systems Engineering V-model and application within product life cycle
  • Strong verbal and written communication skills
  • Ability to collaborate effectively with technical stakeholders spanning multiple technical disciplines
What we offer:
  • Competitive compensation package including equity and annual bonuses
  • Excellent Medical, Dental, and Vision plans through Kaiser Permanente, Cigna, and MetLife (including a medical plan with infertility benefits)
  • MetLife Legal Services, Identity & Fraud Protection, Hospital Indemnity Insurance, Accident Insurance, & Critical Illness Insurance
  • Flexible PTO, 10 paid holidays, and generous parental leave policies
  • Office perks: dog-friendly, free catered lunch, a fully stocked kitchen, and free EV charging
  • Long Term Disability, Short Term Disability, Life Insurance
  • Wellbeing Benefits - Headspace through Cigna, Calm through Kaiser, One Medical, Gympass, Spring Health through Cigna, Rula (mental health navigation)
  • Fidelity 401(k)
  • Commuter, FSA, Dependent Care FSA, HSA
  • Various incentive programs (referral bonuses, patent bonuses, etc.)

Additional Information:

Job Posted:
January 03, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff Systems Engineer, Fault Management

Systems Manager - CAFM - Facilities Management System

My client Require a Systems Manager ro work as a senior Mmeber of thier Estates ...
Location
Location
United Kingdom , Cambridgeshire
Salary
Salary:
Not provided
datacareers.co.uk Logo
DataCareers
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience of working with Estates information systems (CAFM)
  • Significant experience of working within technical or Estates management teams, on data and information related management
  • Proven experience managing large teams
  • Experience of managing process driven operations within an organisation
  • Experience of project planning and implementation
  • Experience of working with complex processes and creating process maps and drafting policies and processes
  • Change management experience
Job Responsibility
Job Responsibility
  • Act as main support for systems used by the Building Engineering Maintenance team such as CAFM system (Concept)
  • Maintain an electronic asset database as part of the asset management system
  • oversee the system software upgrades and additions if required
  • Fault finding
  • regular system back-up
  • Provide appropriate training for staff on systems and the effective use of the following modules (Planned and Reactive maintenance programme, Work orders / Help Desk module, Service contracts module and Asbestos management modules)
  • EnsurE the data is accurately recorded and evident to enable the safe, effective, and efficient control of Building Engineering Maintenance tasks, workflows, and workforce
  • Production of timely and accurate BEM data
  • Monthly reporting of key performance indicators
  • Processing of Freedom of Information requests, ERIC, Estates CAS Alert, etc. requirements
  • Fulltime
Read More
Arrow Right

Software Engineer Staff - Platforms/System Software Development

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science or a related field, with 5+ years of experience in platforms/system software development
  • Strong technical, analytical, and problem-solving skills
  • Proficiency in C, C++, embedded systems, and Linux kernel/driver development
  • Experience working close to hardware, including device drivers, system bring-up, and Linux/OS fundamentals
  • Expertise in troubleshooting and debugging complex issues
  • Knowledge and experience with firmware, optics, SPI, retimers, FPGA, CPLD, MDIO, Ethernet interfaces (10GE - 400GE), timing protocols (SyncE, PTP), SerDes, fabric management, and chassis management are preferred
  • Understanding of system hardware, including buses, hardware queues/FIFOs, interrupts, BIOS, PCIe, I2C, etc., is a plus
  • Experience with new hardware and/or ASIC bring-up
  • Experience designing fault-tolerant and resilient systems is highly desirable
  • Excellent debugging skills and experience with various software, hardware, and memory debugging tools
Job Responsibility
Job Responsibility
  • Work with cross-functional teams to develop detailed software functional specifications, articulate system/software architecture specifications, for product features, to meet product requirements
  • SW development in the areas of platform infrastructure, device drivers, kernel, chassis control, device management, link, and interface management
  • Carry out detailed design and implementation, unit testing, integration of packet forwarding, related device/kernel drivers, and other related software components for products and features
  • Work closely with system and solution test teams to ensure correct and complete verification of software and components, for the feature to meet real-life network deployments
  • Work closely with Juniper Technical Assistance Team, for providing engineering assistance in supporting critical customer escalations for customer deployments
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Credit Cards & Banking Infrastructure

Join us in building the future of finance. Our mission is to democratize finance...
Location
Location
United States , Menlo Park; New York
Salary
Salary:
169000.00 - 255000.00 USD / Year
robinhood.com Logo
Robinhood
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Drive to take your career — and the systems you build — to the next level
  • Ability to lead a small, high-caliber pod to deliver outsized impact
  • Hands-on leadership — you set direction, make the key calls, and build alongside your team
  • Proven success scaling complex, high-availability infrastructure in production
  • Breadth across infrastructure experience: DevOps, SRE, backend performance, developer experience, pure infra
  • Standout ability to structure initiatives for maximum business impact and aggressive sequencing
  • Creativity and expertise to deliver lean, cost-effective solutions without compromising quality
  • Strong judgment under pressure and skill at making high-impact technical decisions
  • Track record of building fault-tolerant distributed systems and modernizing release/deployment pipelines
  • Leadership skills with the ambition to grow into Senior Staff+ or management roles
Job Responsibility
Job Responsibility
  • Lead high-stakes scaling efforts to support millions more customers without breaking stride
  • Set and drive the long-term technical strategy for Robinhood Banking Infrastructure
  • Determine which parts of our infrastructure to keep, which to evolve, and lead those transitions from the front
  • Design bulletproof solutions for mission-critical systems that require extreme availability (think: transaction authorization)
  • Establish and implement best-in-class practices for load testing, canary releases, recovery tooling, and deployment pipelines
  • Drive stability and scalability across all layers — databases, compute, deployments, and beyond
  • Reduce operational friction by improving oncall, crafting elegant protocols, and streamlining incident response
  • Invest in developer experience so the whole ENG team can iterate faster with less risk
  • Anticipate scaling bottlenecks before they appear and take decisive action to stay ahead
What we offer
What we offer
  • Performance driven compensation with multipliers for outsized impact, bonus programs, equity ownership, and 401(k) matching
  • 100% paid health insurance for employees with 90% coverage for dependents
  • Lifestyle wallet - a highly flexible benefits spending account for wellness, learning, and more
  • Employer-paid life & disability insurance, fertility benefits, and mental health benefits
  • Time off to recharge including company holidays, paid time off, sick time, parental leave, and more
  • Exceptional office experience with catered meals, events, and comfortable workspaces
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Justice

The Justice and Corrections team is building new solutions to improve efficiency...
Location
Location
United States , Seattle
Salary
Salary:
168750.00 - 270000.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of experience in software engineering
  • Experience driving technical strategy with impact across multiple teams
  • Expertise in building large scale cloud services, including deep knowledge with different database technologies such as SQL, NoSQL, and queuing systems
  • Experience designing and delivering high-availability, scalable cloud-based systems
  • Backend cloud experience in managed languages such as Java, Scala, Go, C#, or similar
  • Frontend experience with JavaScript, Typescript, React, Angular or similar technologies
  • Experience working with containerization and orchestration technologies (Docker, Kubernetes, etc)
  • Proven ability to work side by side with product managers and other engineers to create a technical backlog to build product experiences that customers love
  • A passion for quality design, code, tools and architectures
Job Responsibility
Job Responsibility
  • Lead by example, drive major projects from design to production
  • Set a high bar on standards of excellence, provide deep technical insights during architecture design reviews
  • Effectively mentor and grow engineers into the Senior and Staff levels
  • Obsess over customers, works backward from customers to build the right solutions
  • Build services that adhere to high bar on availability, performance, and security in mission-critical space
  • Maintain curiosity and desire to stay up to speed on latest technologies and frameworks
  • Collaborate with other Staff and Senior Engineers to shape and drive the technical direction for the team
  • Take a holistic approach to ensuring our products meet high standards including fault-tolerance, scalability, performance, etc
  • Join forces with product and design to imagine and deliver innovative solutions to key customer problems
  • Coach and mentor the next set of engineering leaders
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Staff Software Engineer (Back End)

As a Staff Software Engineer on our Runtime team, you will work directly with Vi...
Location
Location
United States , Los Angeles
Salary
Salary:
150000.00 - 205000.00 USD / Year
viantinc.com Logo
Viant
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional experience in the software industry
  • Bachelor’s degree in Computer Science, Mathematics, or a related field (Master’s degree preferred)
  • Proficiency in GoLang
  • Expertise in designing, building, and delivering high-quality, scalable, and high-performance distributed systems, including developing detailed design specifications
  • Experience with relational databases (e.g., MySQL, Oracle, or SQL Server), including writing SQL
  • Familiarity with cloud-based technologies, especially AWS (e.g., EC2, S3, Lambda, RDS) and GCP
  • Knowledge of multithreading, concurrency, and low-latency application development
  • Proven ability to identify and resolve project issues effectively and in a timely manner
Job Responsibility
Job Responsibility
  • Work directly with Viant’s ad server, managing a high-volume, low-latency application that processes billions of unique requests a day
  • Work with state-of-the-art technology as part of bid optimization, dynamic performance optimization, as well as develop a self-diagnostic and healing architecture
  • Deliver simple, elegant, scalable, extensible solutions that strike the right balance between short-term and long-term considerations
  • Design, develop, maintain and trouble software features with an eye on cost, scalability, performance and fault-tolerance
  • Participate in design discussions, code reviews and bug triage / Root Cause Analysis (RCA) sessions
  • Be the technical lead of projects that involve 2-3 other engineers
  • Collaborate with Product, other Engineering and Technical operations teams on the entire lifecycle from ideation to productization and beyond
  • Drive the development and continuous improvement of existing projects while shaping the vision and scope of future initiatives
  • Collaborate with internal teams to design, build, and refine a highly reliable, highly available, low-latency runtime platform
  • Own and provide technical leadership for multiple subsystems of the runtime platform, ensuring scalability, performance, and fault tolerance
What we offer
What we offer
  • Fully paid health insurance
  • Paid parental leave
  • Unlimited PTO
Read More
Arrow Right

Software Engineer Staff

We are seeking a talented and motivated Staff Software Engineer to join our dyna...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum of 10 years of professional software development experience
  • Proven expertise in one or more backend programming languages such as Golang (highly preferred), Java, Python, or C/C++
  • Deep understanding of networking protocols, network architectures, network security, and common networking concepts
  • Proven experience in designing, building, and deploying scalable microservices using Docker, Kubernetes, etc.
  • Significant experience in building, deploying, and operating scalable SaaS applications in a Public Cloud (AWS/GCP) environment
  • Strong understanding of distributed systems principles, including concurrency, scalability, fault tolerance, and consistency
  • Experience with various database technologies, including relational (e.g., PostgreSQL, MySQL) and NoSQL (e.g., DynamoDB, Redis) databases
  • Experience designing, building, and consuming RESTful APIs and other integration technologies like WebSocket, Kafka, etc.
  • Experience with network security principles, threat modelling, and secure coding practices is an added advantage
  • Excellent analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Technical Leadership: Work with product managers, architects, and other engineers to understand the software requirements, and define corresponding functional and design specifications
  • Software Development: Design, develop, test, deploy, and maintain high-quality, production-grade software, with a strong emphasis on backend systems
  • System Design & Optimization: Design and implement micro-services for high availability, scalability, performance, and security within our SaaS platform
  • Networking Expertise: Apply deep knowledge of networking protocols (e.g., TCP/IP, HTTP/S, DNS, NAT), network security, and cloud networking concepts to build robust and secure solutions
  • SaaS & Cloud Native Development: Design and implement solutions leveraging cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes, Docker)
  • Collaboration: Collaborate effectively with cross-functional teams including product management, QA, SRE, and Juniper technical assistance team
  • Code Quality & Best Practices: Champion best practices in software development, including code reviews, testing methodologies, CI/CD, and DevOps principles
  • Problem Solving: Troubleshoot and resolve complex technical issues in a timely and effective manner, often in production environments
  • Innovation & Research: Stay abreast of emerging technologies and industry trends in networking, SaaS, and software engineering
  • Documentation: Create and maintain comprehensive technical documentation for designs, APIs, and operational procedures
What we offer
What we offer
  • Health & Wellbeing: Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Personal & Professional Development: Programs catered to helping you reach any career goals
  • Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness
  • Fulltime
Read More
Arrow Right

Software Engineer Staff

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum of 10 years of professional software development experience
  • Proven expertise in one or more backend programming languages such as Golang (highly preferred), Java, Python, or C/C++
  • Deep understanding of networking protocols, network architectures, network security, and common networking concepts
  • Proven experience in designing, building, and deploying scalable microservices using Docker, Kubernetes, etc.
  • Significant experience in building, deploying, and operating scalable SaaS applications in a Public Cloud (AWS/GCP) environment
  • Strong understanding of distributed systems principles, including concurrency, scalability, fault tolerance, and consistency
  • Experience with various database technologies, including relational (e.g., PostgreSQL, MySQL) and NoSQL (e.g., DynamoDB, Redis) databases
  • Experience designing, building, and consuming RESTful APIs and other integration technologies like WebSocket, Kafka, etc.
  • Experience with network security principles, threat modelling, and secure coding practices is an added advantage
  • Excellent analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Technical Leadership: Work with product managers, architects, and other engineers to understand the software requirements, and define corresponding functional and design specifications
  • Software Development: Design, develop, test, deploy, and maintain high-quality, production-grade software, with a strong emphasis on backend systems
  • System Design & Optimization: Design and implement micro-services for high availability, scalability, performance, and security within our SaaS platform
  • Networking Expertise: Apply deep knowledge of networking protocols (e.g., TCP/IP, HTTP/S, DNS, NAT), network security, and cloud networking concepts to build robust and secure solutions
  • SaaS & Cloud Native Development: Design and implement solutions leveraging cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes, Docker)
  • Collaboration: Collaborate effectively with cross-functional teams including product management, QA, SRE, and Juniper technical assistance team
  • Code Quality & Best Practices: Champion best practices in software development, including code reviews, testing methodologies, CI/CD, and DevOps principles
  • Problem Solving: Troubleshoot and resolve complex technical issues in a timely and effective manner, often in production environments
  • Innovation & Research: Stay abreast of emerging technologies and industry trends in networking, SaaS, and software engineering
  • Documentation: Create and maintain comprehensive technical documentation for designs, APIs, and operational procedures
What we offer
What we offer
  • Health & Wellbeing: Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Personal & Professional Development: Specific programs catered to helping you reach any career goals you have
  • Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness
  • Fulltime
Read More
Arrow Right

Staff Software Engineer

This SaaS product connects millions of JVM runtimes, collects and aggregates det...
Location
Location
Serbia , Belgrade
Salary
Salary:
Not provided
azul.com Logo
Azul Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in Java/Kotlin covering technical architecture, algorithms, design, network management, application development, middleware, AWS/GCP, RDBMS, NoSQL, messaging
  • 5+ years of experience in one or more of the following areas: scalable distributed systems, cloud optimizations and costs, monitoring and alerting, reliable and fault-tolerant systems with performance in mind
  • Experience as an architect or technical lead with customer-facing large-scale products
  • Passionate about simplicity and efficiency, hate for complexity
  • Strong technical problem-solver
  • Positive, enjoys collaborating and communicating with others
  • Experienced in communicating and working across functions to drive solutions
  • Holds BS/MS degree in Computer Science, Engineering, Mathematics or a related field or equivalent experience
Job Responsibility
Job Responsibility
  • Implement new features, fix issues and perform code reviews in Java
  • Participate in designs and architecture decisions
  • Provide unique insights into cloud architecture
  • Translation of complex functional, technical, and business requirements into designs
  • Understanding risk-driven/spiral development approach and enforcing proofs-of-concept and prototypes to validate and compare design alternatives
  • Performing cost/benefit and trade-off analyses of design alternatives
  • Defining high-level development tasks, providing estimates, and identifying skills necessary for implementation
  • Recommending strategies for SaaS monitoring, performance improvements, and capacity planning
  • Being a charismatic team player with exceptional collaboration and communication skills
  • Driving the team's goals & technical direction to pursue opportunities that make the larger organization more efficient
What we offer
What we offer
  • Equity Program
  • Annual bonus based on company performance
  • Referral Program
  • IT Equipment - MacBook Pro or any other HW according to your preferences
  • Work-life balance - 5 weeks of holidays, 5 sick days, flexible working hours, 100% work from home also possible
  • Offices in Belgrade City Centre - if you prefer
  • Work with top experts worldwide who contribute to the Java ecosystem
  • Fulltime
Read More
Arrow Right