CrawlJobs Logo

Staff Engineer, Software Reliability Engineering

India, Bengaluru · Job Posted February 23, 2026
Apply Position
Job Link Share

Job Description

We are seeking a Staff Engineer to join our dynamic team in Bengaluru, India. In this role, you will lead the development of innovative and scalable applications, driving technical excellence and best practices within our organization.

Job Responsibility

  • Architect, design, and implement high-performance, scalable test suite for Reliability testing
  • Collaborate with cross-functional teams to define and implement new features and products
  • Lead code reviews and provide mentorship to junior developers
  • Optimize test performance and ensure high-quality, efficient code
  • Troubleshoot and resolve complex technical issues
  • Stay current with emerging technologies and industry trends, recommending improvements to our technology stack
  • Contribute to the development of technical standards and best practices
  • Participate in Agile ceremonies and help drive continuous improvement in our development processes

Requirements

  • Bachelor's degree in CSE or ECE or EEE, Software Engineering, or related field
  • Master's degree preferred
  • 5 years of software development experience of python scripting and test case development
  • Advanced proficiency in programming languages such as Java, Python, or C++
  • Proficient in version control systems, preferably GitHub
  • Solid understanding of software architecture and design patterns
  • Experience with API development and integration
  • Strong skills in performance optimization and debugging
  • Experience with Agile methodologies and full software development lifecycle
  • Excellent problem-solving and analytical skills
  • Strong communication and teamwork abilities
  • Should take up responsibilities and must be self-driven

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Engineer, Software Reliability Engineering

8 matching positions

Staff Engineer – Reliability Engineering

At GEICO, we offer a rewarding career where your ambitions are met with endless ...
Location
Location
United States , Bethesda, MD; Seattle, WA
Salary
Salary:
115000.00 - 230000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in at least two modern programming languages (Go, Python, Java, .NET) and object-oriented design
  • Advance knowledge of web technologies such as HTML, CSS, JavaScript is preferred
  • Understand open-source databases like MySQL, PostgreSQL, etc., familiar with No-SQL databases like ONgDB, Cassandra, MongoDB, Elasticsearch, etc.
  • Deep hands-on experience in complex system design and data pipeline and architectures, scale and performance, tuning, with good knowledge of Docker and Kubernetes
  • Hands-on experience with major cloud platforms (Azure, AWS, GCP) or large-scale private data center environments
  • Experience managing distributed systems in public, private or hybrid cloud environments
  • Experience with monitoring, logging and observability tools (Prometheus, Grafana, Open Telemetry)
  • Passion for automation and reducing manual operations using tools like Terraform and Ansible
  • Familiarity with configuration management and orchestration tools like Helm, Puppet, Spinnaker
  • Experience with CI/CD pipelines, Infrastructure as Code(IaC), and cloud-based deployments
Job Responsibility
Job Responsibility
  • Focus on multiple areas and provide strategic and technical guidance
  • Utilize programming languages like Go, Python, Java, .Net or other object-oriented languages, SQL, and NoSQL databases
  • Work with container orchestration tools such as Docker and Kubernetes (K8S), OpenStack and a variety of Azure tools and services
  • Architect and develop cloud-native applications using Azure Services
  • Collaborate with product managers, team members, customers, and other engineering teams to solve our toughest problems
  • Ensure the quality, performance and usability of the engineering solutions
  • Serve as a mentor and thought leader, coaching engineers and Influence and educate executives
  • Drive best practices for platform reliability, disaster recovery, monitoring, alerting, and incident management
  • Collaborate with cross-functional teams (Platform engineering, DevOps, SREs) to integrate, test, and improve platform reliability and performance
  • Determine and support resource requirements, evaluate operational processes, measure outcomes to ensure desired results, demonstrate adaptability and sponsor continuous learning
What we offer
What we offer
  • Market-competitive compensation
  • 401K savings plan vested from day one with 6% match
  • Performance and recognition-based incentives
  • Tuition assistance
  • Mental healthcare
  • Fertility and adoption assistance
  • Workplace flexibility
  • GEICO Flex program (ability to work from anywhere in the US for up to four weeks per year)
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Reliability

The AV platform team develops the first layers of software on the GM Autonomous ...
Location
Location
United States , Austin; Mountain View
Salary
Salary:
160200.00 - 290700.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience professional experience with multi-sensor system services and frameworks
  • Bachelors Degree in relevant field or relevant work experience
  • Proven experience writing production software to improve data quality and reliability of safety critical systems including root cause and corrective actions
  • Proficiency with C++11 or later and Python
  • Proficiency in debugging and troubleshooting firmware-related issue
  • Experience driving complex embedded software projects through the full lifecycle of product development
  • Experience architecting and delivering Embedded Systems solutions that support multiple generations of the product
  • Experience engaging in communication at senior management levels and influencing technical strategies
  • Experience applying and mentoring team members on software development best practice
  • Clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Collaborate with hardware, systems engineering, program management, product management and peer software teams to develop critical reliability software features for the autonomous vehicle
  • Root-cause analysis of complex problems involving multiple cross-functional partners, including hardware and software
  • Identify reliability issue trends, provide clear guidance on reliability requirements, develop reliability design guidelines, and apply lessons learned to enable continuous improvement
  • Design and implement shared infrastructure and tooling among the AV Platform teams to monitor and analyze embedded software and data quality metrics
  • Own the development quality and ensure the solutions are scalable, secure, and optimized for customer experience and performance
  • Partner with cross-functional teams to architect and implement embedded software observability and monitoring solutions
  • Work with the engineering teams to architect and build services to simplify troubleshooting and operational response to incidents and Autonomous Vehicles fleet outages
  • Own technical projects, participate in design reviews and provide input for the reliability section of others’ design reviews
  • Ensure efficiency of the vehicle change process involving embedded software changes and dependencies
  • Participate in on-call rotation
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Senior Staff Engineer, Software Engineering

We are seeking a highly accomplished Senior Staff Engineer to join our engineeri...
Location
Location
United States , Chevy Chase, MD; Palo Alto, CA; Seattle, WA
Salary
Salary:
130000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep expertise in infrastructure systems, including compute platforms (Kubernetes, Docker, cloud services), networking, and storage
  • Strong database experience across relational databases (PostgreSQL, MySQL) and NoSQL solutions (MongoDB, Cassandra, Redis, DynamoDB)
  • Demonstrated experience applying AI to solve real-world problems in production environments
  • Expert-level proficiency in at least two programming languages (e.g., Python, Java, Go, Rust)
  • Experience designing and building distributed systems at scale
  • Strong understanding of cloud platforms (Azure OR AWS) and infrastructure-as-code practices
  • Hands-on experience with CI/CD pipelines, build systems, and deployment automation (e.g., GitHub Actions, Jenkins, Azure DevOps, ArgoCD)
  • Background in building real-time data processing systems (Kafka, Flink, Spark)
  • Excellent communication skills with the ability to articulate complex technical concepts to diverse audiences
  • Experience working in a platform engineering team, building internal developer platforms or shared infrastructure services
Job Responsibility
Job Responsibility
  • Define and drive the technical vision for infrastructure and AI-powered systems across the organization
  • Design, architect, and implement highly scalable, fault-tolerant distributed systems
  • Lead technical decision-making on critical projects, balancing short-term needs with long-term sustainability
  • Establish and champion engineering best practices, design patterns, and coding standards
  • Architect and optimize compute infrastructure for performance, reliability, and cost efficiency
  • Design and implement database solutions (relational and NoSQL) that scale to meet business demands
  • Drive cloud infrastructure strategy, including containerization, orchestration, and serverless architectures
  • Ensure system reliability, observability, and operational excellence across all platform components
  • Identify and prioritize opportunities to apply AI/ML to solve high-impact business problems
  • Stay current with emerging AI technologies and evaluate their applicability to business challenges
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Senior Staff Engineer, Software Engineering

Our Senior Staff Engineer works with our Staff and Sr. Engineers to innovate and...
Location
Location
United States , Chevy Chase; Austin; Richardson; Seattle; Palo Alto
Salary
Salary:
110000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exemplary ability to design, perform experiments, and influence engineering direction and product roadmap
  • Experience partnering with engineering teams and transferring research to production
  • Track-record of publications history in credible conferences and journals
  • Experience with continuous delivery and infrastructure as code
  • In-depth knowledge of CS data structures and algorithms
  • Experience solving analytical problems with quantitative approaches
  • Ability to excel in a fast-paced, startup-like environment
  • Knowledge of developer tooling across the software development life cycle (task management, source code, building, deployment, operations, real-time communication)
  • Fluency and Specialization with at least two modern languages such as Go, Java, C++, Python or C# including object-oriented design
  • Experience with Microservices oriented architecture and extensible REST APIs
Job Responsibility
Job Responsibility
  • Focus on multiple areas and provide technical and thought leadership to the enterprise
  • Collaborate with product managers, team members, customers, and other engineering teams to solve our toughest problems
  • Develop and execute technical software development strategy for a variety of domains
  • Accountable for the quality, usability, and performance of the solutions
  • Utilize programming languages like Python, C# or other object-oriented languages, SQL, and NoSQL databases, Container Orchestration services including Docker and Kubernetes, and a variety of Azure tools and services
  • Be a role model and mentor, helping to coach and strengthen the technical expertise and know-how of our engineering and product community. Influence and educate executives
  • Consistently share best practices and improve processes within and across teams
  • Analyze cost and forecast, incorporating them into business plans
  • Determine and support resource requirements, evaluate operational processes, measure outcomes to ensure desired results, and demonstrate adaptability and sponsoring continuous learning
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Reliability Staff Software Engineer - OpenSearch

We're seeking a skilled Staff Software Engineer with leadership ambition, to joi...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
optimizely.com Logo
Optimizely
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree (Computer Science or engineering preferred) or equivalent work experience
  • Significant experience designing, implementing, and maintaining SaaS with high traffic load
  • Several years of experience directly managing scalable and reliable Elasticsearch and/or Opensearch clusters
  • Experience with TypeScript, JavaScript, C#
  • Experience with GraphQL, REST
  • Experience with Cloudflare workers, Kubernetes
  • Experience with OpenSearch
Job Responsibility
Job Responsibility
  • Architect, implement, and optimize Opensearch indexing and query pipelines for scalability and reliability
  • Design and maintain backup, disaster recovery, and failover strategies for Opensearch clusters
  • Lead root cause analysis and resolution of complex search-related incidents and performance bottlenecks
  • Drive automation for cluster provisioning, upgrades, and configuration management (e.g., with Terraform, Ansible, or Kubernetes)
  • Mentor engineers on Opensearch internals, query optimization, and troubleshooting
  • Collaborate with product and engineering teams to translate business requirements into robust search features
  • Own capacity planning and cost optimization for search infrastructure
  • Author technical documentation and best practices for search development and operations
Read More
Arrow Right

Staff Software Engineer, Production Engineering

Engineering at Uber means building for real-world impact under real-world constr...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in Go, Java, Python, or similar language
  • Experience in delivering solutions end-to-end from defining problems to generating architecture plans, implementation, testing, and delivery
  • Writes clear technical proposals and RFCs
  • able to drive engineering alignment across teams through written design docs and verbal discussion
Job Responsibility
Job Responsibility
  • Design, build, and maintain software to increase the reliability, scalability, and efficiency of thousands of stateless and stateful production services spread across multiple datacenter zones and regions
  • Lead initiatives end-to-end within the team, the Production Engineering org, and across engineering at large to increase reliability through automation, setting standards, developer tooling, and reusable frameworks
  • Work with other engineers to deeply understand their services and guide them towards practical and reliable architecture and implementation
  • Apply SRE concepts such as observability, integration/load/chaos testing, on-call, incident management, failovers, and disaster recovery to improve mean time between failures (MTBF), time to detection (TTD), and time to mitigation (TTM) of incidents
  • Participate in on-call rotations, responding to and leading mitigation of production incidents, and driving post-incident improvements
What we offer
What we offer
  • Uber's bonus program
  • equity award
  • 401(k) plan
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Production Engineering

Delivery Production Engineering is a software engineering team, not a traditiona...
Location
Location
United States , San Francisco
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in Go, Java, Python, or similar language
  • Experience in delivering solutions end-to-end from defining problems to generating architecture plans, implementation, testing, and delivery
  • Writes clear technical proposals and RFCs
  • able to drive engineering alignment across teams through written design docs and verbal discussion
Job Responsibility
Job Responsibility
  • Design, build, and maintain software to increase the reliability, scalability, and efficiency of thousands of stateless and stateful production services spread across multiple datacenter zones and regions
  • Lead initiatives end-to-end within the team, the Production Engineering org, and across engineering at large to increase reliability through automation, setting standards, developer tooling, and reusable frameworks
  • Work with other engineers to deeply understand their services and guide them towards practical and reliable architecture and implementation
  • Apply SRE concepts such as observability, integration/load/chaos testing, on-call, incident management, failovers, and disaster recovery to improve mean time between failures (MTBF), time to detection (TTD), and time to mitigation (TTM) of incidents
  • Participate in on-call rotations, responding to and leading mitigation of production incidents, and driving post-incident improvements
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible to participate in a 401(k) plan
  • Various benefits
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Data Engineering

You'll own Gamma's data infrastructure and architecture as we scale to hundreds ...
Location
Location
United States , San Francisco
Salary
Salary:
230000.00 - 310000.00 USD / Year
gamma.app Logo
Gamma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience as a data engineer or software engineer working on data infrastructure with deep expertise in distributed systems
  • Expert-level knowledge of event streaming platforms, especially Apache Kafka (producers, consumers, Kafka Connect, stream processing)
  • Extensive hands-on experience with Snowflake, including performance optimization, cost management, and data modeling at massive scale
  • Strong understanding of relational databases (particularly Postgres) and experience with CDC patterns and replication strategies in distributed environments
  • Proven track record architecting and leading major data infrastructure initiatives that handled orders of magnitude growth
  • Experience establishing data engineering best practices and driving technical strategy across organizations
  • Strong communication skills and experience influencing technical direction across engineering, analytics, and leadership
Job Responsibility
Job Responsibility
  • Own and evolve our end-to-end event pipeline architecture, from Kafka ingestion through Snowflake analytics, setting technical direction for data infrastructure
  • Design and architect distributed data systems that scale to orders of magnitude more data volume while maintaining world-class query performance
  • Lead initiatives to build and optimize CDC (change data capture) pipelines and streaming data transformations at massive scale
  • Establish best practices for data quality, pipeline reliability, and system observability across the organization
  • Drive strategic technical decisions about data modeling, infrastructure architecture, and technology choices
  • Mentor engineers and elevate data engineering practices across analytics, product, and engineering teams
What we offer
What we offer
  • competitive equity
  • Fulltime
Read More
Arrow Right