CrawlJobs Logo

Staff Software Developer, Search & Distributed Systems

Canada, Toronto Employment contract 147000.00 - 220000.00 CAD / Year · Job Posted May 05, 2026
Apply Position
Job Link Share

Job Description

If you are looking for a career at a dynamic company with a people-first mindset and a deep culture of growth and autonomy, ACV is the right place for you! Competitive compensation packages and learning and development opportunities, ACV has what you need to advance to the next level in your career. We will continue to raise the bar every day by investing in our people and technology to help our customers succeed. We hire people who share our passion, bring innovative ideas to the table, and enjoy a collaborative atmosphere. Who we are: ACV is a technology company that has revolutionized how dealers buy and sell cars online. We are transforming the automotive industry. ACV Auctions Inc. (ACV), has applied innovation and user-designed, data driven applications and solutions. We are building the most trusted and efficient digital marketplace with data solutions for sourcing, selling and managing used vehicles with transparency and comprehensive insights that were once unimaginable. We are disruptors of the industry and we want you to join us on our journey. Our network of brands include ACV Auctions, ACV Transportation, ClearCar, MAX Digital and ACV Capital within its Marketplace Products, as well as, True360 and Data Services. At ACV we focus on the Health, Physical, Financial, Social and Emotional Wellness of our Teammates and to support this we offer: Company Sponsored (paid) Healthcare, Dental, Vision, Life/AD&D, Short-Term and Long-Term Disability. Comprehensive additional optional benefits such Critical Illness and Supplemental Life/AD&D. All insurance benefits go into effect on your date of hire. Generous Parental Leave Top-Up Pay and Vacation Programs. Employee Stock Purchase Program with additional opportunities to earn stock in the company. Retirement planning through the Company's RRSP. The Role We are looking for a Staff Software Engineer who would thrive on being accountable for our Search infrastructure: its scalability, reliability, and data resiliency. We don't just need someone who knows how to write a complex query; we need a battle-scarred Distributed Systems expert who understands the deep internals of Elasticsearch and who has a deep toolbox for analyzing, monitoring, alerting, and quickly resolving critical issues as they arise. You know exactly how Elasticsearch fails, why it fails under load, and how to architect a topology that prevents it. Because our search ecosystem doesn't exist in a vacuum, you will also own the architectural connective tissue—ensuring our service layers and event-based ecosystem interact with Search flawlessly. As a Staff Engineer, you will set the technical standard, drive systemic reliability, and mentor senior engineers across the organization.

Job Responsibility

  • Architect for Scale: Design, configure, and scale our Elasticsearch clusters. You will define our global strategies for shard routing, Index Lifecycle Management (ILM), heap tuning, and data tiering to support massive auction throughput
  • Master the Failure Modes: Anticipate and engineer away points of failure. You will design circuit breakers, implement backpressure mechanisms, and tune asymmetric timeouts to prevent retry storms between our BFFs, K8s services, and the Search layer
  • Expert Troubleshooting & IR: Act as the ultimate technical escalation point for complex, cross-system performance degradation. You will dive deep into JVM metrics, Garbage Collection pauses, K8s network bottlenecks, and slow logs to uncover and remediate root causes
  • Holistic System Ownership: Manage the entire data lifecycle. You will optimize the ingestion pipelines syncing our event datastreams driven by producers and consumers (Kafka) to Elasticsearch, ensuring eventual consistency and data integrity at scale
  • Drive Engineering Excellence: Draft authoritative architectural Blueprints, SOPs, and Runbooks. You will elevate the surrounding engineering culture by coaching teams on distributed systems design, observability best practices, and incident management
  • Modernize & Innovate: Scan the horizon for emerging technologies. You will help evaluate and integrate next-generation search capabilities (e.g., Vector Search, RAG architectures) to support our broader AI and machine learning initiatives

Requirements

  • 8+ years of software engineering experience, with at least 3+ years operating at a Senior or Staff level focusing on distributed systems and high-throughput platforms
  • Deep, authoritative knowledge of Elasticsearch internals. You have managed large-scale clusters and deeply understand mapping, analysis, query optimization, cluster state management, and split-brain mitigation
  • Proficiency in the systems upstream and downstream of Search. You have hands-on experience with Kubernetes (EKS/GKE), API Gateway/BFF architectures, and event streams (Kafka)
  • A proven track record of implementing fault-tolerant patterns (retries, rate limiting, circuit breaking, dead letter queues) in microservice architectures
  • Expert-level ability to instrument systems and diagnose complex performance issues using modern observability stacks (Datadog, Prometheus, Grafana, OpenTelemetry)
  • Strong communication skills with a proven ability to influence cross-functional teams, build consensus around architectural decisions (the Knoster model!), and mentor mid-level and senior engineers

Nice to have

  • Experience with Infrastructure as Code (Terraform, Helm) for stateful applications
  • Familiarity with FinOps practices, specifically optimizing Elasticsearch compute and storage costs
  • Experience integrating AI-assisted development tools into your daily workflow

What we offer

  • Company Sponsored (paid) Healthcare
  • Dental
  • Vision
  • Life/AD&D
  • Short-Term and Long-Term Disability
  • Comprehensive additional optional benefits such Critical Illness and Supplemental Life/AD&D
  • Generous Parental Leave Top-Up Pay and Vacation Programs
  • Employee Stock Purchase Program with additional opportunities to earn stock in the company
  • Retirement planning through the Company's RRSP

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Software Developer, Search & Distributed Systems

8 matching positions

Staff Software Engineer, Search Platform

As a vital member of the Search PlatformTeam, you will be part of a specialized ...
Location
Location
Singapore; China , Singapore; Shanghai
Salary
Salary:
Not provided
airwallex.com Logo
Airwallex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • More than 7 years of back-end development experience
  • Have experience in developing large-scale distributed systems
  • Proficient in coding and scripting languages (Java(Kotlin) /C++ /Python, etc.) with strong software and system design abilities
  • Deep familiarity with the standard library, idiomatic usage, and best practices of your primary programming languages
  • Able to write clear, maintainable, and efficient code
  • In depth knowledge on storage & streaming over PostgreSQL or Kafka
Job Responsibility
Job Responsibility
  • Work closely with Product Managers to analyze the product requirements and then produce the technique solutions & execution plan to deliver the software products
  • Hands-on design, implement and deliver production-grade streaming ingestion using Flink (or similar technologies), focusing on low-latency, high-throughput, and fault-tolerant design
  • Tackle challenging problems in timely computation, stateful stream processing, partitioning, and resilience
  • Proactively troubleshooting and addressing technical bottlenecks
  • Participate in and contribute to critical code, design, and performance reviews, raising the technical bar across the team
  • Engage with the Technical leads in building a backlog that continuously contributes to the execution of the roadmap
  • Collaborate with local/global engineering teams, infrastructure teams, and product development teams and translate business requirements into robust engineering solutions
  • Fulltime
Read More
Arrow Right

Staff Software Engineer – Discrete Event Simulation & Route Optimization

The Autonomous Robotics Center (ARC) is a multidisciplinary organization develop...
Location
Location
United States , Austin, Texas; Mountain View, California; Warren, Michigan
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's degree in Computer Science, Computer Engineering, Electrical Engineering, Operations Research, Applied Mathematics, or a related field
  • PhD or equivalent experience preferred
  • 10+ years of software engineering experience with a strong focus on algorithms, simulation, or optimization
  • Strong skills in Python, C++, C#, or similar languages, with a track record of shipping production-quality software
  • Deep experience implementing and optimizing shortest-path and routing algorithms (e.g., Dijkstra, A*, flows, matchings, search on large graphs)
  • Solid expertise in probability, statistics, and stochastic processes applied to modeling and simulation (e.g., Monte Carlo simulation)
  • Experience running large batches of simulations or distributed experiments (cloud or on-prem)
  • Excellent communication and collaboration skills, with a history of influencing architecture and technical direction
  • Practical experience with simulation frameworks (e.g., SimPy, AnyLogic, Arena, or custom DES frameworks)
Job Responsibility
Job Responsibility
  • Design and implement core algorithms for discrete event simulation, scheduling, routing, graph-based modeling, and system optimization
  • Build and maintain DES models of complex systems (e.g., production lines, AMR flows, logistics networks), including event logic, resources, and KPIs
  • Build and run large-scale simulation experiments and translate results into actionable recommendations for throughput, cost, and reliability improvements
  • Develop and maintain core data structures and services for maps, graphs, and spatial databases
  • Develop production-quality software (primarily in Python and C#) and expose capabilities via stable APIs and internal tools
  • Partner with cross-functional teams (manufacturing, robotics, data, and platform engineering) to connect models with real-world telemetry and digital twins
  • Define and champion best practices for algorithm design, simulation modeling, testing, and observability
  • Mentor engineers and provide technical leadership on algorithms, modeling, and software design
What we offer
What we offer
  • Relocation benefits
  • Fulltime
Read More
Arrow Right

Senior Staff Software Engineer - AI

GEICO is seeking an experienced Engineer with a passion for building high-perfor...
Location
Location
United States , Seattle, WA; Austin, TX; Palo Alto, CA; Chicago, IL; Dallas, TX
Salary
Salary:
110000.00 - 230000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience building and deploying ML systems in production with cross-functional engineering teams
  • Fluency in at least two modern languages such as Python, Go, Java, C++, or C# including object-oriented design
  • Experience architecting multi-component ML platforms using open-source/cloud-agnostic components: Datastores: PostgreSQL, NoSQL (MongoDB, Cassandra, CosmosDB) Streaming: Kafka, Flink, or Spark Streaming
  • Experience with end-to-end ML lifecycle: version control, CI/CD, Kubernetes, testing, monitoring, and production support
  • Experience with cloud providers (Azure, AWS or GCP) in production ML environments
  • Experience with observability tools and distributed systems monitoring, logging, tracing, and root cause analysis
  • Experience building multi-agent systems using LLMs and agentic frameworks (e.g., LangChain, LangGraph, AutoGen, Semantic Kernel, CrewAI)
  • Hands-on experience with RAG, semantic search, and vector databases (e.g., Milvus, pgvector, Qdrant, ElasticSearch)
  • Experience designing human-in-the-loop workflows and safety controls for autonomous systems
  • Strong architecture and design skills with ability to influence technical direction and roadmap
Job Responsibility
Job Responsibility
  • Design and build a multi-agent AI platform where specialized agents autonomously detect, diagnose, and resolve issues through agent-to-agent (A2A) collaboration
  • Develop intelligent agents using LLMs and agentic frameworks that coordinate detection, diagnostic, remediation, and knowledge tasks with minimal human intervention
  • Define agent interaction protocols, A2A communication standards, and evaluation frameworks for agent decision quality and autonomous action safety
  • Architect vector database solutions (Milvus, pgvector, Qdrant) for semantic search and RAG to enable context-aware agent decision-making
  • Build end-to-end ML pipelines for severity classification, anomaly detection, failure pattern recognition, and impact forecasting using observability data
  • Establish scalable orchestration infrastructure for multi-agent workflows with CI/CD, automated evaluation, canary releases, and rollback strategies
  • Implement monitoring for agent interactions, A2A communication patterns, decision quality, data drift, and system reliability
  • Lead technical architecture ensuring scalability, observability, and integration with existing alerting, logging, and monitoring systems
  • Define standards for agent safety, explainability, governance, and human-in-the-loop controls for high-impact automated actions
  • Partner with SRE, Product, and Engineering teams to translate reliability goals into measurable ML objectives and maintain pragmatic technical roadmaps
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer

GEICO is seeking an experienced Staff Software Engineer to join our Knowledge Gr...
Location
Location
United States , Chevy Chase; Seattle
Salary
Salary:
105000.00 - 230000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience designing and implementing knowledge management platforms, semantic data systems, content generation tools, or AI-driven developer platforms
  • Full-stack developer with extensive experience in modern front-end frameworks (React, TypeScript), web technologies (JavaScript, HTML, CSS/SASS), backend languages (Node.js, Python, Java), and cloud platforms (Azure, AWS, GCP)
  • Strong ability to architect distributed semantic systems and graph-based microservice architectures that handle complex data relationships and scale reliably
  • Experience with knowledge graphs, semantic technologies, and AI/ML platforms such as Neo4j, Apache Jena, TigerGraph, or similar graph databases, along with NLP frameworks and content generation models
  • Familiarity with semantic web standards (RDF, OWL, SPARQL), ontology design, knowledge representation, and automated reasoning systems
  • Deep understanding of content management ecosystems, headless CMS architectures, API-driven publishing workflows, and content delivery optimization
  • Experience with AI/ML frameworks for natural language processing, content generation (GPT, BERT, T5), recommendation systems, and knowledge extraction from unstructured data
  • Product mindset and passion for building intelligent tools that solve complex content challenges and enhance user experiences through semantic understanding
  • Excellent collaboration and communication skills with ability to explain complex semantic concepts to technical and non-technical stakeholders
  • In-depth knowledge of CS data structures, algorithms, particularly graph algorithms, semantic matching, and distributed system design patterns
Job Responsibility
Job Responsibility
  • Architect and design enterprise-scale knowledge graph platforms that capture and model GEICO's comprehensive insurance domain expertise, customer insights, product relationships, and market intelligence
  • Build automated semantic content generation systems that leverage knowledge graphs to create personalized insurance content, product descriptions, educational materials, and customer communications at scale
  • Develop intelligent content workflows and APIs that use graph traversal algorithms, natural language processing, and machine learning to automate content production, template generation, and multi-channel publishing
  • Design real-time content personalization engines that query knowledge graphs to deliver contextually relevant messaging based on customer profiles, policy information, and behavioral patterns
  • Create sophisticated data ingestion and enrichment pipelines that continuously build and maintain knowledge graphs from structured and unstructured data sources across the enterprise
  • Implement semantic search and content discovery platforms that understand customer intent and context through graph-based query processing and recommendation algorithms
  • Build internal dashboards and tooling for content performance monitoring, knowledge graph visualization, semantic relationship analysis, and content optimization insights
  • Lead cross-functional collaboration with product managers, data scientists, and content strategists to translate business objectives into scalable knowledge-driven technical solutions
  • Champion engineering excellence in semantic modeling, ontology design, graph database optimization, and AI/ML integration best practices
  • Mentor engineering teams on knowledge graph technologies, content automation frameworks, and distributed system design patterns for semantic platforms
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer (Applied AI)

This is a high-leverage opportunity to serve as a foundational technical leader ...
Location
Location
United States , Seattle; New York City; Boston
Salary
Salary:
180000.00 - 240000.00 USD / Year
pearlhealth.com Logo
Pearl Health
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8-12 years of experience in large-scale software engineering, with a proven track record in service-oriented architectures and large data systems
  • Demonstrated architectural experience designing and implementing highly observable, scalable distributed systems, including complex microservices and event-driven architectures
  • Hands-on experience in the full-stack productization of Applied AI/LLM features, specifically working with RAG architectures (chunking strategies, semantic search, vector databases) and Agentic workflows
  • Cloud platform proficiency (preferably AWS), including hands-on experience with infrastructure design, deployment pipelines, and cost optimization
  • Expertise in designing and optimizing data-intensive applications, including deep database expertise (schema design, query optimization) and data consistency patterns
  • A strong leadership track record in driving technical direction for significant, cross-functional projects
Job Responsibility
Job Responsibility
  • Architect and build the foundational LLM architecture (RAG, Agents, Vector DBs) that powers high-impact product features across the organization
  • Design and deliver high-performance, distributed systems (data pipelines, APIs, microservices) that process healthcare data at scale and securely integrate LLM outputs into the user experience
  • Drive Proof-of-Concepts (POCs) and technical evaluation of new AI technologies to validate their product viability and scalability
  • Design and build responsive web applications using modern frontend frameworks to deliver intuitive, user-facing intelligence and analytic features
  • Ensure observability, monitoring, and operational excellence, championing security and regulatory compliance (HIPAA, SOC2)
  • Drive system design and technical planning for all critical projects, including large, complex, and cross-functional initiatives across the organization
  • Partner strategically with Product, Operations, and Design leadership, translating complex business requirements into technical strategies and influencing roadmap prioritization
  • Mentor engineers at Software Engineer I, II, and III levels, accelerating their technical growth and leadership development
  • Educate the organization on the capabilities, best practices, and limitations of using LLMs and applied AI within the healthcare domain
  • Define and implement engineering standards, patterns, and best practices that significantly improve scalability, reliability, and developer productivity across the product
What we offer
What we offer
  • discretionary performance bonus
  • equity options
  • competitive benefits package
  • Fulltime
Read More
Arrow Right

Staff Software Engineer

We are seeking a Staff Software Engineer to help shape the evolution of the Bran...
Location
Location
Hungary; United Kingdom
Salary
Salary:
Not provided
brandwatch.com Logo
Brandwatch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years professional software engineering experience
  • Expertise in scalable architectures and large-volume data processing
  • Proven technical leadership and ability to influence engineering direction
  • A collaborative mindset and strong problem-solving skills
Job Responsibility
Job Responsibility
  • Design and optimise APIs and data pipelines supporting high-throughput interactive analytics (real-time search, filtering and aggregations on billions of documents)
  • Build and enhance distributed systems responsible for large-scale indexing, search, and data processing
  • Drive performance, scalability, cost and reliability improvements in petabyte-scale systems
  • Tune systems to make optimal use of cloud and datacenter resources
  • Develop capabilities that enable customers to extract actionable insights from billions of documents across social and traditional media sources
  • Lead design discussions and guide cross-team technical initiatives
Read More
Arrow Right

Staff Software Engineer, Backend

Cresta is excited to expand our operations to Romania and build a tech hub in th...
Location
Location
Romania , Cluj-Napoca
Salary
Salary:
Not provided
cresta.com Logo
Cresta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience as a Backend Engineer with a focus on database design and system architecture
  • Strong expertise in ClickHouse or similar columnar databases for managing large-scale, real-time analytical queries
  • Hands-on experience with Elasticsearch for indexing and searching large datasets
  • Proficient in backend programming languages such as Python, Go
  • Experience with RESTful API design and development
  • Solid understanding of distributed systems, microservices architecture, and cloud infrastructure
  • Experience with performance tuning, data modeling, and query optimization
  • Strong problem-solving skills and attention to detail
  • Excellent communication and teamwork abilities
Job Responsibility
Job Responsibility
  • Design, implement, and maintain backend services and APIs to support applications
  • Build and optimize data storage solutions using Postgres, ClickHouse and Elasticsearch to ensure high performance and scalability
  • Collaborate with cross-functional teams, including frontend engineers, data scientists, and machine learning engineers, to deliver end-to-end solutions
  • Monitor and troubleshoot performance issues in distributed systems and databases
  • Write clean, maintainable, and efficient code following best practices for backend development
  • Participate in code reviews, testing, and continuous integration efforts
  • Ensure security, scalability, and reliability of backend services
  • Analyze and improve system architecture, focusing on performance bottlenecks, scaling, and security
Read More
Arrow Right

Staff Software Engineer - Open Source Server

Temporal is an open source programming model that can simplify code, make applic...
Location
Location
United States
Salary
Salary:
170000.00 - 250000.00 USD / Year
temporal.io Logo
Temporal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Rich experience as an 'Arranger' and/or 'Builder/Enhancer' in large-scale distributed systems design (reliability, scalability)
  • Operational experience in large-scale distributed systems environments
  • Experience developing highly concurrent systems
  • Demonstrated experience writing concurrent code in production as Advanced or Expert levels
  • Knowledge and experience of reliability to ensure the high reliability of the Temporal system
  • Experience with Elastic Search or Clickhouse
  • Ideas and actions to improve the velocity of the team
Job Responsibility
Job Responsibility
  • Design and implement core backend service features
  • Provide appropriate test coverage for unit, integration, and performance for your feature ownership area
  • Clearly document design choices and operational knowledge to successfully deploy and run service with those features
  • Provide appropriate service level logs and metrics to make features operational for cloud service setup
  • Provide appropriate alerts, dashboards, and runbooks for production
What we offer
What we offer
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $500 / Year Professional Memberships
  • Fulltime
Read More
Arrow Right