CrawlJobs Logo

Staff Software Engineer, Backend (AI Platform)

United States · Job Posted January 16, 2026
Apply Position
Job Link Share

Job Description

Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Our platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices, automate conversations and inefficient processes, and empower every team member to work smarter and faster.

Job Responsibility

  • Own model serving: Design, build, and maintain low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners
  • Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices
  • Optimize at scale: Profile and tune throughput, memory, and cost
  • introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off
  • Build platform primitives: Create reusable SDKs, templates, and CLI tools that let research and product teams ship models independently and safely
  • Raise the bar: Instrument deep observability (tracing, metrics, alerts), drive blameless post-mortems, and mentor engineers on production ML best practices

Requirements

  • 5+ years writing production software
  • 2+ years focused on ML platform or infra
  • Expert Python (async, typing, packaging, performance)
  • Working Golang knowledge for systems components
  • Proven experience with one or more serving frameworks (e.g., vLLM, Triton, TorchServe)
  • Kubernetes and cloud-native ops
  • Solid grasp of distributed systems, networking, and container security
  • Culture of rigorous testing, code review, and continuous delivery

Nice to have

  • Hands-on with large language models or real-time streaming inference
  • Terraform, Helm, or similar IaC tooling
  • Experience in speech or conversational AI domains

What we offer

  • Comprehensive medical, dental, and vision coverage with plans to fit you and your family
  • Flexible PTO to take the time you need, when you need it
  • Paid parental leave for all new parents welcoming a new child
  • Retirement savings plan to help you plan for the future
  • Remote work setup budget to help you create a productive home office
  • Monthly wellness and communication stipend to keep you connected and balanced
  • In-office meal program and commuter benefits provided for onsite employees

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Software Engineer, Backend (AI Platform)

8 matching positions

Staff Software Engineer - Backend Gen Ai

The Media Platform team builds Uber's unified, scalable infrastructure for inges...
Location
Location
United States , Sunnyvale
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of backend engineering experience, with deep expertise in distributed systems and large-scale service architecture
  • Strong backend engineering experience (Go, Java, C++, or similar) with expertise in system design, performance optimization, and reliability
  • Experience building high-throughput, low-latency services handling large data volumes (streaming, storage, or media systems)
Job Responsibility
Job Responsibility
  • Architect and scale distributed backend systems that support media ingestion, processing, intelligence, and delivery across global regions
  • Improve performance, reliability, and cost efficiency of high-throughput media pipelines
  • Design infrastructure that enables efficient integration and execution of ML inference workloads within media systems
  • Drive technical strategy and long-term architectural decisions across the Media Platform
  • Mentor engineers and raise the bar for engineering excellence, operational rigor, and system design
What we offer
What we offer
  • Bonus program
  • Equity award
  • 401(k) plan
  • Various benefits
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, AI Agent Platform

The Geico AI Agent Platform team is seeking an exceptional Staff Software Engine...
Location
Location
United States , Chevy Chase; New York City
Salary
Salary:
115000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Engineering, Mathematics, or a related field
  • an advanced degree (master’s or Ph.D.) is highly desirable
  • 6+ years of hands-on experience in designing, implementing, and maintaining multi-tenant AIML systems and platforms in production environments
  • 6+ years of experience working with cloud platforms such as Azure and AWS
  • Extensive expertise in designing and deploying large-scale data pipelines and real-time inference systems and managing the end-to-end AI Agent and/or AIML system development lifecycles, including configuration, evaluation, monitoring, observability and AuthN/AuthR considerations
  • 6+ years of experience working with common backend systems & tools (e.g, Kubernetes, Temporal, OpenSearch, PostgreSQL, Redis, Neo4J, etc.)
  • Deep understanding of Docker, container optimization, and multi-stage builds
  • Experience with Prometheus, Grafana, Open Telemetry and distributed tracing
  • 3+ years of experience building front-end web applications using frameworks such as React and/or Next.JS
  • Deep proficiency in programming languages such as Python, Java, Go, etc., with a strong emphasis on coding excellence
Job Responsibility
Job Responsibility
  • Architect and implement scalable multi-tenant backend systems for building AI agent workflows, including agent configuration, offline evaluation, synthetic data generation, workflow simulation, agent marketplace, etc. using Azure Kubernetes Service (AKS), FastAPI, etc., ensuring economy of scale and control cost of maintenance
  • Collaborate with Design team to architect and implement frontend experiences and workflows for onboarding both technical and non-technical stakeholders, maximizing user adoption and successful AI agent development
  • Develop observability frameworks to ensure 99.9%+ uptime for AI agent platforms through robust monitoring, alerting, and incident response procedures
  • Evaluate and (if desirable) integrate cutting-edge GenAI frameworks, libraries and vendors to maintain a state-of-the-art technology stack, including hybrid cloud solutions with AWS/GCP as backup or specialized use cases
  • Architect and implement scalable, high-performance machine learning platforms and systems capable of processing large data volumes and supporting real-time decision making and workflows
  • Oversee the end-to-end lifecycle of AI agent applications, ensuring robust testing, deployment, and ongoing monitoring
  • Ensure adherence to company production readiness standards, security protocols, and regulatory compliance throughout the development lifecycle
  • Continuously optimize platform performance, reducing latency and improving throughput for AI agent workloads
  • Design and implement backup, recovery, and business continuity plans for hosted platform applications & services
  • Design and maintain robust CI/CD pipelines for ML model deployment using Azure DevOps, GitHub Actions, and MLOps tools
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, (Backend) Uber AI Solutions

At Uber, our mission is to be the platform of choice for flexible earning opport...
Location
Location
United States , San Francisco, California; Sunnyvale, California
Salary
Salary:
232000.00 - 258000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s (or Master’s) degree in Computer Science, Engineering or related discipline (or equivalent experience)
  • Expert in at least one major backend or infrastructure technology (languages, frameworks, distributed systems, data pipelines) and comfortable influencing architecture across teams
  • Strong record of mentoring and developing engineers, setting technical standards, and driving impact beyond a single team
  • Excellent communication and collaboration skills
  • able to engage with multiple teams, stakeholders, and articulate vision and trade-offs
  • Experience participating in hiring and helping build out engineering teams or capability
  • 8+ years of professional software engineering experience, with substantial experience designing, building, and operating large-scale systems across multiple teams
  • Deep understanding of ML Ops ecosystems, LLM or ML model lifecycle management, and large-scale data processing frameworks (e.g., Kubeflow, Airflow, Ray, Spark)
  • Proven experience architecting systems for data labeling, translation, or human-in-the-loop workflows supporting high-volume ML applications
  • Strong familiarity with GenAI, Physical AI and LLM infrastructure model hosting, fine-tuning, evaluation, and integration into production services
Job Responsibility
Job Responsibility
  • Architect and evolve core systems that span multiple teams, ensuring scalability, performance, and long-term maintainability of critical platform services
  • Provide technical leadership across teams, driving alignment on design patterns, service interfaces, and shared infrastructure investments
  • Mentor and develop senior engineers, elevating technical depth, decision-making, and design rigor across the broader group
  • Champion the adoption of AI-assisted development tools and modern engineering practices to improve code quality, reliability, and delivery speed across teams
  • Influence hiring and talent development, helping shape team composition and maintaining a high engineering bar across multiple teams
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • Eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, AI Platform

GoodLeap is a technology company delivering best-in-class financing and software...
Location
Location
United States , AUSTIN; SAN FRANCISCO; IRVINE; ROSEVILLE
Salary
Salary:
173000.00 - 200000.00 USD / Year
goodleap.com Logo
GoodLeap
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building and shipping scalable, robust backend services and APIs
  • Strong proficiency in Python and/or TypeScript
  • Solid understanding of distributed systems, service-oriented architecture, and event-driven patterns (e.g. Kafka, RabbitMQ, SQS)
  • Passion for software development, emerging technologies and culture of innovation
  • A collaborative mindset and interest in mentoring teammates and elevating team practices
  • Excellent communication and interpersonal skills
Job Responsibility
Job Responsibility
  • Build features and extensions to our agentic AI platform using scalable, robust, and AI-first software engineering practices
  • Design tools and infrastructure to enable teams at GoodLeap to easily build and enhance AI agents that empower homeowners, contractors, and operations staff
  • Work alongside a team of AI engineers, product managers, and data scientists to evaluate and improve our agent ecosystem
  • Collaborate with Staff engineers, product, architecture, and design leads to deliver highly-available, fault-tolerant products and services
  • Work on significant and unique technical challenges, evaluate and recommend solutions, and guide decision making by considering technical tradeoffs
  • Grasp both the technical and business perspective so you can help drive innovation
  • Work autonomously and be self-disciplined, requiring minimal supervision or guidance
  • Collaborate with other team members and coach more junior team members to grow both their technical skills and soft skills
What we offer
What we offer
  • May be eligible for a bonus and equity
  • Fulltime
Read More
Arrow Right

Staff Engineer - AI Platform

At Teradata, we're not just managing data; we're unleashing its full potential. ...
Location
Location
India , Telangana
Salary
Salary:
Not provided
teradata.com Logo
Teradata
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, Data Science, or a related field
  • Experience in UI/UX design and frontend engineering
  • Proficiency in Angular, TypeScript, Figma, and design systems
  • Strong engineering background (Python/Java/Golang, API integration, backend frameworks)
  • Strong system design skills and understanding of distributed systems
  • Experience developing native notebook interfaces or extensions (e.g., Jupyter, VS Code Notebooks, or custom notebook UIs)
  • Strong understanding of human-computer interaction (HCI)
  • Experience designing interfaces for complex workflows or ML-powered products
  • Experience with LLM-based tools or agent orchestration (e.g., LangChain, AutoGen)
  • Familiarity with containerized environments (Docker, Kubernetes) and CI/CD pipelines
Job Responsibility
Job Responsibility
  • Design and prototype interfaces for interacting with autonomous agents
  • Implement responsive, accessible, and explainable UI components that visualize AI decisions, uncertainty, and reasoning paths
  • Partner with AI researchers and software engineers to ensure interfaces support emerging agent capabilities
  • Conduct usability testing with humans-in-the-loop scenarios
  • Drive UX best practices around safety, trust calibration, and explainability for intelligent systems
  • Design and prototype intuitive, high-impact interfaces for interacting with autonomous and intelligent agents
  • Experiment with LLM APIs, agentic workflows, and cutting-edge open-source frameworks
  • Explore and implement planning systems, vector databases, or memory architectures such as graph-based storage on Teradata
  • Champion UX best practices around safety, trust calibration, and explainability for intelligent systems
  • Collaborate with AI researchers and software engineers to ensure interfaces evolve alongside emerging agent capabilities
What we offer
What we offer
  • We prioritize a people-first culture
  • We embrace a flexible work model
  • We focus on well-being
  • We are committed to actively working to foster an inclusive environment that celebrates people for all of who they are
  • Fulltime
Read More
Arrow Right

Staff, Software Engineer - Backend

Walmart's Enterprise Business Services (EBS) is a powerhouse of seven exceptiona...
Location
Location
United States , Bentonville
Salary
Salary:
110000.00 - 220000.00 USD / Year
walmart.com Logo
Walmart
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, computer engineering, computer information systems, software engineering, or related area and 4 years' experience in software engineering or related area
  • 6 years' experience in software engineering or related area
  • Python guru with a proven track record of writing high-performing, production-quality code
  • Hands-on experience designing and building Python-based web services in a production setting (FastAPI experience preferred)
  • Deep familiarity with version control using Git in collaborative team environments
  • Comfortable working with Linux environments and containerization technologies such as Docker
  • 4+ years of industry experience with demonstrated ownership and delivery of software products
  • Hands-on experience developing or deploying GenAI-based applications
  • Experience working with or integrating open-source and/or commercial GenAI libraries/frameworks such as Hugging Face Transformers, LangChain, OpenAI API, or similar
  • Ability to productionize and evaluate GenAI models
Job Responsibility
Job Responsibility
  • Design and develop platform features enabling advanced semantic routing for GenAI-powered services
  • Build and maintain evaluation pipelines for semantic router data
  • Collaborate with applied researchers and data scientists to continuously improve semantic routing algorithms
  • Develop and implement agent-to-agent (A2A) communication protocols
  • Contribute to the design and development of platform features using microservices (FastAPI) and event-driven architecture (Kafka, SSE, WebSocket)—all in Python
  • Uphold engineering and operational excellence standards
  • Support operational excellence for semantic routing and agent communication systems
  • Stay current with GenAI and multi-agent system best practices
  • Be an active member of a dynamic team
  • Support production operations by participating in on-call rotations
What we offer
What we offer
  • Medical coverage
  • Vision coverage
  • Dental coverage
  • 401(k) match
  • Stock purchase plan
  • Paid maternity and parental leave
  • PTO
  • Short-term disability
  • Long-term disability
  • Company discounts
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Backend

You'll lead critical backend initiatives as Gamma scales from millions to hundre...
Location
Location
United States , San Francisco
Salary
Salary:
230000.00 - 310000.00 USD / Year
gamma.app Logo
Gamma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience as a software engineer with deep expertise in building large-scale production systems
  • Deep and comprehensive understanding of large relational databases, service-oriented architectures, and HTTP & REST protocols
  • Proven track record writing and maintaining highly-available web APIs
  • Extensive experience with event streaming systems like Redis Pubsub or Apache Kafka
  • Leadership experience in large, complex production-scale codebases
  • Passion for building APIs, scaling complex systems, and creating excellent web applications
  • Advanced experience with AI prompting and large language models
  • Experience programming with TypeScript, Prisma, and Apollo GraphQL
  • Experience with Terraform and AWS Services
Job Responsibility
Job Responsibility
  • Build APIs that power our expanding platform and design innovative data models that support our growing user base
  • Architect and lead scaling initiatives to handle our next order of magnitude of growth
  • Design and implement our tech stack including Kafka pipelines and real-time AI streaming systems
  • Lead performance optimization efforts and architect solutions to overcome bottlenecks
  • Implement live subscription updates as data changes using event streaming systems
  • Mentor engineers while collaborating across teams to create exceptional developer and user experiences
What we offer
What we offer
  • competitive equity
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, AI

Credit Genie is a mobile-first financial wellness platform designed to help indi...
Location
Location
United States , Pittsburgh; Philadelphia; Plymouth Meeting; New York
Salary
Salary:
150000.00 - 250000.00 USD / Year
creditgenie.com Logo
Credit Genie
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Software Engineer with 5+ years of industry experience
  • Strong foundations in multiple programming languages (Python, Java, TypeScript, etc.)
  • Hands-on experience with cloud platforms (AWS, GCP, or Azure)
  • Experienced at designing and implementing distributed, production-grade systems
  • Comfortable with system design, APIs, version control, Infrastructure as Code, and testing
  • Curious, motivated, and eager to expand into AI/ML
  • Collaborative and excited by fast-moving, problem-solving environments
Job Responsibility
Job Responsibility
  • Lead the design and implementation of highly available, scalable backend services and APIs that serve and integrate our AI models and applications into production systems
  • Architect and optimize the services and data pipelines essential for deploying, monitoring, and maintaining real-time AI inferencing and retrieval at scale
  • Collaborate with AI and ML Engineers to improve model deployment, monitoring, and experimentation workflows (MLOps/AIOps)
  • Drive technical excellence, setting high standards for code quality, system reliability, and performance
  • Mentor and guide other engineers on best practices for building robust backend systems in an AI-focused environment
  • Have fun working on hard and highly impactful problems
What we offer
What we offer
  • Offers Equity
  • Offers Bonus
  • Comprehensive medical, vision, and dental coverage
  • 401(k) retirement plan with company match
  • Short & long term disability insurance
  • Life insurance
  • Flexible PTO
  • 100% company-paid medical, dental, and vision coverage for you and your dependents on your first day of employment
  • Receive up to $100 per month in fitness reimbursement or enjoy a complimentary full membership to LifeTime Fitness or Equinox
  • 401(k) with a 3.5% match and immediate vesting
  • Fulltime
Read More
Arrow Right