This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Cresta is on a mission to turn every customer conversation into a competitive advantage by unlocking the true potential of the contact center. Our platform combines the best of AI and human intelligence to help contact centers discover customer insights and behavioral best practices, automate conversations and inefficient processes, and empower every team member to work smarter and faster.
Job Responsibility:
Own model serving: Design, build, and maintain low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners
Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices
Optimize at scale: Profile and tune throughput, memory, and cost
introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off
Build platform primitives: Create reusable SDKs, templates, and CLI tools that let research and product teams ship models independently and safely
Raise the bar: Instrument deep observability (tracing, metrics, alerts), drive blameless post-mortems, and mentor engineers on production ML best practices