This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Lead the design and development of core data storage, streaming, caching, and indexing platforms and underlying systems for Scale AI's products, which power the most advanced LLMs and generative models in the world.
Job Responsibility:
Drive the architecture, design, implementation, and reliability of our foundational data platforms and systems, working closely with stakeholders and internal customers to understand and refine requirements
Collaborate with cross-functional teams to define, design, and deliver new features
Proactively identify opportunities for, and driving improvements to, current programming practices, including process enhancements and tool upgrades
Present technical information to teams and stakeholders, providing guidance and insight on development processes and technologies
Provide technical leadership, including: upholding and upleveling engineering standards across the organization, mentoring junior engineers
Requirements:
8+ years of full-time engineering experience, post-graduation with specialties in back-end systems, specifically related to building large-scale data storage, streaming, and warehousing systems
Extensive experience in various database technologies (MongoDB, Postgres), streaming/processing solutions (Kinesis, Flink, Spark), indexing/caching (ElasticSearch, Redis), and various data query engines (Trino, Presto, Snowflake, etc.)
Show a track record of mentoring and leading teams in successful projects
Possess excellent communication and collaboration skills, and the ability to translate complex technical concepts to non-technical stakeholders
Experience working fluently with standard containerization & deployment technologies like Kubernetes and various public cloud offerings
Extensive experience in software development and a deep understanding of distributed systems, cloud platforms and data systems
Experience driving cross functional collaboration and communication at an organizational or broader level
Nice to have:
Strong knowledge of software engineering best practices and CI/CD tooling (CircleCI)
Experience with performance tuning and cost optimizations of cloud based data platforms
Experience defining a data lifecycle strategy and designing/implementing tooling for data privacy (i.e. GDPR) needs
Experience scaling products at hyper-growth startups