This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Director, Platform Engineering – TIP.AI Platform is a senior technical role responsible for designing, building, and operating the highly scalable, resilient platform infrastructure that powers AI systems across the enterprise. This role functions as a principal engineer, building and architecting technical solutions, establishing platform standards, and solving the most complex scalability, reliability, and performance challenges, while also providing technical leadership and mentorship across multiple engineering teams.
Job Responsibility:
Define and evolve the core platform architecture for TIP.AI, ensuring it is scalable, highly available, secure, and cost‑efficient at enterprise scale
Establish and maintain reference architectures, design patterns, and engineering standards for platform services
Own platform reliability, including availability, performance, resiliency, and disaster recovery
Define and enforce SLOs, SLIs, and error budgets for core platform services
Build and mature observability practices (metrics, logs, traces) to ensure deep operational insight
Partner with security teams to ensure platform designs meet enterprise security, privacy, and compliance requirements
Build internal platform capabilities and shared services that improve developer velocity and consistency
Establish CI/CD, deployment, and runtime patterns that enable safe, fast delivery at scale
Reduce cognitive load on application teams by abstracting infrastructure complexity behind well‑designed platform interfaces
Act as the principal technical authority for the TIP.AI Platform team
Mentor senior and staff‑level engineers, raising the technical bar across the organization
Influence without direct authority, partnering closely with product, data, AI, and security leaders
Partner with product, AI, and business leaders to align platform capabilities with TIP.AI’s long‑term roadmap
Translate business and growth goals into scalable technical strategies
Requirements:
8+ years of experience designing, building, and operating large‑scale distributed systems in production environments
Deep experience designing, building and operating large‑scale, distributed, systems
Strong background in fault tolerant, auto‑recoverable software systems at scale
Proven expertise in reliability engineering, scalability, and performance optimization
Hands‑on experience with at least one major cloud platform (AWS, Azure, or GCP)
Strong proficiency in infrastructure‑as‑code and CI/CD systems
Demonstrated ability to lead through influence, mentor senior engineers, and set technical direction across teams
Full‑Stack Entrepreneurial Mindset: Must have bootstrapped/built at least one full stack system delivered to real users
Excellent verbal communication skills, with the ability to articulate complex architectural decisions clearly
Ability to produce/review extremely clean software documentation
Ability to effectively communicate async with remote team members across the globe
Nice to have:
Experience supporting AI/ML, data platforms, or high‑scale automation systems
Background in platform teams serving multiple internal product or engineering groups
Familiarity with security, privacy, and compliance requirements in large enterprises
Experience driving cost governance and efficiency for large‑scale cloud platforms
Prior work in highly regulated or global enterprise environments