This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'll drive meaningful innovation, make a tangible impact, and join a team that’s setting the pace for responsible, transformative cloud infrastructure. The Crusoe Cloud Managed AI team seeks an ambitious and experienced Senior Software Engineer to join their team. You'll have a pivotal role in shaping the architecture and scalability of our next-generation managed AI services. You will lead the design and implementation of core systems for our AI offerings. This role offers the opportunity to build and scale critical infrastructure capable of handling millions of API requests per second across thousands of customers. The team is working on building the next chapter of products to accelerate the process of adding intelligence into software.
Job Responsibility:
Lead the design and implementation of core AI services, including: Resilient fault-tolerant queues for efficient task distribution
Model catalogs for managing and versioning AI models
Scheduling mechanisms optimized for cost and performance
Architect and scale infrastructure to handle millions of API requests per second
Implement robust monitoring and alerting to ensure system health and 24/7 availability
Collaborate closely with product management, business strategy, and other engineering teams to define the AI platform roadmap
Influence the long-term vision and architectural decisions of the platform
Contribute to open-source AI frameworks and actively participate in the AI community
Prototype and rapidly iterate on emerging technologies and new features
Requirements:
Advanced degree in Computer Science/Engineering
4-5+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
Experience with distributed systems, cloud services (compute, storage, networking, database), and delivering early-stage projects quickly
Experience with Generative AI (LLMs, Multimodal) and familiar with AI infrastructure (training, inference, ETL pipelines)
Proficient with container runtimes (e.g., Kubernetes), microservices, REST APIs, gRPC, and the full software development lifecycle including CI/CD
Nice to have:
Proficiency in Golang, Python or Rust for production services
Contributions to open-source AI projects (e.g., VLLM)
Experience with performance optimizations on GPU systems and inference frameworks
What we offer:
Restricted Stock Units
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability