This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are looking for a Principal LLM Engineer to lead the design and development of advanced LLM applications that enhance our video-based medical education platform. This role offers a unique opportunity to shape the future of how healthcare professionals access and engage with cutting-edge clinical knowledge. If you are passionate about creating intelligent, scalable solutions to improve user experiences, we invite you to join our dynamic team in Oakland, California.
Job Responsibility:
Develop and deploy workflows powered by large language models (LLMs) to improve the search, recommendation, and personalization capabilities of the platform
Collaborate with product, data, and AI teams to create intelligent services such as classification, relevance ranking, and summarization
Establish and enforce architectural standards across backend, frontend, and infrastructure layers to ensure system reliability and scalability
Lead modernization efforts for backend systems built on Python/Django and frontend technologies like React
Mentor engineering teams, providing technical guidance and fostering best practices through code reviews and collaborative design sessions
Conduct technical exploration of emerging tools and technologies, including vector databases and real-time video personalization frameworks
Prototype and test innovative solutions to ensure the platform remains at the forefront of applied AI developments
Requirements:
10+ years of experience developing and scaling software systems
5+ years experience shipping LLM applications particularly in consumer-focused or AI-driven products
Proven track record as a Principal Engineer, Staff Engineer, or Architect in leading complex system designs
Proficient in Python, with preferred experience in frameworks such as Django, Flask, or FastAPI
Demonstrated knowledge in cloud architectures and distributed systems, including services like EC2, Lambda, S3, and CloudFront