Principal LLM Engineer

Robert Half

Location:
United States, Oakland

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We are looking for a Principal LLM Engineer to lead the design and development of advanced LLM applications that enhance our video-based medical education platform. This role offers a unique opportunity to shape the future of how healthcare professionals access and engage with cutting-edge clinical knowledge. If you are passionate about creating intelligent, scalable solutions to improve user experiences, we invite you to join our dynamic team in Oakland, California.

Job Responsibility:

  • Develop and deploy workflows powered by large language models (LLMs) to improve the search, recommendation, and personalization capabilities of the platform
  • Collaborate with product, data, and AI teams to create intelligent services such as classification, relevance ranking, and summarization
  • Establish and enforce architectural standards across backend, frontend, and infrastructure layers to ensure system reliability and scalability
  • Lead modernization efforts for backend systems built on Python/Django and frontend technologies like React
  • Mentor engineering teams, providing technical guidance and fostering best practices through code reviews and collaborative design sessions
  • Conduct technical exploration of emerging tools and technologies, including vector databases and real-time video personalization frameworks
  • Prototype and test innovative solutions to ensure the platform remains at the forefront of applied AI developments

Requirements:

  • 10+ years of experience developing and scaling software systems
  • 5+ years of experience shipping LLM applications, particularly in consumer-focused or AI-driven products
  • Proven track record as a Principal Engineer, Staff Engineer, or Architect in leading complex system designs
  • Proficient in Python, with preferred experience in frameworks such as Django, Flask, or FastAPI
  • Demonstrated knowledge of cloud architectures and distributed systems, including services like EC2, Lambda, S3, and CloudFront

What we offer:

  • medical
  • vision
  • dental
  • life and disability insurance
  • eligibility to enroll in our company 401(k) plan

Additional Information:

Job Posted:
March 13, 2026

Similar Jobs for Principal LLM Engineer

Senior Principal Machine Learning Engineer - LLM Post-Training and Optimization

Atlassian is seeking a highly skilled and experienced Senior Principal Machine L...
Location
United States, Mountain View
Salary:
243100.00 - 407200.00 USD / Year
Atlassian
Expiration Date
Until further notice
Requirements
  • Ph.D. or Master’s degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field
  • 8+ years of experience in machine learning, with a focus on large-scale model development and optimization
  • Deep expertise in LLM and transformer architectures (e.g., GPT, BERT, T5)
  • Strong proficiency in Python and ML frameworks such as PyTorch, JAX, or TensorFlow
  • Experience with distributed training techniques and large-scale data processing pipelines
  • Proven track record of deploying machine learning models in production environments
  • Familiarity with model optimization techniques, including quantization, pruning, and knowledge distillation
  • Strong problem-solving skills and ability to work in a fast-paced, collaborative environment
  • Excellent communication skills and ability to translate technical concepts for diverse audiences
Job Responsibility
  • Lead the fine-tuning and post-training optimization of large language models (LLMs) for diverse applications
  • Develop and implement techniques for model compression, quantization, pruning, and knowledge distillation to optimize performance and reduce computational costs
  • Conduct research on advanced techniques in transfer learning, reinforcement learning, and prompt engineering for LLMs
  • Design and execute rigorous benchmarking and evaluation frameworks to assess model performance across multiple dimensions
  • Collaborate with infrastructure teams to optimize LLM deployment pipelines, ensuring scalability and efficiency in production environments
  • Stay at the forefront of advancements in LLM technologies, sharing insights, driving innovation within the team, and leading agile development
  • Mentor other team members, facilitate workshops within and across teams, and foster a culture of technical excellence and continuous learning
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime

Principal AI Engineer

We are looking for a Principal AI Engineer to lead the design and deployment of ...
Location
United States
Salary:
200000.00 - 300000.00 USD / Year
Apollo.io
Expiration Date
Until further notice
Requirements
  • 10+ years of software engineering experience
  • at least 3 years in applied LLM or agentic AI systems (2023–present)
  • proven success in deploying LLM-powered products used by real users at scale
  • deep backend & systems engineering expertise with Python, distributed systems, and scalable APIs
  • familiarity with LangChain, LlamaIndex, or similar orchestration frameworks
  • experience with RAG pipelines, vector DBs, embedding models, and semantic search tuning
  • experience managing performance across cloud providers (e.g., AWS Bedrock, OpenAI, Anthropic, etc.)
  • demonstrated experience building multi-step agents, planning workflows, chaining reasoning steps, and integrating APIs with agent memory/state
  • comfort with advanced prompting strategies, few-shot and chain-of-thought reasoning, and embedding retrieval setups
  • strong understanding of AI system evaluation, human ratings, A/B experimentation, and feedback loop pipelines
Job Responsibility
  • Architect and lead the development of multi-agent systems capable of long-horizon planning, reasoning, and API orchestration
  • build reusable agentic components that integrate deeply into sales and marketing processes
  • own and evolve our in-house platform for scalable, low-latency, and cost-efficient LLM and agent deployments
  • lead design of interfaces powered by natural language understanding and retrieval-augmented generation (RAG)
  • build embedding-based, intent-aware search and personalization systems tuned to business user needs
  • drive innovation in personalized outreach generation using context-aware generation pipelines
  • tune inference pipelines, caching layers, and model selection logic for high-scale, cost-aware performance
  • define and drive robust offline and online testing methodologies (A/B, sandboxing, human evals) across agents and LLM flows
  • architect human-in-the-loop systems and telemetry to improve accuracy, UX, and explainability over time
What we offer
  • equity
  • company bonus or sales commissions/bonuses
  • 401(k) plan
  • at least 10 paid holidays per year
  • flex PTO
  • parental leave
  • employee assistance program
  • wellbeing benefits
  • global travel coverage
  • life/AD&D/STD/LTD insurance
  • Fulltime

Principal AI Engineer

At JFrog, we’re reinventing DevOps to help the world’s greatest companies innova...
Location
Israel, Netanya/Tel Aviv
Salary:
Not provided
JFrog
Expiration Date
Until further notice
Requirements
  • A bachelor's degree or higher in Computer Science, Data Science, or a related field
  • Proven experience in software development
  • Proficiency in LLM-related tools, processes, and frameworks, including OpenAI Models and APIs, Hugging Face Transformers, LangChain, vector databases, and prompt management tools like PromptPerfect/PromptBase and Guardrails
  • Experience with cloud platforms, such as AWS, Google Cloud, or Azure
  • Proficiency in Python programming
  • Experience deploying LLM-based applications in a production environment
  • Excellent problem-solving and analytical skills
  • Experience with CI/CD tools
  • Strong communication skills and the ability to collaborate effectively in a team
Job Responsibility
  • Recommend and test agentic productivity tools
  • Collaborate with key organizational stakeholders to understand AI requirements and design end-to-end AI productivity solutions
  • Explore and experiment with novel ML and AI techniques and architectures to drive DevX and productivity innovation
  • Evaluate and recommend ML and AI tools and frameworks to enhance productivity and effectiveness
  • Provide technical guidance and mentorship to development teams on AI and ML technologies and practices
  • Define meaningful KPIs and closely monitor cost

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location
United States, Pleasanton, California
Salary:
251000.00 - 314500.00 USD / Year
BlackLine
Expiration Date
Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on experience with observability stacks (Prometheus, Grafana, New Relic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
  • Fulltime

Principal Machine Learning Engineer

Location
Salary:
Not provided
Atlassian
Expiration Date
Until further notice
Requirements
  • 8+ Years of Experience in Data Science, Machine Learning, Generative AI or related fields
  • Ability to craft analysis into well-written and persuasive content
  • Proficiency in SQL and another data manipulation programming language (e.g., Python, R)
  • Experience driving projects or programs which have had proven impact on business strategy and performance through your analytics skills
  • Experience in applying statistical concepts (e.g. regressions, A/B tests, clustering, probability) to business problems
  • Expertise at telling stories with data and familiarity with at least one visualization tool (e.g. Tableau, R-Shiny, Microstrategy, SAP Business Objects, Looker, etc.)
  • Familiarity with LLM or ML driven product algorithms and success metric selection and measurement
Job Responsibility
  • Influence AI product feature development and roadmaps, and drive impactful change through the structure and clarity of your analysis and recommendations
  • Collaborate on a variety of product and business problems with a diverse set of cross-functional partners and become a trusted strategic partner through the structure and clarity of your work
  • Apply technical expertise with quantitative analysis, experimentation, and the presentation of data to develop strategies for our business and help solve the business's biggest challenges, especially focussed on the fast-paced Atlassian Intelligence suite of features
  • Focus on developing hypotheses through analytical approaches, different methodologies, frameworks, and technical approaches to test them
  • Define, understand, and test opportunities to improve our products, guide business direction, and influence roadmaps through insights and recommendations
  • Partner with cross-functional teams to inform, influence, and execute strategy decisions
  • Identify and measure the success of product efforts through forecasting and monitoring of key product metrics to understand trends
  • Use data to shape product development, quantify new opportunities, identify upcoming challenges, and ensure the products we build bring value to people, businesses, and Atlassian.
What we offer
  • Atlassian offers a wide range of perks and benefits designed to support you, your family and to help you engage with your local community. Our offerings include health and wellbeing resources, paid volunteer days, and so much more.

Principal Data Scientist - Machine Learning Engineering

We are looking for a Principal Machine Learning Data Scientist to develop and im...
Location
United States, Remote
Salary:
145300.00 - 233400.00 USD / Year
Atlassian
Expiration Date
Until further notice
Requirements
  • 8+ years of experience in Data Science or related fields
  • Expertise in applying a broad variety of ML methods including NLP and LLM to solve business problems using large amounts of data
  • Proven track record of delivering ML projects end-to-end, including designing, development, deployment and monitoring
  • Ability to communicate and explain data science concepts to diverse audiences and craft a compelling story
  • Expertise in programming languages such as Python or Java and the ability to write performant code, familiarity with SQL, and knowledge of Spark and cloud data environments (e.g., AWS, Databricks)
  • Agile development mindset, appreciating the benefit of constant iteration and improvement
Job Responsibility
  • Develop and implement our machine learning algorithms
  • Train sophisticated models
  • Collaborate with our technical and non-technical partner teams
  • Expand our AI/ML functionality in partnership with our CSS and/or Sales organization
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • bonuses
  • commissions
  • equity
  • Fulltime

Principal Data Scientist - Machine Learning Engineering

Atlassian is looking for a Principal Data Scientist to uncover valuable insights...
Location
United States, San Francisco
Salary:
175100.00 - 233400.00 USD / Year
Atlassian
Expiration Date
Until further notice
Requirements
  • Experience applying your Data Science skills to identify and lead projects which have had impact on business strategy and performance
  • 8+ years of experience in Data Science or related fields (preferred: 10+ years of experience with a post-graduate degree in a quantitative discipline such as Statistics, Mathematics, Econometrics, or Computer Science)
  • Expertise in applying a broad variety of ML methods including NLP and LLM to solve business problems and a strong sense of when to apply them to the problem at hand
  • Experience in managing ML projects end-to-end including deployment and monitoring
  • Expertise in SQL and a high level of proficiency in another data science programming language (e.g., Python, R), with expertise in libraries such as Pandas, NumPy, and Scikit-learn
  • A very high bar for output quality, while balancing 'having something now' vs. 'perfection in the future'
  • Comfort explaining complex concepts to diverse audiences and creating compelling stories for non-data experts
  • Proficiency in visualization tools (e.g. Streamlit, Tableau)
Job Responsibility
  • Influence strategy & important decisions around customer friction by surfacing data driven insights
  • Define, set and report on department level metrics or KRs to the CSS Executive team
  • Build and implement measurement frameworks, machine learning models and NLP/LLM tooling to accelerate Atlassian’s growth and improve product quality
  • Foster a world-class Data Science culture by leading training on technical concepts, driving continuous learning and mentoring Data Scientists on the team
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime

Principal Software Engineer, AI Developer Tools

At Docker, we make app development easier so developers can focus on what matter...
Location
United States, Seattle
Salary:
232000.00 - 319000.00 USD / Year
Docker
Expiration Date
Until further notice
Requirements
  • 10+ years software engineering experience with 3+ years in Staff or Principal Engineer roles
  • Deep expertise in AI/ML technologies with hands-on production experience building LLM-powered applications, AI agents, or AI-assisted developer tools
  • Strong understanding of LLM APIs (OpenAI, Anthropic, etc.), prompt engineering, agent orchestration frameworks, and practical applications of AI in software development workflows
  • Proven track record of architecting and building highly scalable distributed systems and developer-facing platforms
  • Production experience with modern cloud-native infrastructure including Kubernetes, GitOps deployment patterns, observability systems, and CI/CD pipelines
  • Proficiency in Go (preferred), Rust, Java, or Python with strong software engineering fundamentals
  • Experience designing developer tools, platform engineering systems, or internal tools that enable other teams
  • Exceptional product and platform mindset considering business outcomes, developer experience, and technical trade-offs
  • Strong communication skills with ability to influence technical and non-technical stakeholders across the organization
  • Track record of technical mentorship and elevating engineering teams' capabilities
Job Responsibility
  • Define the long-term technical vision and architecture for AI-powered developer tools and the self-service platform that enables teams to build their own AI agents
  • Establish architectural patterns, technical standards, and best practices for LLM integration, AI agent development, and production AI systems serving developers
  • Lead technical strategy for platform capabilities including deployment frameworks (ArgoCD/GitOps), observability integration (Grafana), security controls, and operational tooling for AI developer tools
  • Design highly available, scalable infrastructure for hosting AI agents and developer tools with predictable performance and intelligent resource management
  • Drive technical decisions on AI technology choices, LLM provider strategies, prompt engineering approaches, and agent orchestration frameworks
  • Partner with Senior Manager and product leadership to align technical architecture with business objectives and productization opportunities
  • Architect and build production-ready AI agents for developer productivity including code review assistants, test generators, deployment diagnostics, and incident response automation
  • Design and implement the self-service platform infrastructure that reduces time-to-production for new AI tools from weeks to days
  • Build systems that accelerate adoption of AI-native development tools (Claude Code, Cursor, Warp) across Docker's engineering organization
  • Establish reliability, security, and performance standards for AI systems including SLOs, monitoring, incident response, and cost management
What we offer
  • Freedom & flexibility: fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup: we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
  • Fulltime