Senior LLMOps Engineer

Heidi

Location:
Australia, Sydney

Contract Type:
Not provided

Salary:

Not provided

Job Description:

Working closely with our Engineering Manager, you’ll be a Senior LLMOps Engineer on the Model Platform team. You are a technical leader responsible for building and scaling the infrastructure that powers our entire model lifecycle. Your mission is to build a robust, scalable, and reliable platform for deploying and managing our LLMs. You will lead the design and implementation of our LLMOps strategy, ensuring our AI engineers can move models from development to production seamlessly and efficiently. You will combine your deep infrastructure knowledge with MLOps principles to solve the critical challenges of serving models at scale.

Job Responsibility:

  • Lead the architecture, design, and implementation of our end-to-end LLMOps platform, from data ingestion and model training pipelines to production deployment and monitoring
  • Build and maintain robust CI/CD/CT (Continuous Integration/Continuous Delivery/Continuous Training) pipelines to automate the testing, validation, and deployment of large language models
  • Engineer highly available and scalable model serving solutions using modern infrastructure like Kubernetes, ensuring low latency and high throughput for our production services
  • Collaborate closely with AI research and engineering teams to understand their needs, streamline workflows, and create the tooling that accelerates their development cycles
  • Champion and implement best practices for model versioning, experiment tracking, monitoring, and governance across the organization
  • Mentor mid-level and junior engineers, sharing your deep expertise in infrastructure, automation, and operational excellence to foster a culture of reliability and scalability

Requirements:

  • Proven track record of designing, building, and maintaining MLOps or LLMOps infrastructure in a production environment
  • Previous hands-on experience building scalable, cloud-native infrastructure and platforms
  • Deployed and managed large-scale machine learning models in a production environment
  • Expert in Python, cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and Infrastructure as Code (e.g., Terraform, CloudFormation)
  • Deep and practical understanding of the entire machine learning lifecycle and the specific operational challenges of large language models
  • Ability to translate complex engineering and research requirements into concrete, robust, and automated platform solutions
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience

Nice to have:

  • Experience with advanced model serving and optimization techniques (e.g., quantization, distillation, multi-model serving)
  • Experience with specialized MLOps frameworks like MLflow, Kubeflow, or Weights & Biases
  • Contributions to open-source MLOps or infrastructure-related projects

What we offer:

  • Flexible hybrid working environment, with 3 days in the office
  • Additional paid day off for your birthday and wellness days
  • Special corporate rates at Anytime Fitness in Melbourne (Sydney TBC)
  • A generous personal development budget of $500 per annum
  • Learn from some of the best engineers and creatives, joining a diverse team
  • Become an owner, with shares (equity) in the company

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Senior LLMOps Engineer

Senior Machine Learning Engineer (Team Lead)

As our Artificial Intelligence (AI) and Machine Learning (ML) Team Leader, you w...
Location
Australia, South Bank
Salary:
Not provided
Flight Centre Brand
Expiration Date
Until further notice
Requirements
  • 7+ years delivering production-grade ML or AI systems with proven commercial impact
  • 3+ years leading and mentoring engineers
  • Experience building AI agents, RAG systems or LLM-powered applications in production
  • Demonstrated experience leading technical teams and managing complex AI programmes
  • Strong hands-on experience across ML infrastructure, distributed systems and scalable AI architecture
  • Experience building and governing AI agent platforms including endpoints, gateways and tool orchestration
  • Familiarity with MCP servers and emerging agent communication standards and protocols
  • Experience defining evaluation frameworks, safety mechanisms and governance for LLM and agent-based systems
  • Deep knowledge of Python, modern AI/ML frameworks and scalable AI platforms including Databricks
  • Strong expertise in Kubernetes and cloud-native production environments
Job Responsibility
  • Lead the development and productionisation of ML models, LLM-powered systems and agent-based applications
  • Define and build end-to-end MLOps including CI/CD, model registry, monitoring, drift detection and retraining for predictive ML systems
  • Establish LLMOps standards including context engineering, automated evaluation pipelines, red teaming, safeguards and policy guardrails
  • Architect and build AI agent workflows, endpoints, gateways and orchestration layers enabling secure tool access, structured reasoning and multi-agent collaboration
  • Design and govern MCP servers and modern agent communication protocols to ensure interoperability, security and scalability
  • Implement strong observability across ML and GenAI systems including reliability, latency, evaluation metrics, usage tracking and cost control
  • Drive scalable ML infrastructure, feature stores and data platforms on Databricks
  • Oversee Kubernetes-based deployments and cloud-native AI infrastructure
  • Partner with senior stakeholders to prioritise and deliver multiple high-impact AI initiatives
  • Coach and grow a high-performing AI engineering team
What we offer
  • Individualised, ongoing Learning & Development via communities of practice
  • Innovation Days
  • Dedicated Engineering Days
  • Access to 'LinkedIn Learning' for ongoing skills development
  • Women in PM&E group
  • Exclusive Staff Discounts
  • Travel Discounts
  • Career opportunities in a network of brands and businesses across the globe
  • Corporate Health Discounts
  • Mental Health Support and Employee Assistance Program for staff and family
Employment Type:
Fulltime

Senior / Lead AI Engineer

Omio is building the future of travel. We’re moving from manual, rule-based syst...
Location
Singapore, Singapore
Salary:
Not provided
FoodLabs & Atlantic Labs
Expiration Date
Until further notice
Requirements
  • 10+ years of experience in software engineering, developing complex models and algorithms
  • Proven track record in designing, implementing, and deploying production-grade AI solutions at scale
  • Strong communication and presentation skills, with the ability to influence and collaborate effectively with non-technical stakeholders
  • Self-motivated and capable of working independently, driving initiatives with minimal supervision
  • Prior experience deploying scalable AI and LLM-based solutions for real-time, high-performance systems is highly desirable
  • Experience with diverse model evaluation techniques (quantitative and qualitative) and an iterative approach to improving AI system performance and user outcomes
  • Expertise in building AI applications using large language models such as OpenAI, Claude, Gemini, LLaMA
  • Experience with LLM orchestration and serving frameworks like LangChain, LangGraph, vLLM, LMDeploy
  • Strong programming skills in Java, Python, and SQL
  • Familiarity with data preprocessing, feature engineering, model evaluation, MLOps, and LLMOps best practices
Job Responsibility
  • Develop AI solutions leveraging LLMs to improve productivity and deliver strong business impact
  • Lead the end-to-end development lifecycle from ideation to deployment of AI-powered solutions across various domains
  • Build scalable AI systems that support Omio’s global expansion goals
  • Act as an evangelist for AI adoption by demonstrating clear value to stakeholders
  • Collaborate with Business, Product, and Engineering teams to integrate AI into workflows and drive adoption
  • Present models, results, and systems to both technical and non-technical audiences, including C-level stakeholders
Employment Type:
Fulltime

Senior Lead AI Engineer

Omio is building the future of travel. We’re moving from manual, rule-based syst...
Location
Singapore, Singapore
Salary:
Not provided
FoodLabs & Atlantic Labs
Expiration Date
Until further notice
Requirements
  • 10+ years of experience in software engineering, developing complex models and algorithms
  • Proven track record in designing, implementing, and deploying production-grade AI solutions at scale
  • Strong communication and presentation skills, with the ability to influence and collaborate effectively with non-technical stakeholders
  • Self-motivated and capable of working independently, driving initiatives with minimal supervision
  • Prior experience deploying scalable AI and LLM-based solutions for real-time, high-performance systems is highly desirable
  • Experience with diverse model evaluation techniques (quantitative and qualitative) and an iterative approach to improving AI system performance and user outcomes
  • Expertise in building AI applications using large language models such as OpenAI, Claude, Gemini, LLaMA
  • Experience with LLM orchestration and serving frameworks like LangChain, LangGraph, vLLM, LMDeploy
  • Strong programming skills in Java, Python, and SQL
  • Familiarity with data preprocessing, feature engineering, model evaluation, MLOps, and LLMOps best practices
Job Responsibility
  • Develop AI solutions leveraging LLMs to improve productivity and deliver strong business impact
  • Lead the end-to-end development lifecycle from ideation to deployment of AI-powered solutions across various domains
  • Build scalable AI systems that support Omio’s global expansion goals
  • Act as an evangelist for AI adoption by demonstrating clear value to stakeholders
  • Collaborate with Business, Product, and Engineering teams to integrate AI into workflows and drive adoption
  • Present models, results, and systems to both technical and non-technical audiences, including C-level stakeholders
Employment Type:
Fulltime

Senior AI Engineer

Join our Digital & Data team working alongside product, design and a wide range ...
Location
United Kingdom, London
Salary:
Not provided
PA Consulting
Expiration Date
Until further notice
Requirements
  • Software engineers with a passion for AI – or data scientists who’ve embraced engineering
  • People who experiment, prototype, and explore emerging AI tools in their own time
  • Strong foundation in a language suited to AI system development and data workflows
  • Experience integrating models with APIs, data sources, or production systems
  • Curiosity about LLMs, RAG pipelines (Graph & Vector based), and agent frameworks
  • Understanding of cloud-native (AWS / GCP / Azure) and DevOps / DevSecOps practices
  • A collaborative mindset and willingness to share, learn, and teach
  • Understanding of prompt and context engineering and model evaluation
  • Solid grasp of distributed systems, microservices, and RESTful APIs
  • An understanding of LLMOps tools for managing GenAI workflows
Job Responsibility
  • Work to agile best practices and cross-functionally with multiple teams and stakeholders
  • Using technical skills to problem solve with clients
  • Working on internal projects
  • Experimenting with Generative AI frameworks and tools such as LangChain, LlamaIndex, Hugging Face, and APIs from OpenAI and Anthropic
  • Building retrieval-augmented generation (RAG) prototypes with vector stores and knowledge graphs
  • Developing and testing agentic architectures through our own Genie Platform
  • Exploring LLMOps, evaluation tools, and model observability platforms like TruLens and LangSmith
  • Deploying solutions on modern cloud and DevOps environments (AWS, Azure, GCP)
What we offer
  • Health and lifestyle perks accompanying private healthcare for you and your family
  • 25 days annual leave (plus a bonus half day on Christmas Eve) with the opportunity to buy 5 additional days
  • Generous company pension scheme
  • Opportunity to get involved with community and charity-based initiatives
  • Annual performance-based bonus
  • PA share ownership
  • Tax efficient benefits (cycle to work, give as you earn)
Employment Type:
Fulltime

Senior AI Engineer

Join our Digital & Data team working alongside product, design and a wide range ...
Location
United Kingdom, Belfast
Salary:
Not provided
PA Consulting
Expiration Date
Until further notice
Requirements
  • Software engineers with a passion for AI – or data scientists who’ve embraced engineering
  • People who experiment, prototype, and explore emerging AI tools in their own time
  • Strong foundation in a language suited to AI system development and data workflows
  • Experience integrating models with APIs, data sources, or production systems
  • Curiosity about LLMs, RAG pipelines (Graph & Vector based), and agent frameworks
  • Understanding of cloud-native (AWS / GCP / Azure) and DevOps / DevSecOps practices
  • A collaborative mindset and willingness to share, learn, and teach
  • Understanding of prompt and context engineering and model evaluation
  • Solid grasp of distributed systems, microservices, and RESTful APIs
  • An understanding of LLMOps tools for managing GenAI workflows
Job Responsibility
  • Work to agile best practices and cross-functionally with multiple teams and stakeholders
  • Using technical skills to problem solve with our clients
  • Working on internal projects
  • Experimenting with Generative AI frameworks and tools such as LangChain, LlamaIndex, Hugging Face, and APIs from OpenAI and Anthropic
  • Building retrieval-augmented generation (RAG) prototypes with vector stores and knowledge graphs
  • Developing and testing agentic architectures through our own Genie Platform
  • Exploring LLMOps, evaluation tools, and model observability platforms like TruLens and LangSmith
  • Deploying solutions on modern cloud and DevOps environments (AWS, Azure, GCP)
What we offer
  • Health and lifestyle perks accompanying private healthcare for you and your family
  • 25 days annual leave (plus a bonus half day on Christmas Eve) with the opportunity to buy 5 additional days
  • Generous company pension scheme
  • Opportunity to get involved with community and charity-based initiatives
  • Annual performance-based bonus
  • PA share ownership
  • Tax efficient benefits (cycle to work, give as you earn)
  • Budget to take courses (technical and non-technical training) and gain certifications
Employment Type:
Fulltime

Senior Machine Learning Engineer

As our Senior ML Engineer, you will play a key role in delivering high impact, f...
Location
Australia, South Bank
Salary:
Not provided
Flight Centre Brand
Expiration Date
Until further notice
Requirements
  • 5+ years delivering production-grade ML or AI systems with commercial impact
  • Strong hands-on experience across the AI/ML lifecycle, infrastructure and scalable AI architecture
  • Experience building AI agents, RAG systems or LLM-powered applications in production
  • Familiarity with agent endpoints, gateways and tool orchestration patterns
  • Exposure to MCP servers and emerging agent protocols
  • Experience implementing evaluation frameworks and safety mechanisms for LLM systems
  • Deep knowledge of Python and modern AI/ML frameworks
  • Experience working with Databricks
  • Strong understanding of Kubernetes and distributed production systems
Job Responsibility
  • Design, develop and productionise ML models, LLM-powered systems and agent-based applications
  • Implement end-to-end MLOps pipelines including CI/CD, model registry, monitoring, drift detection and retraining for predictive ML systems
  • Build LLMOps workflows including context engineering, automated evaluation, safeguards and guardrails
  • Develop AI agent workflows, endpoints, gateways and orchestration layers enabling secure tool usage and structured reasoning
  • Contribute to MCP server implementation and modern agent communication protocols
  • Implement strong observability across ML and GenAI systems including reliability, latency, evaluation metrics, usage and cost tracking
  • Build scalable ML infrastructure and AI/ML-optimised workflows
  • Deploy and optimise workloads on Kubernetes and cloud-native environments
  • Collaborate with product and business stakeholders to deliver high-impact AI solutions
What we offer
  • Individualised, ongoing Learning & Development via communities of practice
  • Innovation Days
  • Dedicated Engineering Days
  • Access to 'LinkedIn Learning' for ongoing skills development
  • Women in PM&E group
  • Exclusive Staff Discounts
  • Travel Discounts
  • Career opportunities in a network of brands and businesses across the globe
  • Corporate Health Discounts
  • Mental Health Support and Employee Assistance Program for staff and family
Employment Type:
Fulltime

Senior AI Engineer

Omio is building the future of travel. We’re moving from manual, rule-based syst...
Location
Singapore, Singapore
Salary:
Not provided
FoodLabs & Atlantic Labs
Expiration Date
Until further notice
Requirements
  • 6+ years of experience in developing complex models and algorithms
  • Proven track record in designing and deploying production-grade AI solutions
  • Strong communication and presentation skills, with the ability to influence non-technical stakeholders
  • Self-motivated, independent contributor capable of driving initiatives with minimal guidance
  • Prior experience deploying scalable AI and LLM-based solutions for real-time, high-performance systems is highly desirable
  • Expertise in building AI applications using large language models such as OpenAI, Claude, Gemini, LLaMA
  • Experience with LLM orchestration and serving frameworks like LangChain, LangGraph, vLLM, LMDeploy
  • Strong programming skills in Java, Python, and SQL
  • Familiarity with data preprocessing, feature engineering, model evaluation, MLOps, and LLMOps best practices
  • Ability to construct Retrieval-Augmented Generation (RAG) systems for advanced AI workflows
Job Responsibility
  • Develop AI solutions leveraging LLMs to improve productivity and deliver strong business impact
  • Lead the end-to-end development lifecycle from ideation to deployment of AI-powered solutions across various domains
  • Build scalable AI systems that support Omio’s global expansion goals
  • Act as an evangelist for AI adoption by demonstrating clear value to stakeholders
  • Collaborate with Business, Product, and Engineering teams to integrate AI into workflows and drive adoption
  • Present models, results, and systems to both technical and non-technical audiences, including C-level stakeholders
Employment Type:
Fulltime

Principal Research Engineer

As a Principal Research Engineer at Microsoft, you will set the technical vision...
Location
United States, Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • PhD in AI/ML or related field with top-venue publications and/or patents
  • Experience architecting and deploying LLMs/multimodal models and multi-agent systems in production at scale
  • Familiarity with Responsible AI frameworks and bias-mitigation techniques
  • Demonstrated ability to shape product strategy and drive organizational change
  • Experience with Microsoft’s LLMOps stack: Azure AI Foundry, Azure Machine L
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Define and execute technical strategy for foundational models, multi-agent systems, and next-generation Copilot experiences, especially within Business & Industry Copilot
  • Lead cross-team efforts to deliver scalable, reliable, and responsible AI systems
  • Advance the state of the art and translate breakthroughs into measurable customer and business impact
  • Architect and deliver complex AI systems across model development, data, infra, evaluation, and deployment spanning multiple product lines
  • Set technical direction for large programs; drive alignment across Research, Engineering, and Product
  • Integrate LLMs, multimodal models, multi-agent architectures, and RAG into Microsoft’s ecosystem
  • Establish best practices for MLOps, governance, and Responsible AI, compliant with Microsoft principles and industry standards
  • Drive original research and thought leadership (whitepapers, internal notes, patents); convert insights into shipped capabilities
Employment Type:
Fulltime