CrawlJobs Logo

Senior LLMOps Engineer

heidihealth.com Logo

Heidi

Location Icon

Location:
Australia , Sydney

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Working closely with our Engineering Manager, you’ll be a Senior LLMOps Engineer on the Model Platform team. You are a technical leader responsible for building and scaling the infrastructure that powers our entire model lifecycle. Your mission is to build a robust, scalable, and reliable platform for deploying and managing our LLMs. You will lead the design and implementation of our LLMOps strategy, ensuring our AI engineers can move models from development to production seamlessly and efficiently. You will combine your deep infrastructure knowledge with MLOps principles to solve the critical challenges of serving models at scale.

Job Responsibility:

  • Lead the architecture, design, and implementation of our end-to-end LLMOps platform, from data ingestion and model training pipelines to production deployment and monitoring
  • Build and maintain robust CI/CD/CT (Continuous Integration/Continuous Delivery/Continuous Training) pipelines to automate the testing, validation, and deployment of large language models
  • Engineer highly available and scalable model serving solutions using modern infrastructure like Kubernetes, ensuring low latency and high throughput for our production services
  • Collaborate closely with AI research and engineering teams to understand their needs, streamline workflows, and create the tooling that accelerates their development cycles
  • Champion and implement best practices for model versioning, experiment tracking, monitoring, and governance across the organization
  • Mentor mid-level and junior engineers, sharing your deep expertise in infrastructure, automation, and operational excellence to foster a culture of reliability and scalability

Requirements:

  • Proven track record of designing, building, and maintaining MLOps or LLMOps infrastructure in a production environment
  • Previous hands-on experience building scalable, cloud-native infrastructure and platforms
  • Deployed and managed large-scale machine learning models in a production environment
  • Expert in Python, cloud platforms (AWS, GCP, or Azure), containerization (Docker, Kubernetes), and Infrastructure as Code (e.g., Terraform, CloudFormation)
  • Deep and practical understanding of the entire machine learning lifecycle and the specific operational challenges of large language models
  • Ability to translate complex engineering and research requirements into concrete, robust, and automated platform solutions
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field, or equivalent practical experience

Nice to have:

  • Experience with advanced model serving and optimization techniques (e.g., quantization, distillation, multi-model serving)
  • Experience with specialized MLOps frameworks like MLflow, Kubeflow, or Weights & Biases
  • Contributions to open-source MLOps or infrastructure-related projects
What we offer:
  • Flexible hybrid working environment, with 3 days in the office
  • Additional paid day off for your birthday and wellness days
  • Special corporate rates at Anytime Fitness in Melbourne, Sydney tbc
  • A generous personal development budget of $500 per annum
  • Learn from some of the best engineers and creatives, joining a diverse team
  • Become an owner, with shares (equity) in the company

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:
PREMIUM
More languages and countries
+ Unlock 31694 hidden job offers
Languages
English Čeština Deutsch Ελληνικά Español Français +15
Countries
United States United Kingdom India Canada Australia +
See plans
Plans from $2.99 / month

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior LLMOps Engineer

Senior Machine Learning Engineer (Team Lead)

As our Artificial Intelligence (AI) and Machine Learning (ML) Team Leader, you w...
Location
Location
Australia , South Bank
Salary
Salary:
Not provided
fctgcareers.com Logo
Flight Centre Brand
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years delivering production grade ML or AI systems with proven commercial impact
  • 3+ years Leading and Mentoring engineers
  • Experience building AI agents, RAG systems or LLM powered applications in production
  • Demonstrated experience leading technical teams and managing complex AI programmes
  • Strong hands on experience across ML infrastructure, distributed systems and scalable AI architecture
  • Experience building and governing AI agent platforms including endpoints, gateways and tool orchestration
  • Familiarity with MCP servers and emerging agent communication standards and protocols
  • Experience defining evaluation frameworks, safety mechanisms and governance for LLM and agent based systems
  • Deep knowledge of Python, modern AI/ML frameworks and scalable AI platforms including Databricks
  • Strong expertise in Kubernetes and cloud native production environments
Job Responsibility
Job Responsibility
  • Lead the development and productionisation of ML models, LLM powered systems and agent based applications
  • Define and build end to end MLOps including CI CD, model registry, monitoring, drift detection and retraining for predictive ML systems
  • Establish LLMOps standards including context engineering, automated evaluation pipelines, red teaming, safeguards and policy guardrails
  • Architect and build AI agent workflows, endpoints, gateways and orchestration layers enabling secure tool access, structured reasoning and multi agent collaboration
  • Design and govern MCP servers and modern agent communication protocols to ensure interoperability, security and scalability
  • Implement strong observability across ML and GenAI systems including reliability, latency, evaluation metrics, usage tracking and cost control
  • Drive scalable ML infrastructure, feature stores and data platforms on Databricks
  • Oversee Kubernetes based deployments and cloud native AI infrastructure
  • Partner with senior stakeholders to prioritise and deliver multiple high impact AI initiatives
  • Coach and grow a high performing AI engineering team
What we offer
What we offer
  • Individualised, ongoing Learning & Development via communities of practice
  • Innovation Days
  • Dedicated Engineering Days
  • Access to 'LinkedIn Learning' for ongoing skills development
  • Women in PM&E group
  • Exclusive Staff Discounts
  • Travel Discounts
  • Career opportunities in a network of brands and businesses across the globe
  • Corporate Health Discounts
  • Mental Health Support and Employee Assistance Program for staff and family
  • Fulltime
Read More
Arrow Right

Senior / Lead AI Engineer

Omio is building the future of travel. We’re moving from manual, rule-based syst...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
foodlabs.com Logo
FoodLabs & Atlantic Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 10+ years of experience in software engineering, developing complex models and algorithms
  • Proven track record in designing, implementing, and deploying production-grade AI solutions at scale
  • Strong communication and presentation skills, with the ability to influence and collaborate effectively with non-technical stakeholders
  • Self-motivated and capable of working independently, driving initiatives with minimal supervision
  • Prior experience deploying scalable AI and LLM-based solutions for real-time, high-performance systems is highly desirable
  • Experience with diverse model evaluation techniques (quantitative and qualitative) and an iterative approach to improving AI system performance and user outcomes
  • Expertise in building AI applications using large language models such as OpenAI, Claude, Gemini, LLaMA
  • Experience with LLM orchestration frameworks like LangChain, LangGraph, vLLM, LMDeploy
  • Strong programming skills in Java, Python, and SQL
  • Familiarity with data preprocessing, feature engineering, model evaluation, MLOps, and LLMOps best practices
Job Responsibility
Job Responsibility
  • Develop AI solutions leveraging LLMs to improve productivity and deliver strong business impact
  • Lead the end-to-end development lifecycle from ideation to deployment of AI-powered solutions across various domains
  • Build scalable AI systems that support Omio’s global expansion goals
  • Act as an evangelist for AI adoption by demonstrating clear value to stakeholders
  • Collaborate with Business, Product, and Engineering teams to integrate AI into workflows and drive adoption
  • Present models, results, and systems to both technical and non-technical audiences, including C-level stakeholders
  • Fulltime
Read More
Arrow Right

Senior Lead AI Engineer

Omio is building the future of travel. We’re moving from manual, rule-based syst...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
foodlabs.com Logo
FoodLabs & Atlantic Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 10+ years of experience in software engineering, developing complex models and algorithms
  • Proven track record in designing, implementing, and deploying production-grade AI solutions at scale
  • Strong communication and presentation skills, with the ability to influence and collaborate effectively with non-technical stakeholders
  • Self-motivated and capable of working independently, driving initiatives with minimal supervision
  • Prior experience deploying scalable AI and LLM-based solutions for real-time, high-performance systems is highly desirable
  • Experience with diverse model evaluation techniques (quantitative and qualitative) and an iterative approach to improving AI system performance and user outcomes
  • Expertise in building AI applications using large language models such as OpenAI, Claude, Gemini, LLaMA
  • Experience with LLM orchestration frameworks like LangChain, LangGraph, vLLM, LMDeploy
  • Strong programming skills in Java, Python, and SQL
  • Familiarity with data preprocessing, feature engineering, model evaluation, MLOps, and LLMOps best practices
Job Responsibility
Job Responsibility
  • Develop AI solutions leveraging LLMs to improve productivity and deliver strong business impact
  • Lead the end-to-end development lifecycle from ideation to deployment of AI-powered solutions across various domains
  • Build scalable AI systems that support Omio’s global expansion goals
  • Act as an evangelist for AI adoption by demonstrating clear value to stakeholders
  • Collaborate with Business, Product, and Engineering teams to integrate AI into workflows and drive adoption
  • Present models, results, and systems to both technical and non-technical audiences, including C-level stakeholders
  • Fulltime
Read More
Arrow Right

Senior Engineer, Machine Learning Engineering

Mastercard’s Business & Market Insights (B&MI) group empowers organizations to a...
Location
Location
India , Pune
Salary
Salary:
Not provided
mastercard.com Logo
Mastercard
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s/bachelor’s degree in computer science or engineering
  • Considerable work experience with a proven track-record of successfully leading and managing complex projects/products and delivering to aggressive market needs
  • Expert-level hands on experience designing, building and deploying both conventional AI/ML solutions and LLM/Agentic solutions
  • Strong analytical and problem-solving abilities, with quick adaptation to new technologies, methodologies, and systems
  • Strong applied knowledge and hands on experience in advanced statistical techniques, predictive modelling, machine learning algorithms, GenAI and deep learning frameworks
  • Experience with AI and machine learning platforms such as TensorFlow, PyTorch, or similar
  • Strong programming skills in languages such as Python/SQL is a must
  • Experience with data visualization tools (e.g., Tableau, Power BI) and understanding of cloud computing services (AWS, Azure, GCP) related to data processing and storage is a plus
Job Responsibility
Job Responsibility
  • Implement multi-agent intelligence frameworks (LangGraph, CrewAI, AutoGen) to enable reasoning, coordination, and adaptive decision-making across specialized AI agents
  • Design and operationalize multi-modal AI pipelines combining text, image, tabular, and graph data using transformer-based architectures (BERT, CLIP, LLaVA, T5, Whisper, etc.) for unified intelligence
  • Build scalable RAG and Graph-RAG systems integrating vector stores and knowledge graphs (Neo4j, AWS Neptune) to enable contextual retrieval, semantic linking, and entity-aware reasoning
  • Develop and productionize transformer-based models for NLP, vision-language understanding, and sequential prediction tasks leveraging Hugging Face, PyTorch, and TensorFlow ecosystems
  • Implement advanced Python-based backend services for inference orchestration, async job handling, and distributed data workflows supporting high-throughput AI operations
  • Establish end-to-end LLMOps and MLOps pipelines on Databricks (AWS) integrating MLflow, feature stores, model lineage, prompt evaluation, and continuous retraining frameworks
  • Apply traditional AI/ML and statistical modeling techniques (regression, clustering, forecasting, ensemble methods) alongside deep learning models for hybrid interpretability and explainability
  • Engineer state and memory management subsystems that preserve context, track embeddings, and enable agents to reason temporally across multiple modalities and interactions
  • Implement Responsible AI practices—bias detection, explainability dashboards, data ethics checks, and performance governance ensuring fairness and transparency of deployed models
  • Continuously research, benchmark, and productionize innovations in multimodal transformers, generative modeling, and agentic orchestration to drive enterprise-scale intelligence and automation
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

Join our Digital & Data team working alongside product, design and a wide range ...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
paconsulting.com Logo
PA Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Software engineers with a passion for AI – or data scientists who’ve embraced engineering
  • People who experiment, prototype, and explore emerging AI tools in their own time
  • Strong foundation in a language suited to AI system development and data workflows
  • Experience integrating models with APIs, data sources, or production systems
  • Curiosity about LLMs, RAG pipelines (Graph & Vector based), and agent frameworks
  • Understanding of cloud-native (AWS / GCP / Azure) and DevOps / DevSecOps practices
  • A collaborative mindset and willingness to share, learn, and teach
  • Understanding of prompt and context engineering and model evaluation
  • Solid grasp of distributed systems, microservices, and RESTful APIs
  • An understanding of LLMOps tools for managing GenAI workflows
Job Responsibility
Job Responsibility
  • Work to agile best practices and cross-functionally with multiple teams and stakeholders
  • Using technical skills to problem solve with clients
  • Working on internal projects
  • Experimenting with Generative AI frameworks and tools such as LangChain, LlamaIndex, Hugging Face, and APIs from OpenAI and Anthropic
  • Building retrieval-augmented generation (RAG) prototypes with vector stores and knowledge graphs
  • Developing and testing agentic architectures through our own Genie Platform
  • Exploring LLMOps, evaluation tools, and model observability platforms like TruLens and LangSmith
  • Deploying solutions on modern cloud and DevOps environments (AWS, Azure, GCP)
What we offer
What we offer
  • Health and lifestyle perks accompanying private healthcare for you and your family
  • 25 days annual leave (plus a bonus half day on Christmas Eve) with the opportunity to buy 5 additional days
  • Generous company pension scheme
  • Opportunity to get involved with community and charity-based initiatives
  • Annual performance-based bonus
  • PA share ownership
  • Tax efficient benefits (cycle to work, give as you earn)
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

Join our Digital & Data team working alongside product, design and a wide range ...
Location
Location
United Kingdom , Belfast
Salary
Salary:
Not provided
paconsulting.com Logo
PA Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Software engineers with a passion for AI – or data scientists who’ve embraced engineering
  • People who experiment, prototype, and explore emerging AI tools in their own time
  • Strong foundation in a language suited to AI system development and data workflows
  • Experience integrating models with APIs, data sources, or production systems
  • Curiosity about LLMs, RAG pipelines (Graph & Vector based), and agent frameworks
  • Understanding of cloud-native (AWS / GCP / Azure) and DevOps / DevSecOps practices
  • A collaborative mindset and willingness to share, learn, and teach
  • Understanding of prompt and context engineering and model evaluation
  • Solid grasp of distributed systems, microservices, and RESTful APIs
  • An understanding of LLMOps tools for managing GenAI workflows
Job Responsibility
Job Responsibility
  • Work to agile best practices and cross-functionally with multiple teams and stakeholders
  • Using technical skills to problem solve with our clients
  • Working on internal projects
  • Experimenting with Generative AI frameworks and tools such as LangChain, LlamaIndex, Hugging Face, and APIs from OpenAI and Anthropic
  • Building retrieval-augmented generation (RAG) prototypes with vector stores and knowledge graphs
  • Developing and testing agentic architectures through our own Genie Platform
  • Exploring LLMOps, evaluation tools, and model observability platforms like TruLens and LangSmith
  • Deploying solutions on modern cloud and DevOps environments (AWS, Azure, GCP)
What we offer
What we offer
  • Health and lifestyle perks accompanying private healthcare for you and your family
  • 25 days annual leave (plus a bonus half day on Christmas Eve) with the opportunity to buy 5 additional days
  • Generous company pension scheme
  • Opportunity to get involved with community and charity-based initiatives
  • Annual performance-based bonus
  • PA share ownership
  • Tax efficient benefits (cycle to work, give as you earn)
  • Budget to take courses (technical and non-technical training) and gain certifications
  • Fulltime
Read More
Arrow Right

Principal AI Engineer

As a Principal AI Engineer on the AI Foundations team, you are an established su...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
mastercard.com Logo
Mastercard
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, Data Science, Applied Mathematics, or related technical field
  • advanced degree preferred
  • Strong foundation in software engineering, distributed systems, and applied machine learning relevant to production AI systems
  • Demonstrated understanding of responsible AI, model/system risk, privacy/security considerations, and governance requirements for enterprise deployments
  • Demonstrated, sustained ownership of production AI/ML systems, including design, build, deployment, and ongoing lifecycle operations
  • Real-world experience shipping complex agentic systems into production, including multi-agent coordination and multi-tool integration with safe action policies
  • Hands-on experience building production pipelines for evaluation, monitoring, versioning, and continuous improvement (including retraining or policy/guardrail updates)
  • Proven ability to define and operationalize observability and reliability practices for agentic systems (traceability, telemetry, SLOs, incident management)
  • Track record of influencing architecture and standards across multiple teams or programs, and mentoring engineers to raise overall engineering rigor
Job Responsibility
Job Responsibility
  • Serve as an established subject matter expert in AI Engineering, influencing stakeholders and shaping technical direction across multiple initiatives
  • Architect, design, develop, and maintain advanced AI/ML systems, with emphasis on complex agentic solutions (multi-agent orchestration, tool/function-calling, memory, reflection/self-correction, and autonomy policies)
  • Lead production implementation of agentic AI systems, including scalable training and evaluation pipelines, deployment frameworks, and runtime orchestration patterns
  • Define and implement safe tool-use patterns: structured outputs, robust error handling, permissioning and auditability, human-in-the-loop (HITL) approval steps for sensitive actions, and guardrail enforcement
  • Establish end-to-end AgentOps/LLMOps practices for agentic systems: release pipelines for prompts/tools/policies, canary strategies, safe rollback mechanisms, and continuous regression/safety evaluations as release gates
  • Build and optimize data ingestion, preprocessing, feature/embedding engineering, and retrieval/memory workflows to improve grounding quality and reduce failure modes
  • Own production observability for agentic systems: trace capture, cost/token telemetry, latency and reliability SLOs, and incident response practices for agent failures
  • Implement drift detection and performance decay monitoring (data drift, concept drift), and automate model/agent retraining, policy updates, and redeployment to maintain output quality over time
  • Drive measurable improvements in system effectiveness, safety, and efficiency by defining success metrics (task success, intervention rate, policy violations, cost and latency per task) and continuously improving evaluation coverage
  • Mentor and grow senior and junior engineers through design reviews, code reviews, hands-on coaching, and the creation of reusable patterns, playbooks, and standards for agentic delivery
  • Fulltime
Read More
Arrow Right

Senior Machine Learning Engineer

As our Senior ML Engineer, you will play a key role in delivering high impact, f...
Location
Location
Australia , South Bank
Salary
Salary:
Not provided
fctgcareers.com Logo
Flight Centre Brand
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years delivering production grade ML or AI systems with commercial impact
  • Strong hands on experience across AI/ML lifecycle, infrastructure and scalable AI architecture
  • Experience building AI agents, RAG systems or LLM powered applications in production
  • Familiarity with agent endpoints, gateways and tool orchestration patterns
  • Exposure to MCP servers and emerging agent protocols
  • Experience implementing evaluation frameworks and safety mechanisms for LLM systems
  • Deep knowledge of Python and modern AI/ML frameworks
  • Experience working with Databricks
  • Strong understanding of Kubernetes and distributed production systems
Job Responsibility
Job Responsibility
  • Design, develop and productionise ML models, LLM powered systems and agent based applications
  • Implement end to end MLOps pipelines including CI CD, model registry, monitoring, drift detection and retraining for predictive ML systems
  • Build LLMOps workflows including context engineering, automated evaluation, safeguards and guardrails
  • Develop AI agent workflows, endpoints, gateways and orchestration layers enabling secure tool usage and structured reasoning
  • Contribute to MCP server implementation and modern agent communication protocols
  • Implement strong observability across ML and GenAI systems including reliability, latency, evaluation metrics, usage and cost tracking
  • Build scalable ML infrastructure, and AI/ML optimised workflows
  • Deploy and optimise workloads on Kubernetes and cloud native environments
  • Collaborate with product and business stakeholders to deliver high impact AI solutions
What we offer
What we offer
  • Individualised, ongoing Learning & Development via communities of practice
  • Innovation Days
  • Dedicated Engineering Days
  • Access to 'LinkedIn Learning' for ongoing skills development
  • Women in PM&E group
  • Exclusive Staff Discounts
  • Travel Discounts
  • Career opportunities in a network of brands and businesses across the globe
  • Corporate Health Discounts
  • Mental Health Support and Employee Assistance Program for staff and family
  • Fulltime
Read More
Arrow Right