
Senior Machine Learning Operations Engineer

Australia, Melbourne · Job Posted January 11, 2026

Job Description

Passionate about building and deploying machine learning pipelines at scale to drive business value? Join our growing Data Science team as our first Senior Machine Learning Operations (MLOps) Engineer! As a Senior MLOps Engineer, you will work within our collaborative Data Science team to help deliver and accelerate multiple machine learning projects across our organisation. In this role, you will enhance our machine learning operations (MLOps) by delivering robust, scalable AWS cloud infrastructure and automation solutions that empower our data science team. You will have the opportunity to work with petabyte-scale data across our global platforms, directly impacting millions of users.

Job Responsibility

  • Lead the design, implementation, and maintenance of end-to-end ML infrastructure and automation solutions, from development to deployment and production monitoring
  • Drive cloud infrastructure and architectural decisions supporting large-scale ML workloads, leveraging Infrastructure as Code (IaC), particularly using Terraform
  • Implement and maintain CI/CD pipelines, ensuring efficient model integration, deployment, and continuous delivery
  • Build and optimise monitoring, alerting and logging to ensure model reliability, performance and compliance
  • Collaborate closely with data scientists and stakeholders to identify infrastructure needs, streamline workflows, and effectively communicate complex technical concepts
  • Provide mentorship and technical guidance to junior MLOps engineers and data scientists to promote best practices in ML infrastructure

Requirements

  • 5+ years of experience in MLOps, DevOps, Data Engineering and/or cloud infrastructure roles, preferably supporting data science or Machine Learning teams
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field
  • Expert proficiency in cloud infrastructure management using Terraform
  • Deep hands-on experience with major cloud platforms (AWS, Azure or GCP)
  • Strong experience in building and maintaining CI/CD pipelines specifically for ML workloads
  • Proficiency with containerisation technologies (Docker, Kubernetes)
  • Advanced proficiency in Python and scripting for infrastructure automation

Nice to have

  • Experience within iGaming
  • Experience working with large volumes of data, preferably at petabyte-scale
  • Extensive experience with distributed computing and big data technologies (e.g. Spark, Hadoop)
  • Familiarity with monitoring and observability platforms
  • Knowledge of data security, governance, and compliance practices relevant to ML operations

What we offer

  • In-house baristas serving free coffee, tea, fresh juices, and smoothies
  • Daily catered breakfast and regular company-wide events
  • Snack walls and drink fridges on every floor
  • Fun, modern office spaces with pool tables, table tennis, gaming consoles, and an F1 simulator
  • Access to our Employee Assistance Program for you and your loved ones
  • 9,000+ courses on our Learning & Development platform
  • One paid volunteer day per year
  • Weekly Wednesday massages by professional masseuses
  • Team budgets for lunches and activities to celebrate achievements
  • Social sports teams and participation in Corporate Games
  • Easygo-branded swag
  • Birthday and work anniversary gift vouchers, plus a chance to win prizes
  • Company-wide talks with key partners such as Everton FC and Team Sauber in Formula 1
  • Office visits from big-name streamers
  • Meet ambassadors like Alex Pereira and Israel Adesanya
  • Ballots for exclusive tickets to events like Formula 1, UFC, and more sporting and music events

Similar Jobs for Senior Machine Learning Operations Engineer

8 matching positions

Senior Machine Learning Engineer, AI Platform

The AI Platform team is responsible for building the foundational infrastructure...
Location
United States; Canada
Salary:
139000.00 - 218000.00 USD / Year
Mozilla
Expiration Date
Until further notice
Requirements
  • Bachelor’s degree with 4–6 years of relevant industry experience, or Master’s degree with significant hands-on experience building and operating production ML systems, or equivalent work experience
  • Strong experience developing in Python for machine learning systems, backend services, or distributed data processing
  • Proven experience deploying and operating ML workloads in cloud environments, including production-grade infrastructure
  • Solid understanding of model serving architectures, inference pipelines, and performance tradeoffs (latency, throughput, cost, scaling strategies)
  • Hands-on experience working with GPU-based workloads and accelerated computing in production settings
  • Experience designing CI/CD pipelines and development workflows that support reliable ML system deployment
  • Ability to independently scope and drive technical initiatives while balancing product and operational priorities
  • Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems
  • Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams
Job Responsibility
  • Design, build, and operate core AI platform components used to train, deploy, and serve machine learning models in production environments
  • Own model serving and inference workflows end-to-end, driving improvements in reliability, scalability, performance, and operational excellence
  • Lead efforts to optimize inference systems for throughput, latency, and cost efficiency across CPU and GPU workloads
  • Design and manage GPU-based inference and training workloads, including performance tuning, capacity planning, and resource utilization optimization
  • Own and improve critical parts of the model lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation
  • Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of ML services and pipelines
  • Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable AI-powered features
  • Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing
  • Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews
What we offer
  • Generous performance-based bonus plans
  • Rich medical, dental, and vision coverage
  • Generous retirement contributions with 100% immediate vesting
  • Quarterly all-company wellness days
  • Country-specific holidays plus a day off for your birthday
  • One-time home office stipend
  • Annual professional development budget
  • Quarterly well-being stipend
  • Considerable paid parental leave
  • Employee referral bonus program
  • Fulltime

Senior Machine Learning Engineer - ML Training Infrastructure

We are seeking an experienced, technical oriented, impact delivering-driven expe...
Location
United States, Mountain View
Salary:
170000.00 - 240000.00 USD / Year
General Motors
Expiration Date
Until further notice
Requirements
  • Bachelor's degree or higher in Computer Science or equivalent major, or equivalent relevant experience
  • 3+ years professional software engineering experience
  • 2+ years specialized experience in AI/ML infrastructure, e.g., enabling distributed training for scaling large ML models
  • Strong programming skills in Python, with proficiency in frameworks such as PyTorch (preferred), TensorFlow, or similar
  • Experience with distributed computing, GPU computing, and cloud environments (AWS, GCP, Azure)
  • Willingness to travel to Sunnyvale, CA as needed
  • Comfortable working in highly ambiguous and dynamic environments
Job Responsibility
  • Design and develop a scalable, reliable, high-performance ML framework to support model training at scale
  • Analyze and optimize model training performance to scale distributed training workflows, maximize resource utilization across heterogeneous hardware environments, and reduce cost
  • Raise the bar on system observability, debuggability, operational excellence, and user experience
  • Collaborate with cross-functional teams to integrate new features and technologies into the platform
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime

Senior Machine Learning Engineer

Location
United States, Raceland
Salary:
Not provided
BOLLINGER MISSISSIPPI SHIPBUILDING LLC
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Information Systems, Engineering, Data Management, or related field
  • Minimum of 6–10 years of experience in ML or software engineering
  • Strong Python and ML deployment experience
  • Experience with cloud ML systems
Job Responsibility
  • Deploy, integrate, and maintain machine learning and AI solutions within enterprise workflows and operational systems
  • Design and develop scalable ML pipelines, feature stores, APIs, and model-serving infrastructure
  • Collaborate with Data Scientists to productionize models and improve deployment readiness
  • Monitor model performance, drift, availability, and reliability across production environments
  • Implement processes for model retraining, versioning, governance, and lifecycle management
  • Partner with Data Engineering teams to support feature engineering and data pipeline integration
  • Ensure ML solutions are secure, scalable, maintainable, and aligned with enterprise architecture standards
  • Support AI applications across forecasting, operational optimization, bidding, scheduling, maintenance, and automation use cases
  • Troubleshoot and resolve issues related to model deployment and operational performance
  • Contribute to ML engineering standards, best practices, and platform improvements
What we offer
  • Competitive Pay
  • Comprehensive Benefits Package
  • Hybrid Schedule Available
  • Career Development
  • Cutting-Edge Projects
  • Positive Work Environment & Company Values
  • Fulltime

Senior Machine Learning Engineer, ML Training Platform

Location
United States
Salary:
216700.00 - 303400.00 USD / Year
Reddit
Expiration Date
Until further notice
Requirements
  • 5+ years of software engineering experience, with a focus on Platform Engineering, ML Infrastructure, or Backend Systems
  • Deep Kubernetes Expertise: You know K8s beyond just 'deploying pods.' You understand CRDs, Controllers and the Operator pattern
  • Jupyter Ecosystem Knowledge: Experience customizing JupyterHub, JupyterLab extensions, or building similar interactive computing platforms
  • Strong Coding Skills: Proficiency in Python (for the ML ecosystem) and Go (for Kubernetes controllers/infrastructure tooling)
  • GPU Experience: Hands-on practice with CUDA environments, GPU virtualization/containerization, and doing it all within Kubernetes
  • Cloud Provider Experience: Familiarity with both managed ML offerings (Vertex AI, SageMaker, etc.) and building custom ML components in AWS and/or GCP
  • Experience working with distributed training frameworks, including Ray and Kubernetes
  • Comfortable with distributed systems, big data (Petabyte scale) and data-intensive systems
  • Strong focus on scalability, reliability, performance, and ease of use. You are an undying advocate for platform users and have a deep intuition for the machine learning development lifecycle
  • Strong organizational & communication skills
Job Responsibility
  • Lead the building, testing, and maintenance of ML training infrastructure at Reddit
  • Play a pivotal role in designing, building, and optimizing the infrastructure and tooling required to support large-scale machine learning workflows
  • Evolve the MLE experience, from provisioning interactive GPU environments through large-scale training, supporting on-demand and self-service workflows
  • Kubernetes Automation: Write custom Kubernetes Controllers and Operators to manage the lifecycle of interactive Jupyter workspaces and long-running ML training jobs, handle auto-idling, and ensure fault tolerance
  • GPU Orchestration: Work with the underlying compute team to ensure MLEs have efficient access to training hardware resources and handle resource contention gracefully
  • Developer Experience (DevX): Treat internal MLEs as your customers. Conduct user research, reduce friction in the 'Idea-to-Prototype' loop, and standardize software environments (Docker images, Python dependency management)
What we offer
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave
  • Paid Volunteer time off
  • Fulltime

Senior Machine Learning Engineer

Reddit is a community of communities. It’s built on shared interests, passion, a...
Location
United States, Remote
Salary:
216700.00 - 303400.00 USD / Year
Reddit
Expiration Date
Until further notice
Requirements
  • 3-5+ years of experience building, deploying, and operating machine learning systems in production
  • Strong programming skills in Python, Java, Go, or similar languages, with solid software engineering fundamentals
  • ML Fundamentals: a strong grasp of algorithms, from classic statistical learning (XGBoost, Random Forests, regressions) to DL architectures (Transformers, CNNs, GNNs)
  • Hands-on experience with modern ML frameworks (e.g., PyTorch, TensorFlow)
  • Experience designing scalable ML pipelines, data processing systems, and model serving infrastructure
  • Ability to work cross-functionally and translate ambiguous product or business problems into technical solutions
  • Experience improving measurable metrics through applied machine learning
Job Responsibility
  • Design, build, and deploy production-grade machine learning models and systems at scale
  • Own the full ML lifecycle: from problem definition and feature engineering to training, evaluation, deployment, and monitoring
  • Build scalable data and model pipelines with strong reliability, observability, and automated retraining
  • Work with large-scale datasets to improve ranking, recommendations, search relevance, prediction, content/user understanding, and optimization systems
  • Partner cross-functionally with Product, Data Science, Infrastructure, and Engineering teams to translate complex problems into ML solutions
  • Improve system performance across latency, throughput, and model quality metrics
  • Research and apply state-of-the-art machine learning and AI techniques, including deep learning, graph- and transformer-based models, and LLM evaluation/alignment
  • Contribute to technical strategy, architecture, and long-term ML roadmap
What we offer
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k with Employer Match
  • Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Paid Volunteer Time Off
  • Generous Paid Parental Leave
  • Medical, dental, and vision insurance
  • Fulltime

Senior Machine Learning Engineer

Reddit is a community of communities. It’s built on shared interests, passion, a...
Location
Canada, Ontario
Salary:
Not provided
Reddit
Expiration Date
Until further notice
Requirements
  • 3-5+ years of experience building, deploying, and operating machine learning systems in production
  • Strong programming skills in Python, Java, Go, or similar languages, with solid software engineering fundamentals
  • ML Fundamentals: a strong grasp of algorithms, from classic statistical learning (XGBoost, Random Forests, regressions) to DL architectures (Transformers, CNNs, GNNs)
  • Hands-on experience with modern ML frameworks (e.g., PyTorch, TensorFlow)
  • Experience designing scalable ML pipelines, data processing systems, and model serving infrastructure
  • Ability to work cross-functionally and translate ambiguous product or business problems into technical solutions
  • Experience improving measurable metrics through applied machine learning
Job Responsibility
  • Design and build production ML systems that power core experiences across the platform, including:
      • Personalized recommendations, search, and ranking systems
      • Intelligent advertising systems including ranking, bidding, measurement, and optimization
      • Content, advertiser, and user understanding
      • Large-scale machine learning pipelines, model serving infrastructure, and real-time decision systems
      • Applied AI and LLM-driven experiences
  • Design, build, and deploy production-grade machine learning models and systems at scale
  • Own the full ML lifecycle
  • Build scalable data and model pipelines
  • Work with large-scale datasets to improve ranking, recommendations, search relevance, prediction, content/user understanding, and optimization systems
  • Partner cross-functionally with Product, Data Science, Infrastructure, and Engineering teams
What we offer
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k with Employer Match
  • Global Benefit programs that fit your lifestyle, from workspace to professional development to caregiving support
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Paid Volunteer Time Off
  • Generous Paid Parental Leave
  • Fulltime

Senior Machine Learning Engineer

We’re hiring a Senior Machine Learning Engineer to help build the core AI system...
Location
United States, San Francisco
Salary:
230000.00 - 300000.00 USD / Year
Signify Technology
Expiration Date
Until further notice
Requirements
  • Strong experience building production AI systems around LLMs, OCR, and unstructured data workflows
  • Proven track record shipping applied AI products, not just prototyping models offline
  • Deep familiarity with modern LLM workflows including prompting, structured outputs, tool use, retries, fallbacks, guardrails, and model evaluation
  • Experience with document intelligence systems such as OCR pipelines, document extraction, classification, post-processing, and confidence-based review flows
  • Experience with voice or conversational AI, or adjacent systems involving transcripts, call automation, and conversational extraction
  • Strong proficiency in Python and comfort working in production codebases with APIs, queues, and backend services
  • Experience deploying and operating AI systems in AWS or similar cloud environments, including serverless or event-driven architectures
  • Strong instincts around evaluation, benchmarking, monitoring, and quality assurance for real-world AI systems
  • Ability to work across structured and unstructured data and design systems that are robust to noisy, incomplete, and ambiguous inputs
Job Responsibility
  • Build the core AI systems behind a next-generation healthcare platform that turns smartphone video into clinically accurate 3D models of human anatomy
  • Own the pipeline that bridges raw computer vision data and physical 3D-printed medical solutions, transforming noisy real-world scans into precise, CAD-compatible models used to improve patient outcomes
  • Work closely with engineers, researchers, and product leaders to design systems that translate cutting-edge ML research into reliable production technology used in healthcare
What we offer
  • Equity
  • Fulltime

Senior Machine Learning Engineer, AI Personalization

Block builds simple, powerful tools that make progress towards an economy that's...
Location
United States, Bay Area, CA
Salary:
194500.00 - 343100.00 USD / Year
Block
Expiration Date
Until further notice
Requirements
  • 10+ years building and operating quality software
  • Domain expertise in recommender systems, ranking systems, or similar with 5+ years of experience
  • Experience leading the development of complex models trained on large datasets that power customer-facing features
  • Strong software engineering skills
  • Strong communication skills and customer empathy
  • Experience with PyTorch, PySpark, Databricks and AWS is a plus
Job Responsibility
  • Lead strategic initiatives in AI Personalization, driving the vision, architecture, and execution
  • Develop and deploy new AI/ML models that power search & recommendations in traditional UX as well as new agentic interfaces
  • Deploy to production at scale to personalize every user's experience
  • Be a technical leader: establish quality practices that stick, make broader design decisions, and set an example for others to follow
  • Collaborate with a cross-functional team of designers, business partners, and software engineers to build new technologies and features
  • Design experiments, test them on production users, analyze, and repeat
What we offer
  • Remote work
  • medical insurance
  • flexible time off
  • retirement savings plans
  • modern family planning
  • Fulltime