ML Ops Engineer

NStarX

Location:
India, Hyderabad

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Not provided

Job Description:

As an MLOps Engineer, you will be responsible for building, maintaining, and optimizing machine learning (ML) operations infrastructure to enable smooth deployment and scaling of ML models. You will collaborate with data scientists, software engineers, and IT teams to streamline model workflows, enhance automation, and ensure high availability of ML solutions.

Job Responsibility:

  • Develop and manage infrastructure for end-to-end ML workflows including model training, deployment, monitoring, and maintenance
  • Implement CI/CD pipelines for ML models and data workflows
  • Collaborate with cross-functional teams to build scalable and robust ML infrastructure on cloud and on-premises environments
  • Monitor and optimize model performance and infrastructure to ensure efficient resource usage
  • Manage data versioning and model versioning across multiple environments
  • Implement security, governance, and compliance protocols in ML deployment and data pipelines
  • Support troubleshooting, debugging, and incident management for ML infrastructure issues

Requirements:

  • 4 to 10 years of experience in MLOps, DevOps, or ML Engineering
  • Strong proficiency with cloud platforms such as AWS, Azure, or GCP
  • Experience with containerization and orchestration tools like Docker and Kubernetes
  • Hands-on experience with ML model deployment, monitoring, and scaling
  • Proficiency with CI/CD tools such as Jenkins or GitLab CI
  • Familiarity with data versioning and management tools such as DVC
  • Strong coding skills in Python with knowledge of ML libraries like TensorFlow or PyTorch
  • Strong problem-solving skills and ability to work in a collaborative environment
  • Effective communication skills for cross-functional teamwork

Nice to have:

  • Experience with MLOps frameworks such as MLflow or Kubeflow
  • Knowledge of data engineering best practices and data pipeline tools like Apache Airflow or Kafka
  • Certification in any cloud platform such as AWS, Azure, or GCP

What we offer:

  • Competitive compensation
  • Opportunity to work with a dynamic team on cutting-edge AI and ML solutions
  • Professional growth and development opportunities

Additional Information:

Job Posted:
December 26, 2025

Employment Type:
Full-time
Work Type:
On-site work
Similar Jobs for ML Ops Engineer

Senior Software Engineer - ML Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...
Location:
United States, San Francisco
Salary:
180000.00 - 270000.00 USD / Year
Plaid
Expiration Date:
Until further notice
Requirements:
  • 5+ years of industry experience as a software engineer, with strong focus on ML/AI infrastructure or large-scale distributed systems
  • Hands-on expertise in building and operating ML platforms (e.g., feature stores, data pipelines, training/inference frameworks)
  • Proven experience delivering reliable and scalable infrastructure in production
  • Solid understanding of ML Ops concepts and tooling, as well as best practices for observability, security, and reliability
  • Strong communication skills and ability to collaborate across teams
Job Responsibility:
  • Design and implement large-scale ML infrastructure, including feature stores, pipelines, deployment tooling, and inference systems
  • Drive the rollout of Plaid’s next-generation feature store to improve reliability and velocity of model development
  • Help define and evangelize an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
  • Ensure operational excellence of ML pipelines and services, including reliability, scalability, performance, and cost efficiency
  • Collaborate with ML product teams to understand requirements and deliver solutions that accelerate experimentation and iteration
  • Contribute to technical strategy and architecture discussions within the team
  • Mentor and support other engineers through code reviews, design discussions, and technical guidance
What we offer:
  • medical, dental, vision, and 401(k)
Employment Type:
Full-time

Senior Software Engineer – ML Model Compliance & Automation

We are seeking a highly skilled and motivated Senior Software Engineer to lead t...
Location:
India, Jaipur
Salary:
Not provided
InfoObjects
Expiration Date:
Until further notice
Requirements:
  • Experience required: 3 to 7 years
  • GoLang (preferred)
  • Python (preferred)
  • Bash
  • MLOps Tools: KitOps, MLModelCI, MLflow, ONNX, TensorFlow, PyTorch, Docker
  • SBOM & Security: Syft, Grype, Trivy, CycloneDX, SPDX
  • CI/CD: GitHub Actions, GitLab CI, Jenkins, ArgoCD
  • Infra: Kubernetes, Docker, Helm, Terraform
  • Cloud: AWS, GCP, Azure (EKS/GKE/ECS preferred)
  • Version Control: Git, GitOps
Job Responsibility:
  • Model Packaging & Artifact Management: Design and implement workflows for packaging ML models using KitOps, ONNX, MLflow, or TensorFlow SavedModel
  • Manage model artifact versioning, registries, and reproducibility
  • Ensure artifact integrity, consistency, and traceability across CI/CD pipelines
  • Model Profiling & Optimization: Automate model profiling (latency, size, ops) using MLModelCI, TorchServe, or ONNX Runtime
  • Apply quantization, pruning, and format conversions (e.g., FP32→INT8) for optimization
  • Embed profiling and optimization checks into CI/CD pipelines to assess deployment readiness
  • Compliance & SBOM Generation: Develop pipelines to generate and validate SBOMs for ML models
  • Implement compliance checks for licensing, vulnerabilities, and security using CycloneDX, SPDX, Syft, or Trivy
  • Validate schema, dependencies, and runtime environments for production readiness
  • Cloud Integration & Deployment: Automate model registration, endpoint creation, and monitoring setup in AWS/GCP/Azure
Employment Type:
Full-time

Machine Learning Ops Engineer

The Customer AI & Rapid Prototyping department stands at the forefront of digita...
Location:
Portugal, Oporto; Lisbon; Funchal; Ponta Delgada
Salary:
Not provided
TUI
Expiration Date:
Until further notice
Requirements:
  • Experience in productionising and using various AI models and algorithms
  • Experience in deploying AI solutions using CI/CD pipelines, API development and containers
  • Strong programming skills in Python
  • Understanding of machine learning/AI frameworks and libraries
  • Hands-on experience with cloud technologies and services (e.g., AWS, Azure, Google Cloud)
  • Experience with monitoring and log collection systems (e.g. DataDog)
  • Some experience with Generative AI technologies (e.g. Bedrock, Langchain, LangGraph)
  • Customer-focused engineer with a passion for crafting high-quality digital products, continuous improvement, and effective team collaboration
  • Strong problem-solving and communication skills, with an understanding of the social, legal, and ethical impact of AI technologies
Job Responsibility:
  • Develop, implement, and maintain machine learning models and algorithms
  • Work closely with cross-functional teams to integrate ML solutions into production systems
  • Monitor and optimize the performance of deployed AI models
  • Collaborate with engineering colleagues on AI-related tasks to deliver impactful, data-driven solutions
  • Research, evaluate, and test new approaches, processes, and tools
What we offer:
  • Attractive remuneration
  • Bonus opportunity
  • Exclusive travel perks & discounts
  • Extensive health & wellbeing support
  • Flexible working
  • Hybrid or remote working models
  • Opportunities to upskill, reskill and grow your career
  • Access the TUI Tech Learning Hub
  • Participate in our tech communities and collaborate on global projects and teams
  • Get involved with incredible local charity and sustainability initiatives like the TUI Care Foundation and the Sustainable Tech Community
Employment Type:
Full-time

Site Reliability Engineer (SRE) – ML Platform

Location:
United States, Sunnyvale
Salary:
Not provided
Thirdeye Data
Expiration Date:
Until further notice
Requirements:
  • 6+ years of experience in ML Ops with strong knowledge in Kubernetes, Python, MongoDB and AWS
  • Good understanding of Apache Solr
  • Proficient with Linux administration
  • Knowledge of ML models and LLMs
  • Ability to understand tools used by data scientists and experience with software development and test automation
  • Ability to design and implement cloud solutions and ability to build MLOps pipelines on cloud solutions (AWS)
  • Experience working with cloud computing and database systems
  • Experience building custom integrations between cloud-based systems using APIs
  • Experience developing and maintaining ML systems built with open-source tools
  • Experience with MLOps frameworks such as Kubeflow, MLflow, DataRobot, and Airflow, plus experience with Docker and Kubernetes
Job Responsibility:
  • Continuous Deployment using GitHub Actions, Flux, Kustomize
  • Design and implement cloud solutions and build MLOps pipelines on AWS
  • Containerize and deploy data science models using Docker, vLLM, and Kubernetes
  • Communicate with a team of data scientists, data engineers, and architects, and document the processes
  • Develop and deploy scalable tools and services for our clients to handle machine learning training and inference
  • Knowledge of ML models and LLMs
Employment Type:
Full-time

Data Infrastructure Engineer

A venture-backed startup at the intersection of AI and national security is buil...
Location:
United States, New York City Metropolitan Area
Salary:
Not provided
Orbis Consultants
Expiration Date:
Until further notice
Requirements:
  • Strong engineering experience in Python, Go, or C
  • Experience building and scaling production data systems
  • Hands-on expertise with model deployment and ML Ops practices
  • Knowledge of database design, performance tuning, and operations
  • Someone who thrives in early-stage, fast-paced environments and enjoys tackling complex challenges
Job Responsibility:
  • Build and maintain the data pipelines and infrastructure that power ML applications
  • Deploy and manage models at scale, from training through production
  • Design APIs and services that integrate smoothly into mission-critical workflows
  • Ensure data is handled and secured properly across large, distributed environments
  • Collaborate closely with a small, fast-moving team to solve hard technical problems in real-world settings
What we offer:
  • Significant equity
  • Strong health & wellness benefits
Employment Type:
Full-time

Staff Machine Learning Engineer

Machine Learning Engineers at Rocket Money further our mission by building produ...
Location:
United States, San Francisco; Washington, D.C.; New York City; Silver Spring; Miami; Denver
Salary:
210000.00 - 260000.00 USD / Year
Truebill
Expiration Date:
Until further notice
Requirements:
  • 8+ years of professional experience in machine learning engineering or data science roles
  • Proven track record of designing and implementing ML systems at consumer tech scale and speed
  • Extensive hands-on experience integrating ML and AI methods into production workflows, including creating evaluation tooling and effective user feedback mechanisms
  • Experience with prompt engineering and management, creating robust systems for testing and optimizing LLM-based applications
  • Expert-level proficiency in Python, SQL, and at least a handful of common ML frameworks
  • Understanding of ML methods at a fundamental level
  • Master at taking ambiguous problems, creating clarity, and breaking down work into manageable chunks for implementation
  • Owned the development, launch, and maintenance for several scaled ML/AI powered product experiences
  • Understand basic software engineering and computer science fundamentals and have applied them at consumer grade scale to build ML powered products in production environments
  • Technical leader who can identify both emergent technical opportunities and gaps relative to best practice
Job Responsibility:
  • Lead the architecture and development of complex AI and ML powered features across Rocket Money's product suite
  • Design, implement, and maintain robust evaluation frameworks
  • Develop novel product experiences
  • Own end to end development and implementation of ML and AI product features in collaboration with cross-functional product development teams
  • Provide technical mentorship
What we offer:
  • Health, Dental & Vision Plans
  • Competitive Pay
  • 401k Matching
  • Unlimited PTO
  • Lunch daily (in-office only)
  • Snacks & Coffee (in-office only)
  • Commuter benefits (in-office only)
  • Bonus
Employment Type:
Full-time

Senior Machine Learning Engineer

Create and maintain code, documentation, testing and deployment framew...
Location:
Singapore, Singapore
Salary:
8000.00 - 10500.00 SGD / Month
Randstad
Expiration Date:
January 01, 2026
Requirements:
  • Bachelor's degree in Computer Science or equivalent experience
  • Minimum 3 years in machine learning and operations, ideally in a hands-on MLOps role
  • Experience deploying APIs and packages
  • Experience with MLOps platforms and tools (MLflow, W&B, Kubernetes, Docker, Dask, rapids.ai, Ray)
  • Experience with ML frameworks (TensorFlow, Keras, PyTorch, XGBoost, scikit-learn)
  • Strong Python coding skills – experience with unit testing, code reviews, version control
  • Alternatively, a DevOps Engineer with ML experience, a Bioinformatics Engineer with AI/MLOps-related experience, or an ML Engineer with DevOps experience
Job Responsibility:
  • Create and maintain code, documentation, testing and deployment frameworks, tools and infrastructure, working closely with engineers and domain experts on AI/ML models and pipelines
  • Own the machine learning model training pipeline, including how data is managed and how it gets into production
  • Develop environments for building, testing, tracking production AI models and data across data pipelines used in primary and secondary genomic analysis
  • Act as a technical expert developing AI models within a consistent ML environment, and automate training and data/model management
What we offer:
  • Bonus + Allowance
  • Hybrid working arrangement - 2 days WFH
  • Can end work early and have more personal time
  • shuttle bus provided from multiple MRT stations

Engineering Manager - Machine Learning Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...
Location:
United States, San Francisco
Salary:
241200.00 - 400000.00 USD / Year
Plaid
Expiration Date:
Until further notice
Requirements:
  • 8–10 years of experience in ML infrastructure, including direct hands-on expertise as an engineer, IC/TL
  • 2+ years of experience managing infrastructure or ML platform engineers
  • Proven experience delivering and operating ML or AI infrastructure at scale
  • Solid technical depth across ML/AI infrastructure domains (e.g., feature stores, pipelines, deployment, inference, observability)
  • Demonstrated ability to drive execution on complex technical projects with cross-team stakeholders
  • Strong communication and stakeholder management skills
Job Responsibility:
  • Lead and support the ML Infra team, driving project execution and ensuring delivery on key commitments
  • Build and launch Plaid’s next-generation feature store to improve reliability and velocity of model development
  • Define and drive adoption of an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
  • Ensure operational excellence of ML pipelines, deployment tooling, and inference systems
  • Partner with ML product teams to understand requirements and deliver solutions that accelerate model development and iteration
  • Recruit, mentor, and develop engineers, fostering a collaborative and high-performing team culture
What we offer:
  • medical
  • dental
  • vision
  • 401(k)
  • equity
  • commission
Employment Type:
Full-time