CrawlJobs Logo

Cloud Machine Learning Engineer

India, Pune · Job Posted April 24, 2026
Apply Position
Job Link Share

Job Description

We are seeking a Cloud ML Engineer (AWS) within Data & Analytics GSL to use ML/AI expertise alongside strong programming and software engineering skills to make machine learning models and analyses easier to use and access. The role supports local markets and group functions to obtain business value from machine learning, and focuses on designing ML systems, productionising prototypes, enabling robust data flows, and evolving Big Data capabilities through reusable assets and patterns.

Job Responsibility

  • Design and develop machine learning systems and implementation patterns
  • Automate predictive model software, including model training
  • Productionise data science prototypes and develop machine learning applications aligned to data science requirements
  • Facilitate the flow of data between ML/AI models and the organisation's data systems
  • Enhance data pipelines to ensure data is clean, accurate, and optimised for machine learning models
  • Partner with the architecture team to evolve Big Data platform capabilities (reusable assets/patterns) and components to meet business requirements and objectives
  • Research, investigate, and evaluate new technologies and methods to improve delivery and sustainability of machine learning applications and services
  • Contribute to defining best practice for agile development of applications running on the Big Data platform

Requirements

  • Experience managing the development lifecycle for agile software development projects (Kanban or Scrum exposure)
  • Robust data modelling and data architecture skills
  • Knowledge of Big Data frameworks such as Hadoop, Spark, Hive, Yarn, and Airflow
  • Experience with distributed ML frameworks (for example H2O and/or TensorFlow) and various ML libraries
  • Proven experience creating and deploying end-to-end ML pipelines in production, including MLOps
  • Programming experience in Java and Python
  • Experience with containerisation (Docker/Kubernetes) or cloud alternatives is advantageous
  • Mandatory working experience with AWS services including: SageMaker Pipelines, SageMaker Studio, CloudFormation, CloudTrail, SNS, EventBridge, CodePipeline, CodeBuild, CodeCommit
  • Experience with other distributed technologies, NoSQL databases, and streaming technologies is desirable
  • Strong written and verbal communication, with excellent interpersonal and collaboration skills
  • 3-year IT or IS degree or diploma (or related field) is essential
  • an advanced degree in Computer Science/Math/Statistics (or related discipline) is an advantage
  • Relevant cloud certification at professional or associate level
  • 5+ years of relevant experience as an AI/ML Engineer
  • 5+ years of BI or related software development experience

Nice to have

Experience with other distributed technologies, NoSQL databases, and streaming technologies

What we offer

  • Opportunities to deliver business value by enabling scalable ML capabilities for local markets and group functions
  • The chance to work on production-grade ML systems, end-to-end pipelines, and MLOps practices on AWS
  • Collaboration with architecture teams to shape reusable Big Data platform assets and engineering patterns
  • Scope to explore and evaluate new technologies and methods to improve sustainability and delivery of ML applications and services
  • Strengthening best practices for productionising data science prototypes into reliable ML applications
  • Deepening hands-on expertise in AWS-native MLOps and pipeline automation (including SageMaker and AWS developer tools)
  • Enhancing Big Data platform design skills through reusable patterns, components, and data pipeline optimisation
  • Improving cost and resource efficiency approaches across compute, network usage, and platform objectives

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Cloud Machine Learning Engineer

8 matching positions

Sr SOC Verification Engineer, Cloud-Scale Machine Learning Acceleration

Our Machine Learning Acceleration (MLA) team develops the Inferentia and Trainiu...
Location
Location
United States , Cupertino; Austin
Salary
Salary:
159200.00 - 247600.00 USD / Year
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or above in Computer Science, Computer Engineering, Electrical Engineering, or related fields
  • 8+ years of design verification experience using System Verilog and UVM
  • 8+ YOE in testbench development including: stimulus, checkers, assertions and coverage
Job Responsibility
Job Responsibility
  • Verify custom chip designs at the SOC level
  • Integrate 3rd party IPs and VIPs into the SOC testbench
  • Create comprehensive testplans, write robust random testcases, and execute coverage plans
  • Maintain autosmoke and regression infrastructure
  • Dive deep into bugs and triages
  • Mentor junior engineers
What we offer
What we offer
  • health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
  • 401(k) matching
  • paid time off
  • parental leave
  • sign-on payments
  • restricted stock units (RSUs)
Read More
Arrow Right
New

Staff Artificial Intelligence Machine Learning Engineer

The Role: General Motors is seeking a Staff AI/ML Engineer for the Vehicle Mech...
Location
Location
United States , Austin
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Graduate degree (Master's or PhD) in Computer Science, Data Science, Machine Learning, Statistics, Engineering, or a closely related quantitative field
  • 7+ years of hands-on experience designing, building, and operating machine learning systems in production environments
  • Strong proficiency in Python (production-quality code, testing, packaging) and SQL, with experience working in shared, multi-developer codebases
  • Practical experience with core ML frameworks such as PyTorch, TensorFlow, or scikit-learn, and with MLOps tooling (e.g., MLflow, CI/CD, model registries, experiment tracking)
  • Experience building data and ML workloads on cloud platforms, preferably Microsoft Azure, and working with Databricks, Spark, or similar distributed processing frameworks
  • Demonstrated ability to turn ambiguous real-world problems into shippable AI/ML solutions, owning the details from data exploration through deployed service and ongoing operation
  • Strong understanding of ML system behavior in production (data issues, non-stationarity, latency, throughput, failure modes) and comfort debugging with logs, metrics, and traces
  • Excellent communication and collaboration skills, with a track record of influencing decisions and mentoring other AI/ML practitioners
Job Responsibility
Job Responsibility
  • Design, build, and operate end-to-end AI/ML solutions (data pipelines, models, services, and tools) for diagnostics, prognostics, and test analytics
  • Implement production-grade ML pipelines on platforms such as Azure and Databricks, covering data ingestion, feature engineering, training, evaluation, and inference for batch and streaming workloads
  • Develop and maintain robust, observable ML services and internal tools that make complex vehicle and field data easy to use for engineers and technical stakeholders
  • Apply practical ML and statistical methods (e.g., tree-based models, time-series and anomaly detection, deep learning where appropriate) with a focus on reliability, explainability, and impact
  • Own model and data observability in production, including metrics, dashboards, alerts, and remediation workflows for drift, data quality, and performance regressions
  • Partner with data engineering to define and use industrialized and vectorized data products that support search, RAG, and analytics at scale
  • Review designs and code, mentor AI/ML practitioners, and help set high standards for testing, logging, deployment, and documentation
  • Collaborate with diagnostics/prognostics SMEs, validation, safety, and program teams to prioritize work, define success metrics, and embed solutions in day-to-day engineering workflows
  • Fulltime
Read More
Arrow Right
New

Staff Machine Learning Engineer - ML Training Infrastructure

The Role:   We are seeking an experienced, technically strong, impact-driven ex...
Location
Location
United States , Austin; Mountain View
Salary
Salary:
185000.00 - 335300.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or higher in Computer Science or a related field, or equivalent practical experience
  • 8+ years of professional software engineering experience
  • 5+ years of specialized experience in AI/ML infrastructure, such as enabling distributed training for large-scale ML models
  • Strong programming skills in Python, with deep proficiency in frameworks such as PyTorch (preferred), TensorFlow, or similar ML systems
  • Proven experience designing and operating distributed systems for ML training, including distributed computing, GPU computing, and cloud environments (AWS, GCP, Azure)
  • Demonstrated track record of leading technically ambiguous, cross-team infrastructure initiatives and driving them to measurable impact
  • Strong architectural judgment and ability to make sound technical tradeoffs across performance, reliability, usability, and cost
  • Willingness to travel to Sunnyvale, CA as needed
  • Comfortable operating in highly ambiguous and dynamic environments
Job Responsibility
Job Responsibility
  • Define and drive the architecture, design, and development of scalable, reliable, and high-performance ML frameworks and platform capabilities to support model training at scale
  • Lead model training performance analysis and optimization efforts across distributed training workflows, improving scalability, efficiency, and cost across heterogeneous hardware environments
  • Raise the bar on system observability, debuggability, operational excellence, and developer experience across the ML training stack
  • Own large, ambiguous, cross-functional technical initiatives from strategy through execution, including technical roadmap definition, tradeoff analysis, and delivery
  • Influence platform direction by identifying long-term infrastructure investments, setting engineering standards, and driving adoption of best practices across teams
  • Collaborate across organizational boundaries to align requirements, resolve technical disagreements, and integrate new capabilities into the platform ecosystem
  • Mentor engineers through design reviews, technical guidance, and hands-on partnership, while elevating engineering quality across the team
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right
New

GenAI / Machine Learning Engineer

We are seeking a GenAI / ML Engineer with 4–5 years of overall professional expe...
Location
Location
India , Pune
Salary
Salary:
Not provided
vodafone.com Logo
Vodafone
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4–5 years of overall experience in software engineering, AI/ML engineering, data engineering, cloud engineering, or related technology roles
  • At least 2 years of relevant domain experience in Generative AI, LLM-based applications, NLP or conversational AI, data or analytics engineering, cloud-native development, or enterprise chatbot platforms
  • Strong hands-on experience with Python for application development, automation, AI/ML workflows, or backend services
  • Working knowledge of machine learning and Generative AI concepts, including LLMs, embeddings, prompts, and RAG-based patterns
  • Comfortable working with SQL and enterprise datasets
  • Good working knowledge of Google Cloud Platform or similar cloud environments
  • Experience developing or integrating APIs and backend or cloud-based applications
  • Ability to debug, test, and optimise AI/ML or data-driven solutions for accuracy, reliability, and performance
  • Effective communication with both technical and business stakeholders and collaborative work across teams
Job Responsibility
Job Responsibility
  • Design, develop, and enhance GenAI and ML-based solutions for enterprise business use cases
  • Build and improve Natural Language to SQL and Retrieval-Augmented Generation based chatbot capabilities
  • Develop and maintain Python-based backend services, APIs, and AI workflows supporting LLM-driven applications
  • Work with Google Cloud Platform services to build, deploy, and optimise scalable, cloud-native AI applications
  • Improve retrieval quality, prompt orchestration, SQL generation accuracy, chatbot response quality, and overall solution performance
  • Collaborate with internal stakeholders to translate business requirements into scalable AI/ML product capabilities
  • Contribute to testing, evaluation, monitoring, documentation, and production-readiness of GenAI solutions
  • Support continuous improvement of AI engineering practices, including prompt design, evaluation frameworks, observability, and responsible AI usage
What we offer
What we offer
  • Opportunity to work on production-grade Generative AI solutions with direct business impact
  • Hands-on exposure to modern GCP services, cloud-native deployment patterns, and enterprise-scale AI architectures
  • Experience across LLMs, RAG, Natural Language to SQL, enterprise data platforms, APIs, and chatbot engineering
  • A high-visibility role with clear growth pathways into senior engineering, technical leadership, solution architecture, or AI product ownership roles
  • Fulltime
Read More
Arrow Right
New

Sr Machine Learning Engineer

Let’s do this. Let’s change the world. We are looking for a highly motivated exp...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with AWS, Databricks, Apache Spark, PySpark, SparkSQL, Python, and SQL for large-scale data engineering
  • Strong proficiency in workflow orchestration, Spark performance tuning, and scalable batch and streaming data pipeline development
  • Experience with real-time data processing and integration using Apache Kafka, Debezium, or similar streaming technologies
  • Hands-on experience with MLOps tools and practices, including MLflow, model serving, feature stores, experiment tracking, deployment, and lifecycle management
  • Experience with GenAI engineering practices, including prompt engineering, LLM evaluation, AI observability, agentic workflows, and knowledge graphs
  • Ability to design and develop APIs or service interfaces for data, ML, and GenAI application integration
  • Experience with Agile/SAFe delivery models, DevOps practices, CI/CD concepts, and cross-functional team collaboration
  • Strong analytical, problem-solving, debugging, communication, and teamwork skills
  • Ability to quickly learn, adapt, and apply emerging technologies across data, ML, and AI engineering
  • Doctorate degree / Master's degree / Bachelor's degree and 8 to 13 years of experience years of experience in Computer Science, IT or related field
Job Responsibility
Job Responsibility
  • Design, deploy, monitor, and optimize production-grade ML and Generative AI applications for AI-enabled manufacturing solutions
  • Define technical architecture, engineering standards, and best practices across data engineering, ML, GenAI, analytics, and platform capabilities
  • Partner with business stakeholders, product owners, and cross-functional teams to translate manufacturing challenges into secure, scalable, production-ready AI and data solutions
  • Design, develop, and maintain complex ETL/ELT pipelines in Databricks using PySpark, Scala, and SQL for large-scale structured and unstructured data processing
  • Build efficient ingestion, transformation, migration, and deployment pipelines across databases, APIs, logs, event streams, images, PDFs, documents, and third-party platforms
  • Design and implement GenAI solutions including RAG, embeddings, vector databases, agentic workflows, tool-calling systems, LLM orchestration, serving optimization, knowledge graphs, and metadata-driven retrieval
  • Build GenAI applications using frameworks and platforms such as LangChain, LangGraph, LlamaIndex, DSPy, OpenAI APIs, Amazon Bedrock, or equivalent technologies
  • Develop evaluation and observability frameworks to monitor model quality, hallucination rates, drift, retrieval effectiveness, latency, token usage, cost, reliability, operational health, and business impact
  • Build and maintain MLOps and LLMOps capabilities, including experiment tracking, model registry, prompt management, versioning, CI/CD, automated testing, deployment automation, monitoring, governance, and release controls
  • Design scalable data quality, validation, security, privacy, access control, logging, governance, and interoperability capabilities across hybrid cloud environments
  • Fulltime
Read More
Arrow Right
New

Senior Machine Learning Engineer, Search Assistant

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...
Location
Location
United States , San Jose
Salary
Salary:
361300.00 - 510000.00 USD / Year
roku.com Logo
Roku
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of industry experience (or PhD with 5+ years) applying ML at scale in search, recommendation, ads, personalization, or related domains
  • Strong expertise in ranking systems, recommendation systems, retrieval, personalization, and multi-objective optimization
  • Experience building large-scale ML systems leveraging deep learning, sequence models, LLMs, reinforcement learning, or bandit frameworks
  • Strong product intuition and experience optimizing user engagement, retention, and monetization simultaneously
  • Proficiency in Python, Java, or Scala
  • Experience with distributed systems and ML infrastructure such as Spark, Airflow, streaming systems, feature stores, and cloud platforms
  • Strong technical leadership, system design, communication, and problem-solving skills
  • MS or PhD in Computer Science, Statistics, or a related field
Job Responsibility
Job Responsibility
  • Lead the technical vision and roadmap for ranking, personalization, and recommendation systems powering Roku’s entertainment assistant
  • Develop and deploy state-of-the-art ML models using deep learning, transformers, LLMs, bandits, reinforcement learning, and causal inference techniques
  • Build multi-objective optimization systems balancing engagement, retention, relevance, and monetization goals
  • Drive innovation in conversational discovery, contextual recommendations, and personalized content experiences across the platform
  • Design, run, and analyze online A/B experiments tied to key product and business KPIs
  • Architect scalable ML systems, feature platforms, and data pipelines supporting rapid experimentation and long-term growth
  • Mentor engineers and provide technical leadership across cross-functional initiatives involving engineering, product, UX, and analytics teams
What we offer
What we offer
  • Health insurance
  • Equity awards
  • Life insurance
  • Disability benefits
  • Parental leave
  • Wellness benefits
  • Paid time off
  • Global access to mental health and financial wellness support and resources
  • Healthcare (medical, dental, and vision)
  • Life, accident, disability, commuter, and retirement options (401(k)/pension)
  • Fulltime
Read More
Arrow Right
New

Machine Learning Engineer

You will play a key role in a regulatory submission content automation initiativ...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s / Bachelor’s degree and 5 - 8 years of experience in Software Engineering, Data Science or Machine Learning Engineering Experience
  • Strong foundation in machine learning algorithms and techniques
  • Proficiency in Python and relevant ML libraries (e.g., TensorFlow, PyTorch, Scikit-learn, langchain)
  • Familiar with AWS, Azure, or Google Cloud
Job Responsibility
Job Responsibility
  • Develop and deploy applications that utilize LLMs such as OpenAI GPT 4, Claude, Gemini
  • Build and maintain MLOps pipelines, including data ingestion, versioning, chunking, vectorization, feature engineering, model training, deployment, and monitoring
  • Leverage cloud platforms (AWS, GCP, Azure) for ML model development, training, and deployment
  • Implement DevOps/MLOps/LLMOps best practices to automate ML workflows and improve efficiency
  • Develop and implement monitoring systems to track model performance and identify issues
  • Conduct A/B testing and experimentation to optimize model performance
  • Work closely with data scientists, engineers, and product teams to deliver ML solutions
  • Stay updated with the latest trends and advancements
  • Fulltime
Read More
Arrow Right
New

Associate Machine Learning Engineer

The Associate Machine Learning Engineer position offers a unique opportunity to ...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Any degree and 3 to 5 years of Computer Science, IT or related field experience
  • Strong foundations in machine learning algorithms and techniques
  • Experience in MLOps practices and tools (e.g., MLflow, Kubeflow, Airflow)
  • Experience in model monitoring, including model observability and explainability
  • Proficiency in Python (or R) and relevant ML libraries (e.g., TensorFlow, PyTorch, Scikit-learn)
  • Experience with big data technologies (e.g., Spark, Hadoop), and performance tuning in query and data processing
  • Understanding of the LLM inference lifecycle, including prompt execution, retrieval, and response generation
  • Awareness of common LLM failure modes such as hallucinations, prompt injection, and data leakage
Job Responsibility
Job Responsibility
  • Collaborate with data scientists to develop, train, and evaluate machine learning models
  • Build and maintain MLOps pipelines, including data ingestion, feature engineering, model training, deployment, and monitoring
  • Leverage cloud platforms (AWS, Databricks) for ML model development, training, and deployment
  • Develop solutions using DevSecOps framework that are secure, scalable, reliable, and aligned with enterprise architecture standards
  • Evaluate model performance using appropriate metrics and optimize models for accuracy and efficiency
  • Develop and execute unit tests, integration tests, and other testing strategies to ensure the quality of the software
  • Create and maintain documentation on software architecture, design, deployment, disaster recovery, and operations
  • Identify and resolve technical challenges effectively
  • Provide ongoing support and maintenance for applications, ensuring that they operate smoothly and efficiently
  • Analyze customer feedback and support data to identify pain points and opportunities for improvement
  • Fulltime
Read More
Arrow Right