ML / Data Engineer – Data Science Enablement Job at Amgen (Hyderabad)

Big Data Engineer - ML Analytics & Search

We train models on petabyte-scale automotive sensor data, but training is only h...

Location

Germany , Munich

Salary:

Not provided

BMW

Expiration Date

Until further notice

Requirements

University degree in Computer Science, Engineering, or a related field
3–5 years of experience in big data or data engineering with a focus on analytics and search over very large datasets
Strong Python and SQL skills, with experience in at least one distributed compute framework
Experience with columnar or analytical storage and query optimisation at PB scale
Familiarity with search and indexing technologies, including full-text search, vector/embedding search or metadata catalogues
Production experience with Kubernetes and AWS / Azure / Google Cloud, as well as hands-on experience with infrastructure-as-code
Experience with automotive measurement data (MDF4/ASAM MDF or MCAP) as well as with embedding-based retrieval, dataset management tools, stream processing, or graph-based metadata systems

Job Responsibility

Design and build high-performance search and query pipelines over PB-scale MDF4 and MCAP data lakes, enabling ML engineers to find relevant driving scenarios, sensor conditions, and edge cases across billions of records in seconds
Build and operate indexing and cataloguing systems for automotive sensor data, including metadata extraction, signal-level indexing, scene tagging, and embedding-based similarity search
Implement distributed compute pipelines for large-scale data evaluation, such as batch statistics, distribution analysis, annotation coverage reports, and data-quality scoring
Build fast analytical queries that enable interactive exploration on top of raw data
Develop dataset assembly pipelines that automatically assemble, version, and register training and evaluation datasets
Optimise for cost and performance through intelligent partitioning, tiered storage, caching strategies, and query pushdown to minimise scan volumes over PB-scale data
Operate observability stacks for data pipelines, including query latency dashboards, pipeline health, and data freshness monitors

What we offer

Challenging projects with which we shape the mobility of tomorrow together
Wide range of personal and professional development opportunities
Attractive, fair and performance-related remuneration
High level of job security
Annual special payments such as vacation pay, Christmas bonus, and profit sharing
Flexible working hours including six weeks annual leave and overtime compensation
Discounted BMW & MINI conditions

Senior Data Scientist – AI & ML | MLOps Enablement

The Developer Platform Organization’s mission is to accelerate the delivery of r...

Location

United States , Seattle

Salary:

166000.00 - 258000.00 USD / Year

Nordstrom

Expiration Date

Until further notice

Requirements

Bachelor’s, Master’s, or PhD in Statistics, Data Science, Computer Science, Engineering, or a related technical field required
10+ years of hands-on Data Science experience with production model delivery across multiple ML (classification, ranking, NLP, time-series, recommendation) and GenAI models
Deep expertise in model evaluation — defining metrics, thresholds, and evaluation pipelines for real-world production models
Experience with Feature Store design, feature engineering, and understanding of feature freshness, reuse, and drift across different model families
Proficiency in Python with experience writing clean, maintainable, production-quality ML code
Strong understanding of ML monitoring — data drift, prediction drift, and concept drift detection
Experience with experiment tracking and model lifecycle management
Ability to translate between DS practice and platform engineering — comfortable driving design decisions, authoring DS-native documentation, and engaging in technical design reviews
Self-directed
comfortable owning POC work end-to-end without a dedicated DS team structure

Job Responsibility

Run end-to-end POC validation for new platform capabilities — Feature Store, Endpoints, Model Evaluation, AutoML, BigQuery ML etc. — independently, before they reach DS teams at scale
Attend DS team planning and design sessions as an embedded practitioner
surface real workflow pain points and translate them into reusable MLOps platform requirements
Design and own the Model Evaluation Framework — defining metrics, thresholds, and evaluation pipelines for batch, online, and streaming use cases on Vertex AI
Build model-type-aware Feature Store schemas, endpoint configurations, and evaluation pipelines that accommodate the fundamentally different needs of different ML models
Lead benchmarking of Nordstrom’s platform against industry standards — SageMaker vs. Vertex AI — across feature parity, cost, and DS practitioner ergonomics
Author DS-native documentation, onboarding guides, and quickstart notebooks that lower the adoption barrier for new platform features
Contribute DS domain expertise to the emerging Vertex AI Agentic Platform — identifying DS workflow pain points as agent use cases and defining evaluation frameworks for agentic responses
Own model card standards — capturing what actually matters to a practitioner, not just governance checkboxes
Communicate complex trade-offs and platform decisions to technical and non-technical stakeholders across DS, engineering, and leadership

What we offer

Medical/Vision
Dental
Retirement
Paid Time Away
Life Insurance
Disability
Merchandise Discount
EAP Resources
401k
performance-based incentives/bonuses

Fulltime

Data Engineer II - Getting Customers Ready for AI

Security represents the most critical priorities for our customers in a world aw...

Location

United States , Redmond

Salary:

102100.00 - 202200.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 2+ years of software, data, or related engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer, and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years.

Job Responsibility

Design and build scalable data pipelines (batch and streaming) to process large volumes of security and operational data
Develop and optimize ETL/ELT workflows that transform raw telemetry into structured, consumable datasets
Implement data ingestion frameworks to integrate multi-source data from services, APIs, and event streams
Improve pipeline performance, reliability, and efficiency through partitioning, indexing, and optimization techniques
Design and evolve data models, schemas, and storage strategies for analytics and AI use cases
Work with distributed storage systems (e.g., data lakes, warehouses) to ensure scalability and cost efficiency
Maintain data partitioning, retention, and lifecycle strategies aligned to business and compliance needs
Ensure data is structured for downstream ML pipelines, feature engineering, and analytics workloads
Implement data validation, quality checks, and observability frameworks to ensure data accuracy and reliability
Apply best practices for data governance, lineage, and auditing across pipelines and datasets

Fulltime

Data Engineer III

Robert Half, one of FORTUNE’s World’s Most Admired Companies and a Fortune 100 B...

Location

United States , San Ramon

Salary:

Not provided

Robert Half

Expiration Date

Until further notice

Requirements

Bachelor's in Computer Science, Engineering, or related field (Master's preferred)
5+ years with Python and SQL in data engineering for big data ML/analytics workloads
5+ years designing, building, and troubleshooting scalable ETL/ELT pipelines for business-critical production systems
3+ years with cloud data services (AWS), container orchestration (Docker, Kubernetes), and IaC (Terraform, CloudFormation)
3+ years architecting ML workflows and data platforms with CI/CD, automated testing, and distributed processing (Spark)
3+ years collaborating cross-functionally with Data Science, MLOps, Platform Engineering, and DevOps teams
3+ years implementing data quality testing and optimizing SQL/Python for cost/performance in the cloud
Understanding of the full Data Science SDLC, and experience mentoring engineers
Strongly Preferred - 2+ years hands-on with Databricks (Delta Lake, Unity Catalog, Databricks SQL)
Experience with MLflow experiment tracking and model registry workflows

Job Responsibility

Lead architecture and design of complex data pipelines on Databricks lakehouse architecture (Unity Catalog, Delta Lake, Structured Streaming)
Define technical approach for data engineering initiatives, mentor less-senior engineers, and set standards for code quality through leadership and code reviews
Design and build data foundations that enable AI/ML capabilities — feature stores, embedding pipelines, vector search indexes, and model training datasets
Align data engineering solutions with business strategy, including support for Agentic AI workloads
Own health, scalability, and modernization of data infrastructure with Databricks as the strategic platform — including workload migration, compute optimization, and Unity Catalog adoption
Optimize pipeline performance (Delta Lake table layouts, clustering, Z-ordering) and establish monitoring/alerting best practices with clear SLAs
Build data infrastructure supporting Agentic AI systems — real-time data access layers, context retrieval pipelines, and agent-accessible data services
Collaborate cross-functionally with DevOps, Platform Engineering, and MLOps roles to integrate data solutions into the broader technology environment and shared AI infrastructure – Mlflow registries, feature stores, and agent orchestration layers
Provide consultation to Senior Leadership on complex projects and drive continuous improvement initiatives
Champion data governance at all layers for data, models, and AI assets

What we offer

medical
vision
dental
life and disability insurance
401(k) plan
free online training

Fulltime

Senior Manager, Data Science - Forecasting

We are seeking a Senior Manager, Data Science to lead the Forecasting team withi...

Location

India , Hyderabad

Salary:

Not provided

Amgen

Expiration Date

Until further notice

Requirements

12+ years of professional experience delivering data science, machine learning, forecasting, AI, analytics, or decision-support solutions that created measurable business value.
7+ years of experience managing, leading, or formally developing data science, machine learning, AI, analytics, or cross-functional technical teams.
Demonstrated experience setting data science strategy, prioritizing a portfolio of work, managing stakeholder expectations, and leading teams through ambiguous, high-impact business problems.
Deep experience with forecasting, predictive modeling, statistical modeling, probabilistic or Bayesian methods, uncertainty quantification, scenario analysis, experimentation, or optimization.
Experience partnering with machine learning engineering, software engineering, product, program, or platform teams to move models and analytics capabilities from prototype into production or scaled business use.
Experience with modern AI systems, including LLM-powered applications, AI agents, retrieval or information-retrieval systems, evaluation frameworks, guardrails, and human-in-the-loop operating patterns.
Strong analytical and technical fluency with Python, R, SQL, or equivalent tools, and familiarity with modern data science and ML frameworks such as scikit-learn, PyTorch, TensorFlow/JAX, Spark, MLflow, Airflow/Prefect/Dagster, or equivalent technologies.
Familiarity with cloud platforms, enterprise data platforms, model deployment patterns, MLOps, model monitoring, reproducibility, governance, security, privacy, and responsible AI practices.
Strong communication and executive-influence skills, including the ability to explain complex methods, forecast uncertainty, assumptions, model risks, and business implications to technical and non-technical audiences.
Demonstrated ability to hire, coach, mentor, and grow technical talent while fostering collaboration, inclusion, accountability, and a high bar for scientific and delivery excellence.

Job Responsibility

Lead, coach, and develop a team of data scientists, AI/ML scientists, and analytics professionals, establishing clear priorities, high standards, career development plans, and an inclusive, accountable team culture.
Define and own the data science roadmap for enterprise forecasting, simulation, scenario planning, uncertainty quantification, predictive analytics, LLM-enabled applications, and AI-assisted decision support aligned to Amgen's planning, supply, commercial, manufacturing, operations, and patient-focused priorities.
Partner with senior business, product, program, operations, commercial, manufacturing, supply chain, finance, engineering, and AI stakeholders to translate ambiguous planning and decision challenges into prioritized data science initiatives with clear outcomes and measurable value.
Establish rigorous standards for forecast quality, model validation, experimentation, evaluation frameworks, guardrails, explainability, auditability, reproducibility, model monitoring, drift detection, and responsible AI practices in high-impact and regulated business contexts.
Create and maintain measurement frameworks to evaluate forecast accuracy, uncertainty calibration, decision quality, operational efficiency, reliability, user adoption, and business impact
lead build-measure-learn cycles that improve solutions based on real-world performance.
Serve as a senior advisor to stakeholders by communicating forecasts, uncertainty, model assumptions, trade-offs, risks, and recommendations in a clear, actionable way for both technical and executive audiences.
Manage the team portfolio, roadmap trade-offs, resourcing, stakeholder expectations, delivery risks, and dependencies across data science, engineering, product, and business teams.
Identify reusable methods, patterns, platforms, and governance practices that accelerate forecasting and AI decision-support delivery across Amgen and reduce duplication across teams.
Research and evaluate emerging open-source, vendor, and internal tools related to forecasting, decision intelligence, LLMs, AI agents, MLOps, model evaluation, and AI governance for potential application to Amgen business problems.

Fulltime

Senior Data Engineer (Contractor)

As a Senior Data Engineer, you’ll design, build, and operate scalable, reliable ...

Location

Greece

Salary:

Not provided

myPOS

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
6+ years of experience as a Data Engineer, building and maintaining production-grade pipelines and datasets
Strong Python and SQL skills with a solid understanding of data structures, performance, and optimization strategies
Hands-on experience with orchestration (like Airflow, Dagster, Databricks Workflows) and distributed processing in a cloud environment
Experience with analytical data modeling (star and snowflake schemas), DWH, ETL/ELT patterns, and dimensional concepts
Experience building reliable incremental data ingestion pipelines from DBs and APIs
Familiarity with at least one major cloud provider (GCP, AWS, Azure) and deploying data solutions in the cloud
Familiarity with CI/CD for data pipelines, IaC (Terraform), and/or DataOps practices
Strong troubleshooting mindset: ability to debug issues across data, infra, pipelines, and deployments
Collaborative mindset and clear communication across engineering, analytics, and business stakeholders

Job Responsibility

Build and maintain data pipelines for ingestion, transformation, and export across multiple sources and destinations
Develop and evolve scalable data architecture to meet business and performance requirements
Partner with analysts and data scientists to deliver curated, analysis-ready datasets and enable self-service analytics
Implement best practices for data quality, testing, monitoring, lineage, and reliability
Optimize workflows for performance, cost, and scalability (e.g., tuning Spark jobs, query optimization, partitioning strategies)
Ensure secure data handling and compliance with relevant data protection standards and internal policies
Contribute to documentation, standards, and continuous improvement of the data platform and engineering processes
Ensure secure, compliant handling of data and models, including access controls, auditability, and governance practices
Build and maintain MLOps automation: CI/CD for ML, environment management, artifact handling, versioning of data/models/code

What we offer

Vibrant international team operating in hi-tech environment
Excellent compensation package
myPOS Academy for upskilling and training
Unlimited access to courses on LinkedIn Learning
Refer a friend bonus as we know that working with friends is fun
Teambuilding, social activities and networks on a multi-national level

Senior Data Engineer

As a Senior Data Engineer, you'll design, build, and operate scalable, reliable ...

Location

Croatia

Salary:

Not provided

myPOS

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
6+ years of experience as a Data Engineer, building and maintaining production-grade pipelines and datasets
Strong Python and SQL skills with a solid understanding of data structures, performance, and optimization strategies
Hands-on experience with orchestration (like Airflow, Dagster, Databricks Workflows) and distributed processing in a cloud environment
Experience with analytical data modeling (star and snowflake schemas), DWH, ETL/ELT patterns, and dimensional concepts
Experience building reliable incremental data ingestion pipelines from DBs and APIs
Familiarity with at least one major cloud provider (GCP, AWS, Azure) and deploying data solutions in the cloud
Familiarity with CI/CD for data pipelines, IaC (Terraform), and/or DataOps practices
Strong troubleshooting mindset: ability to debug issues across data, infra, pipelines, and deployments
Collaborative mindset and clear communication across engineering, analytics, and business stakeholders

Job Responsibility

Build and maintain data pipelines for ingestion, transformation, and export across multiple sources and destinations
Develop and evolve scalable data architecture to meet business and performance requirements
Partner with analysts and data scientists to deliver curated, analysis-ready datasets and enable self-service analytics
Implement best practices for data quality, testing, monitoring, lineage, and reliability
Optimize workflows for performance, cost, and scalability (e.g., tuning Spark jobs, query optimization, partitioning strategies)
Ensure secure data handling and compliance with relevant data protection standards and internal policies
Contribute to documentation, standards, and continuous improvement of the data platform and engineering processes
Ensure secure, compliant handling of data and models, including access controls, auditability, and governance practices
Build and maintain MLOps automation: CI/CD for ML, environment management, artifact handling, versioning of data/models/code

What we offer

Vibrant international team operating in hi-tech environment
Excellent compensation package
myPOS Academy for upskilling and training
Unlimited access to courses on LinkedIn Learning
Refer a friend bonus as we know that working with friends is fun
Teambuilding, social activities and networks on a multi-national level

Fulltime

Senior Data Engineer (Contractor)

At myPOS, we’re all about helping businesses grow and get paid. We make payments...

Location

Serbia

Salary:

Not provided

myPOS

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
6+ years of experience as a Data Engineer, building and maintaining production-grade pipelines and datasets
Strong Python and SQL skills with a solid understanding of data structures, performance, and optimization strategies
Hands-on experience with orchestration (like Airflow, Dagster, Databricks Workflows) and distributed processing in a cloud environment
Experience with analytical data modeling (star and snowflake schemas), DWH, ETL/ELT patterns, and dimensional concepts
Experience building reliable incremental data ingestion pipelines from DBs and APIs
Familiarity with at least one major cloud provider (GCP, AWS, Azure) and deploying data solutions in the cloud
Familiarity with CI/CD for data pipelines, IaC (Terraform), and/or DataOps practices
Strong troubleshooting mindset
Collaborative mindset and clear communication across engineering, analytics, and business stakeholders

Job Responsibility

Build and maintain data pipelines for ingestion, transformation, and export across multiple sources and destinations
Develop and evolve scalable data architecture to meet business and performance requirements
Partner with analysts and data scientists to deliver curated, analysis-ready datasets and enable self-service analytics
Implement best practices for data quality, testing, monitoring, lineage, and reliability
Optimize workflows for performance, cost, and scalability (e.g., tuning Spark jobs, query optimization, partitioning strategies)
Ensure secure data handling and compliance with relevant data protection standards and internal policies
Contribute to documentation, standards, and continuous improvement of the data platform and engineering processes
Ensure secure, compliant handling of data and models, including access controls, auditability, and governance practices
Build and maintain MLOps automation: CI/CD for ML, environment management, artifact handling, versioning of data/models/code

What we offer

Vibrant international team operating in hi-tech environment
Excellent compensation package
myPOS Academy for upskilling and training
Unlimited access to courses on LinkedIn Learning
Refer a friend bonus
Teambuilding, social activities and networks on a multi-national level

Fulltime

Select Country

ML / Data Engineer – Data Science Enablement

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?