
Platform Engineer - Data Science Platform

United States, Columbus, OH or Dallas, TX or Minneapolis, MN · Job Posted May 03, 2026

Job Responsibility

  • Support and maintain ongoing Data Science infrastructure operations
  • Design, build, and deploy AWS environments using automated CI/CD pipelines
  • Manage and scale large, secure cloud environments to support current and future Data Science initiatives
  • Implement, own, and improve the image management lifecycle process
  • Assist with the setup and ongoing management of AWS accounts dedicated to the Data Science platform
  • Develop and maintain infrastructure pipelines using CI/CD tools (e.g., Azure DevOps)
  • Build and manage environments using Infrastructure as Code (IaC) tools such as Terraform
  • Develop scripts and applications using programming languages such as Python
  • Manage and support database technologies including Athena, Oracle, MySQL, and PostgreSQL
  • Leverage AWS services to enable Data Lake, Data Science, and AI/ML workloads
  • Respond to requests from development and business users, removing technical roadblocks
  • Manage secured infrastructure environments, applying security controls and guardrails
  • Identify, remediate, and track infrastructure vulnerabilities within defined SLAs
  • Maintain audit logs and support compliance-related needs
  • Perform system upgrades and patching, and provide on-call support as required
  • Conduct root cause analysis and knowledge transfer sessions with internal teams
  • Collaborate closely with Network, Database, Infrastructure, and Architecture teams to align on platform strategy and delivery

Requirements

  • Bachelor's degree in Computer Science or a related field, or equivalent practical experience
  • 5+ years of experience supporting Data Science infrastructure
  • 5+ years of hands-on experience with AWS-hosted Data Lake, Data Science, or AI/ML platforms
  • 5+ years of working knowledge of Kubernetes
  • Experience with AWS services such as SageMaker, Glue, Lambda, and Athena
  • Experience with CI/CD tools such as Azure DevOps
  • Experience with Infrastructure as Code tools such as Terraform
  • Experience with container technologies, including Docker and Amazon ECR
  • Experience with security tools such as AQUA and Kenna
  • Experience producing technical documentation and written solutions

Nice to have

  • AWS, Terraform, or Kubernetes certifications (e.g., CKA)
  • Familiarity with technical service management and product lifecycle maintenance
  • Experience with scripting and programming languages such as Python and shell scripting
  • Strong working knowledge of database technologies including Athena, Oracle, and MySQL
  • Advanced experience with CI/CD and IaC tools, particularly Azure DevOps and Terraform

What we offer

  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan


Similar Jobs for Platform Engineer - Data Science Platform

8 matching positions

ML / Data Engineer – Data Science Enablement

This role will report to the Data Science Enablement Manager and support the Rar...
Location: India, Hyderabad
Salary: Not provided
Company: Amgen
Expiration Date: Until further notice
Requirements
  • Bachelor’s or Master’s in Computer Science, Data Engineering, or related technical field
  • 3–5 years of experience in ML engineering, data engineering, or related roles
  • Strong programming skills in Python and SQL
  • Experience with data pipeline development and distributed computing (e.g., Spark/PySpark)
  • Working knowledge of Databricks and at least one cloud platform (AWS, Azure, or GCP)
  • Experience with ML lifecycle tools (e.g., MLflow, Git, CI/CD pipelines)
  • Understanding of model deployment, monitoring, and reproducibility practices
Job Responsibility
  • Build and maintain scalable data and ML pipelines to support patient finding use cases across the patient journey
  • Productionize machine learning models by developing deployment workflows, APIs, and batch/real-time scoring pipelines
  • Design and implement model evaluation, validation, and monitoring frameworks (performance tracking, drift detection, alerting)
  • Enable end-to-end ML lifecycle management, including training, versioning, deployment, and retraining workflows
  • Partner with RDBU data science teams to translate analytical solutions into production-ready systems
  • Develop ML-ready datasets and feature pipelines, ensuring data quality, consistency, and reusability
  • Support model tracking and experiment management using standardized tools and frameworks
  • Build tools and utilities to monitor, track, and operationalize model outputs for downstream consumption
  • Collaborate with enterprise data and platform teams to ensure compliance with data governance, security, and architecture standards
  • Follow engineering best practices for code quality, documentation, testing, and CI/CD integration

Data Governance & Data Quality Platform Engineer

We are seeking a Data Governance & Data Quality Platform Engineer to own the tec...
Location: United States, Houston
Salary: Not provided
Company: Robert Half
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Data Engineering, or a related field
  • Hands‑on experience administering data governance and data quality platforms (e.g., Atlan, Monte Carlo, DQ tools), including API‑based integrations
  • Strong understanding of data catalogs, data lineage, and metadata management
  • Solid knowledge of data governance principles and data quality frameworks
  • Familiarity with Data Mesh concepts and decentralized data ownership models
  • Experience building data products and supporting orchestration workflows
Job Responsibility
  • Configure and maintain data governance platforms for metadata management, data lineage, and governance workflows
  • Configure data quality tools for profiling, rule creation, and monitoring dashboards
  • Manage platform security, including user roles, authentication, SSO, RBAC, and access controls
  • Develop and maintain integrations across data sources, databases, data lakes, and BI tools
  • Automate metadata ingestion and data quality checks using APIs, Python scripts, or ETL frameworks
  • Configure and maintain connectors for analytics and reporting platforms
  • Monitor platform health and optimize performance and scalability
  • Apply upgrades, patches, and troubleshoot technical issues
  • Implement logging, alerting, and proactive monitoring for governance and data quality environments
  • Provide Tier 3 support for platform‑related incidents and escalations
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan

Principal ML Systems Engineer, Data Platform (Autonomous Vehicles)

We are seeking a highly skilled and experienced Principal ML Systems Engineer to...
Location: United States, Austin; Bellevue
Salary: 233400.00 - 339650.00 USD / Year
Company: General Motors
Expiration Date: Until further notice
Requirements
  • BA or MS in Computer Science, Electrical Engineering, Mathematics, Physics, or another relevant field
  • 10+ years in building distributed data platforms using major cloud providers and open-source frameworks
  • Expert-level proficiency in Java, C++, or Python, with a proven track record of designing and implementing robust, distributed systems
  • Expertise in implementing Data Processing Frameworks (Beam, Spark) and serving layers optimized for high-throughput, low-latency delivery
  • Experience optimizing services for cost efficiency, performance & reliability
  • Experience with microservices architecture and proven ability to manage the full operational lifecycle of systems
  • Deep understanding of the full ML model lifecycle (feature engineering, training, validation, deployment, monitoring, etc.)
  • Strong passion for, and understanding of, self-driving technology and its potential impact on the world
  • Experience working with 100+ petabyte-scale ingestion, processing, and serving architectures
  • Experience with SQL engines / queries
Job Responsibility
  • Design & develop the next generation distributed ML data platform (Ingestion, Processing, Serving) using GCP and open-source frameworks
  • Lead the strategy for building performant and efficient multi-cloud platforms
  • Collaborate with stakeholders (ML & Data Engineers), translate needs & pain points into requirements, build self-serve capabilities and drive adoption
  • Deliver end-to-end technical projects, owning major technical decisions and tradeoffs, and contribute to the team’s strategic roadmap
  • Champion engineering & operational excellence by continuously improving systems and processes
  • Actively participate in the team’s planning, code reviews, and design discussions
  • Conduct technical interviews, onboard new engineers, and mentor junior engineers
What we offer
  • Relocation benefits
  • Company vehicle evaluation program
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • Full-time

Senior Data Science Engineer

At T-Mobile Advertising Solutions, we're building privacy-first advertising prod...
Location: United States, Philadelphia; New York
Salary: 116500.00 - 210100.00 USD / Year
Company: T-Mobile
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree plus 5 years of related work experience OR Advanced degree with 3 years of related experience
  • Acceptable areas of study include Quantitative Discipline (math, statistics, economics, computer science, physics, engineering, etc.)
  • 4-7 years of experience building and deploying machine learning and deep learning solutions at scale
  • Familiarity with MLOps and DevOps practices and tools
  • 4-7 years of experience working within big data architecture, modern analytical data platforms, and large-scale data warehousing technologies (e.g., BigQuery, Snowflake, Redshift)
  • 4-7 years of experience working with large-scale distributed data systems and cloud platforms (e.g., SQL, Python, Scala, AWS)
  • 4-7 years of experience solving complex data, machine learning, or algorithmic challenges in production environments using modern engineering practices
  • At least 18 years of age
  • Legally authorized to work in the United States
Job Responsibility
  • Lead the end-to-end development of machine learning and data products aligned to business objectives, from problem framing through deployment and monitoring
  • Build scalable data, training, and inference pipelines using distributed processing and cloud technologies
  • Apply statistical methods, experimentation, and validation frameworks to ensure solution quality and business impact
  • Write production-quality code and contribute to engineering best practices, including testing, CI/CD, and observability
  • Collaborate across engineering, product, and business teams while leading other engineers and data scientists
What we offer
  • Competitive base salary
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off and up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Full-time

SVP Senior Market Data Platform Engineer

Citi's Equities Technology organization is seeking a hands-on Senior Market Data...
Location: United States, Jersey City
Salary: 176720.00 - 265080.00 USD / Year
Company: Citi
Expiration Date: Until further notice
Requirements
  • 12+ years of hands-on engineering experience building low-latency, high-throughput systems, with strong proficiency in modern C++
  • Prior experience designing or owning real-time market data distribution platforms in a global financial institution (Equities required; multi-asset a plus)
  • Deep, hands-on experience with UDP multicast in production environments, including loss, recovery, and operational behavior
  • Strong understanding of low-latency techniques, including NUMA, core pinning / CPU isolation, IRQ affinity, kernel and network tuning, memory management, and cache locality
  • Production experience using Aeron, including transport/driver tuning, back-pressure management, loss/replay semantics, and performance diagnostics
  • Demonstrated ownership of wire-to-consumer latency observability, including histograms, tail analysis, and regression detection
  • Strong understanding of exchange microstructure and regional market structure differences across global venues
  • Ability to prioritize multiple initiatives and deliver in a fast-paced, globally distributed environment
  • Excellent communication skills, with the ability to reason about and clearly explain complex systems to engineers, quants, and trading stakeholders
Job Responsibility
  • Design and develop Citi's next-generation real-time market data distribution platforms delivering normalized market data to multiple internal consumers across regions
  • Build and evolve performance-critical C++ components for market data ingestion, normalization, and fan-out
  • Define and enforce platform standards for market data semantics, schemas, and correctness guarantees
  • Engineer the end-to-end wire-to-consumer hot path, with explicit focus on NUMA-aware design, core pinning / CPU isolation, lock-free concurrency, memory layout, and cache locality
  • Own operating-system and runtime-level optimizations, including Linux kernel and network tuning, IRQ affinity, threading models, and performance profiling
  • Ensure deterministic performance under burst load, packet loss, and volatile market conditions
  • Use Aeron (or equivalent) for high-performance fan-out and IPC, with hands-on responsibility for driver configuration, transport selection, and tuning. Own Aeron back-pressure behavior, loss/replay semantics, and performance diagnostics to meet latency and correctness targets
  • Own latency measurement and observability using histograms (p50/p95/p99/p99.9) and tail analysis; establish baselines and alerting on distribution shifts
  • Drive high standards in code quality, automated testing, and SDLC discipline, with particular emphasis on correctness and latency regression safety
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • Full-time

Data Platform Engineer

OneTrust’s mission is to enable innovation through the responsible use of data a...
Location: United States, New York
Salary: 81150.00 - 121725.00 USD / Year
Company: OneTrust
Expiration Date: Until further notice
Requirements
  • Bachelor's or master's degree in Computer Science, Engineering, or a related field
  • 3+ years of overall experience and 3+ years of experience with very large-scale data warehouse projects
  • Strong experience in SQL and dimensional modeling, and in supporting data warehouses and scaling, optimizing, and performance-tuning ETL pipelines
  • 1+ years of experience with Python and manipulation of various data formats for extraction and transformation
  • 1+ years of experience with Snowflake, Airflow, ETL/Integration tools administration
  • Hands-on experience with Data Warehouse technologies (Snowflake, Redshift) and Big Data technologies (Hadoop, Hive, Spark, Kafka, Kinesis)
  • Experience evaluating new tools and enriching existing architecture
  • You are a self-starter and hands-on technical expert who continuously gathers and synthesizes high-impact needs from business partners, designs and implements the appropriate technical platform solution, and effectively communicates about deliverables, timelines, and tradeoffs
Job Responsibility
  • Build and automate modern data platforms (AWS, Azure, Snowflake) with native technologies, deploy applications, and provision infrastructure
  • Administer, maintain, and support production systems (Snowflake, Fivetran, DBT, Power BI, Airflow)
  • Build and maintain CI/CD pipelines and infrastructure as code (IaC) practices across the platform for safe and reproducible deployments
  • Design, build, and implement an automation framework for scale that delivers data with measurable quality under the SLA
  • Drive technical conversations with stakeholders to understand the platform's needs and come up with a provisioning strategy
  • Work with multiple team members using Scrum to deliver value effectively
  • Reduce technical debt over time through root cause identification and issue resolution, and promote best practices across the platform
What we offer
  • comprehensive healthcare coverage
  • flexible PTO
  • equity RSUs
  • annual performance bonus opportunities
  • retirement account support
  • 14+ weeks of paid parental leave
  • career development opportunities
  • company-paid privacy certification exam fees
  • Full-time

Data Platform Engineer - Assistant Vice President

We are seeking a talented and passionate engineer to join our growing team. As a...
Location: India, Pune
Salary: Not provided
Company: Citi
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • Minimum 5 years of experience developing and deploying production-ready Java applications in a data engineering context
  • Strong experience with core Java (version 11 or higher), SQL, and database APIs
  • Proven experience working with distributed stream processing frameworks like Apache Flink, Spark Streaming, or Kafka Streams
  • Experience with event-driven architectures and real-time data processing
  • Solid understanding of OOP concepts, multithreading, and thread pools
  • Familiarity with containerization technologies like Docker and deployment platforms like OpenShift, ECS, or Kubernetes is a plus
  • Experience producing high quality code using agentic coding assistants
  • Excellent communication and collaboration skills
Job Responsibility
  • Design, develop, and maintain a robust and scalable data platform using Java and related technologies (e.g., Apache Flink, Kafka, Trino)
  • Advise data engineers on how to build and optimize real-time and batch data processing applications to support low-latency requirements
  • Extend the platform with data integration solutions between various data sources and targets, including databases, APIs, and streaming platforms
  • Contribute to the design and development of event-driven architectures
  • Write clean, well-documented, and testable code
  • Collaborate effectively with other engineers, product managers, and stakeholders throughout the software development lifecycle (SDLC), adhering to Agile methodologies
  • Stay up-to-date with the latest trends and technologies in the data engineering space
  • Full-time

Senior Data Platform Engineer

Join our Data Platform team at 10x Genomics to architect and implement our strat...
Location: United States, Pleasanton
Salary: 168200.00 - 227600.00 USD / Year
Company: 10x Genomics
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Information Management, or a related field, or equivalent experience
  • 5+ years of hands-on experience in software engineering focused on data platform development, distributed systems, or enterprise integrations
  • Proven experience designing and implementing highly scalable data platforms on major cloud environments (e.g., AWS, GCP, or Azure)
  • Deep proficiency in one or more general-purpose programming languages (e.g., Python, Java, or similar)
  • Strong foundation in computer science fundamentals, including data structures, algorithms, and system design
Job Responsibility
  • Architect and implement the canonical data layer and Event-Driven Architecture (EDA) using technologies like Apache Iceberg and Kafka to decouple applications and ensure real-time data flow
  • Design, build, and optimize high-volume, code-first data pipelines (real-time and batch) across a large application landscape (e.g., Salesforce, Oracle, Workday)
  • Establish Amazon S3 as the Single Source of Truth (SSOT) and govern data using principles like the Medallion Architecture (Silver and Gold layers) and schema evolution
  • Develop, test, and maintain robust and scalable ELT pipelines and data models in Snowflake, including leveraging advanced features like Snowpipes, Streams, and Stored Procedures
  • Develop the data presentation layer for self-service analytics, including the Natural Language Query (NLQ) interface integrated with Generative AI (e.g., Bedrock)
  • Lead technical efforts to migrate key business domains off legacy middleware and onto the new platform, eliminating the 'Integration Bottleneck'
  • Define and enforce data governance, quality, and security standards across the Unified Data Platform
  • Collaborate with the Architecture Review Board (ARB) to promote modern approaches such as serverless computing and Domain-Driven Design
  • Take ownership of the full development lifecycle, from prototyping and design through deployment, monitoring, and operational excellence
What we offer
  • Equity grants
  • Comprehensive health and retirement benefit programs
  • Annual bonus program or sales incentive program
  • Health Package
  • Easy-to-use Benefits
  • Family oriented policies including parental leave
  • Generous Time Off
  • Award-Winning Workplace
  • Full-time