Data Infrastructure Engineer

United States, New York or DC · Job Posted December 11, 2025

Job Description

This young, early-stage start-up challenger is currently looking for a hands-on Data Infrastructure Engineer to join its small team and help drive the business forward. This could be an excellent opportunity for an experienced Data Infrastructure Engineer with founding or small start-up experience to take the next step into a position with a well-run, ambitious organisation in an innovative space. The company is building cutting-edge AI platforms and operates in a space where it has an opportunity to gain significant market share very quickly.

Job Responsibility

  • Developing secure data sharing middleware
  • Integrating software seamlessly into the workflows of specialized professionals, ensuring secure and efficient data access throughout the asset recruitment process
  • Providing solutions for the full data stack, spanning data management, software development, and the model and deployment lifecycles

Requirements

  • Startup Energy: You thrive in fast-paced environments, manage ambiguity well, and focus on what moves the needle
  • Designing and deploying intuitive, user-friendly APIs
  • Demonstrated ability to train and deploy models at scale
  • Successfully launching machine learning services, particularly those leveraging LLMs, embeddings, and inference, into production environments
  • Handling and securing large-scale production data
  • Demonstrated proficiency in Python, Go, or C
  • A proactive approach to tackling complex challenges in a fast-paced, early-stage environment
  • A passion for innovation and a collaborative spirit

Nice to have

  • Skills in building and deploying ML-operated Search at scale
  • Experience in infrastructure management (Docker, Kubernetes, AWS)
  • Expertise in encryption, authentication, DevOps, or SRE

What we offer

  • Equity

Similar Jobs for Data Infrastructure Engineer

8 matching positions

Data Infrastructure Engineer

This young, early-stage start-up challenger is currently looking for a hands-on...
Location: United States, New York or DC
Salary: Not provided
Orbis Consultants
Expiration Date: Until further notice
Requirements
  • Startup Energy: You thrive in fast-paced environments, manage ambiguity well, and focus on what moves the needle
  • Designing and deploying intuitive, user-friendly APIs
  • Demonstrated ability to train and deploy models at scale
  • Successfully launching machine learning services, particularly those leveraging LLMs, embeddings, and inference, into production environments
  • Handling and securing large-scale production data
  • Demonstrated proficiency in Python, Go, or C
  • A proactive approach to tackling complex challenges in a fast-paced, early-stage environment
  • A passion for innovation and a collaborative spirit
Job Responsibility
  • Developing secure data sharing middleware
  • Integrating software seamlessly into the workflows of specialized professionals, ensuring secure and efficient data access throughout the asset recruitment process
  • Building, shipping and supporting mission-critical services in support of the services that make up the Data platform
  • Providing solutions for the full data stack, spanning data management, software development, and the model and deployment lifecycles
What we offer
  • Competitive Salary + Equity
  • Fulltime

Data Infrastructure Engineer

Data Infrastructure Engineer – New York or DC (hybrid) – Competitive Salary + Eq...
Location: United States, New York or DC
Salary: Not provided
Orbis Consultants
Expiration Date: Until further notice
Requirements
  • Startup Energy: You thrive in fast-paced environments, manage ambiguity well, and focus on what moves the needle
  • Designing and deploying intuitive, user-friendly APIs
  • Demonstrated ability to train and deploy models at scale
  • Successfully launching machine learning services, particularly those leveraging LLMs, embeddings, and inference, into production environments
  • Handling and securing large-scale production data
  • Demonstrated proficiency in Python, Go, or C
  • A proactive approach to tackling complex challenges in a fast-paced, early-stage environment
  • A passion for innovation and a collaborative spirit
Job Responsibility
  • Joining as part of the founding Engineering team, you will be a key part of developing secure data sharing middleware
  • Their software will integrate seamlessly into the workflows of specialized professionals, ensuring secure and efficient data access throughout the asset recruitment process
  • The Data Infrastructure Engineer role requires a mix of software development and ML Ops practices, resulting in an exciting, fast-paced engineering role
  • You will be able to demonstrate experience building, shipping and supporting mission-critical services in support of the services that make up the Data platform
  • This role requires the ability to provide solutions for the full data stack, spanning data management, software development, and the model and deployment lifecycles
What we offer
  • Competitive Salary + Equity
  • Fulltime

Data Infrastructure Engineer

This young, early-stage start-up challenger is currently looking for a hands-on ...
Location: United States, New York or DC
Salary: Not provided
Orbis Consultants
Expiration Date: Until further notice
Requirements
  • Startup Energy: You thrive in fast-paced environments, manage ambiguity well, and focus on what moves the needle
  • Designing and deploying intuitive, user-friendly APIs
  • Demonstrated ability to train and deploy models at scale
  • Successfully launching machine learning services, particularly those leveraging LLMs, embeddings, and inference, into production environments
  • Handling and securing large-scale production data
  • Demonstrated proficiency in Python, Go, or C
  • A proactive approach to tackling complex challenges in a fast-paced, early-stage environment
  • A passion for innovation and a collaborative spirit
Job Responsibility
  • Developing secure data sharing middleware
  • Integrating software seamlessly into the workflows of specialised professionals, ensuring secure and efficient data access throughout the asset recruitment process
  • Providing solutions for the full data stack, spanning data management, software development, and the model and deployment lifecycles
What we offer
  • Equity
  • Opportunity to work with an Ambitious, Rapidly-Growing Start-Up
  • Fulltime

Data Infrastructure Engineer

A venture-backed startup at the intersection of AI and national security is buil...
Location: United States, New York City Metropolitan Area
Salary: Not provided
Orbis Consultants
Expiration Date: Until further notice
Requirements
  • Strong engineering experience in Python, Go, or C
  • Experience building and scaling production data systems
  • Hands-on expertise with model deployment and ML Ops practices
  • Knowledge of database design, performance tuning, and operations
  • Someone who thrives in early-stage, fast-paced environments and enjoys tackling complex challenges
Job Responsibility
  • Build and maintain the data pipelines and infrastructure that power ML applications
  • Deploy and manage models at scale, from training through production
  • Design APIs and services that integrate smoothly into mission-critical workflows
  • Ensure data is handled and secured properly across large, distributed environments
  • Collaborate closely with a small, fast-moving team to solve hard technical problems in real-world settings
What we offer
  • Significant equity
  • Strong health & wellness benefits
  • Fulltime

Data Infrastructure Engineer

The Data Infrastructure team builds distributed systems and tools supporting Int...
Location: Ireland, Dublin
Salary: Not provided
Intercom
Expiration Date: Until further notice
Requirements
  • 3+ years of full-time, professional work experience in the data space using Python and SQL
  • Solid experience building and running data pipelines for large and complex datasets including handling dependencies
  • Hands-on cloud provider experience (preferably AWS) including service integrations and automation via CLI and APIs
  • Solid understanding of data security practices and a passion for privacy
  • Some DevOps experience
  • You care about your craft
Job Responsibility
  • Evolve the Data Platform by designing and building the next generation of the stack
  • Develop, run and support our data pipelines using tools like Airflow, PlanetScale, Kinesis, Snowflake, Tableau, all in AWS
  • Collaborate with product managers, data engineers, analysts and data scientists to develop tooling and infrastructure to support their needs
  • Develop automation and tooling to support the creation and discovery of high quality analytics data in an environment where dozens of changes can be shipped daily
  • Implement systems to monitor our infrastructure, detect and surface data quality issues and ensure Operational Excellence
What we offer
  • Competitive salary and equity in a fast-growing start-up
  • We serve lunch every weekday, plus a variety of snack foods and a fully stocked kitchen
  • Regular compensation reviews - we reward great work!
  • Pension scheme & match up to 4%
  • Peace of mind with life assurance, as well as comprehensive health and dental insurance for you and your dependents
  • Open vacation policy and flexible holidays so you can take time off when you need it
  • Paid maternity leave, as well as 6 weeks of paternity leave for fathers, to let you spend valuable time with your loved ones
  • If you’re cycling, we’ve got you covered on the Cycle-to-Work Scheme, with secure bike storage too
  • MacBooks are our standard, but we also offer Windows for certain roles when needed
  • Fulltime

Senior Solutions Engineer – Big Data & Data Infrastructure

This is a great opportunity to be part of one of the fastest-growing infrastruct...
Location: Israel, Tel Aviv
Salary: Not provided
VAST Data
Expiration Date: Until further notice
Requirements
  • 2–4 years in software / solution or infrastructure engineering
  • 2–4 years focused on building / maintaining large-scale data pipelines / storage & database solutions
  • Proficiency in Trino, Spark (Structured Streaming & batch) and solid working knowledge of Apache Kafka
  • Coding background in Python (must-have)
  • Deep understanding of data storage architectures including SQL, NoSQL, and HDFS
  • Solid grasp of DevOps practices, including containerization (Docker), orchestration (Kubernetes), and infrastructure provisioning (Terraform)
  • Experience with distributed systems, stream processing, and event-driven architecture
  • Hands-on familiarity with benchmarking and performance profiling for storage systems, databases, and analytics engines
  • Excellent communication skills
Job Responsibility
  • Build distributed data pipelines using technologies like Kafka, Spark (batch & streaming), Python, Trino, Airflow, and S3-compatible data lakes
  • Design, deploy, and troubleshoot hybrid cloud/on-prem environments using Terraform, Docker, Kubernetes, and CI/CD automation tools
  • Implement event-driven and serverless workflows
  • Create technical guides, architecture docs, and demo pipelines
  • Integrate data validation, observability tools, and governance directly into the pipeline lifecycle
  • Own end-to-end platform lifecycle: ingestion → transformation → storage (Parquet/ORC on S3) → compute layer (Trino/Spark)
  • Benchmark and tune storage backends (S3/NFS/SMB) and compute layers for throughput, latency, and scalability using production datasets
  • Work cross-functionally with R&D to push performance limits across interactive, streaming, and ML-ready analytics workloads
  • Operate and debug object store–backed data lake infrastructure

Staff Data Engineer - Vehicle Telemetry and Data Infrastructure

We are looking for a Staff Data Engineer to own the telemetry data platform for ...
Location: United States, Palo Alto
Salary: 230000.00 - 250000.00 USD / Year
ALSO
Expiration Date: Until further notice
Requirements
  • 10+ years of experience in data engineering and/or backend platform engineering operating production systems at scale
  • Deep hands-on experience with large-scale telemetry or IoT data, including high-throughput and low-latency ingestion
  • Strong expertise in AWS data and infrastructure services (S3, Kinesis/MSK, Glue, EMR, Lambda, Step Functions, EventBridge)
  • Proven experience owning end-to-end ETL/ELT infrastructure using Spark/PySpark (batch and streaming) on Databricks or EMR
  • Solid understanding of streaming architectures using Kafka or equivalent systems and time-series–optimized storage patterns
  • Strong backend engineering skills using Python and/or Java/Scala, including API design (REST/gRPC) and distributed systems fundamentals
  • Experience with data platform architectures such as data lakes and lakehouses, schema registries, and metadata systems
  • Hands-on experience with orchestration frameworks (Airflow, MWAA, Dagster) and production-grade observability (logging, metrics, tracing)
  • Infrastructure-as-code expertise using CloudFormation, Terraform, or CDK to manage scalable and reliable systems
  • A track record of building highly reliable, fault-tolerant systems with clear ownership, strong SLAs, and operational excellence
Job Responsibility
  • Design and own large-scale ingestion pipelines for vehicle telemetry data (events, metrics, time-series) with high throughput and low latency
  • Architect and operate end-to-end ETL/ELT systems from raw ingestion to warehouse/lake consumption
  • Define schema evolution, versioning, and backward-compatibility strategies for telemetry data at scale
  • Build safe and repeatable backfill, replay, and reprocessing mechanisms for historical and real-time data
  • Design data storage and lifecycle strategies across hot, warm, and cold paths to balance cost and performance
  • Develop fault-tolerant, observable, and debuggable pipelines with strong SLAs around freshness, completeness, and latency
  • Implement backend services and APIs for telemetry ingestion, configuration management, metadata, and orchestration
  • Apply strong software engineering practices including object-oriented design, automated testing, CI/CD, and code reviews
  • Establish automated data quality checks, anomaly detection, alerting, lineage, and auditability across the platform
  • Provide technical leadership by setting platform direction, reviewing designs, mentoring engineers, and influencing product and engineering roadmaps
What we offer
  • Robust health coverage. Excellent health, dental and vision insurance covered up to 100% by ALSO with FSA & HSA options
  • One Medical membership and dedicated insurance advocates
  • Rich fertility and family building benefits with Progyny
  • Flexible time off
  • 401(k) match
  • Fulltime

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location: United Arab Emirates, Dubai
Salary: Not provided
NorthBay
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue, Amazon EMR (Spark), AWS Lambda, Apache Airflow (MWAA), Amazon EC2, Amazon CloudWatch, Amazon Redshift, Amazon DynamoDB, AWS DataZone
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
  • Fulltime