Member of Technical Staff, Data Infrastructure

Runway

Contract Type:
Not provided

Salary:

240000.00 - 290000.00 USD / Year

Job Description:

We're looking for a Data Engineer to build and scale the data infrastructure that powers Runway's AI research and business intelligence. You'll own critical data pipelines spanning production databases, analytics warehouses, and large-scale ML training datasets. This role sits at the intersection of data engineering, ML infrastructure, and analytics—you'll enable both world-class research and data-driven business decisions.

Job Responsibility:

  • Build and own pipelines for the creation, curation, and processing of large-scale multimodal datasets, including vector database (LanceDB) management and query optimization for ML metadata
  • Build and own ETL and CDC streams from Postgres and ClickHouse to analytics warehouses
  • Build standardized data transformation layers using dbt to replace ad-hoc SQL queries and create maintainable data models for business analytics
  • Manage production databases (Postgres, ClickHouse) and optimize for performance and reliability

Requirements:

  • 4+ years of industry experience in data engineering
  • Strong knowledge of Python
  • Experience with data quality, deduplication, and cleaning at scale
  • Comfortable working with cloud storage (S3) and managing large datasets
  • Experience building and maintaining ETL/CDC pipelines at scale
  • Strong SQL skills and experience with multiple database systems (Postgres, columnar databases like ClickHouse/Redshift)
  • Humility and open-mindedness

Nice to have:

  • Experience with one or more frameworks for large-scale data processing (e.g., Spark, Ray) and one or more ML frameworks (e.g., PyTorch, JAX)
  • Knowledge of cloud platforms (AWS, GCP, or Azure) and their data service offerings
  • Knowledge of data privacy and data security best practices
  • Experience with business intelligence and visualization tools (e.g., Looker, Tableau, PowerBI, Metabase, or similar)
  • Experience in a high-growth startup environment or similar fast-paced setting

Additional Information:

Job Posted:
January 20, 2026

Work Type:
Remote work

Similar Jobs for Member of Technical Staff, Data Infrastructure

Member of Technical Staff, AI Training Infrastructure

As a Training Infrastructure Engineer, you'll design, build, and optimize the in...
Location:
United States, San Mateo
Salary:
175000.00 - 220000.00 USD / Year
Fireworks AI
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience
  • 3+ years of experience with distributed systems and ML infrastructure
  • Experience with PyTorch
  • Proficiency in cloud platforms (AWS, GCP, Azure)
  • Experience with containerization, orchestration (Kubernetes, Docker)
  • Knowledge of distributed training techniques (data parallelism, model parallelism, FSDP)
Job Responsibility:
  • Design and implement scalable infrastructure for large-scale model training workloads
  • Develop and maintain distributed training pipelines for LLMs and multimodal models
  • Optimize training performance across multiple GPUs, nodes, and data centers
  • Implement monitoring, logging, and debugging tools for training operations
  • Architect and maintain data storage solutions for large-scale training datasets
  • Automate infrastructure provisioning, scaling, and orchestration for model training
  • Collaborate with researchers to implement and optimize training methodologies
  • Analyze and improve efficiency, scalability, and cost-effectiveness of training systems
  • Troubleshoot complex performance issues in distributed training environments
What we offer:
  • Meaningful equity in a fast-growing startup
  • Comprehensive benefits package
  • Fulltime

Member of Technical Staff, Cloud Infrastructure

As a Software Engineer on our Cloud Infrastructure team, you'll be at the forefr...
Location:
United States, New York, NY; San Mateo, CA; Redwood City, CA
Salary:
175000.00 - 220000.00 USD / Year
Fireworks AI
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • 5+ years of experience designing and building backend infrastructure in cloud environments (e.g., AWS, GCP, Azure)
  • Proven experience in ML infrastructure and tooling (e.g., PyTorch, TensorFlow, Vertex AI, SageMaker, Kubernetes, etc.)
  • Strong software development skills in languages like Python or C++
  • Deep understanding of distributed systems fundamentals: scheduling, orchestration, storage, networking, and compute optimization
Job Responsibility:
  • Architect and build scalable, resilient, and high-performance backend infrastructure to support distributed training, inference, and data processing pipelines
  • Lead technical design discussions, mentor other engineers, and establish best practices for building and operating large-scale ML infrastructure
  • Design and implement core backend services (e.g., job schedulers, resource managers, autoscalers, model serving layers) with a focus on efficiency and low latency
  • Drive infrastructure optimization initiatives, including compute cost reduction, storage lifecycle management, and network performance tuning
  • Collaborate cross-functionally with ML, DevOps, and product teams to translate research and product needs into robust infrastructure solutions
  • Continuously evaluate and integrate cloud-native and open-source technologies (e.g., Kubernetes, Ray, Kubeflow, MLFlow) to enhance our platform’s capabilities and reliability
  • Own end-to-end systems from design to deployment and observability, with a strong emphasis on reliability, fault tolerance, and operational excellence
What we offer:
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Fulltime

Member of Technical Staff - Platform Engineer

Platform Engineer to join our team building backend infrastructure for new ML-po...
Location:
United States, Palo Alto
Salary:
175000.00 - 350000.00 USD / Year
Inflection AI
Expiration Date:
Until further notice
Requirements:
  • Backend engineering experience with Python, TypeScript, or Node.js
  • Hands-on experience working with production PyTorch models, model checkpoints, and inference logic
  • Strong knowledge of building APIs and services that are scalable, stable, and secure
  • Passion for bridging backend engineering and ML systems, especially at the infrastructure layer
  • Familiarity with tools such as FastAPI, Postgres, Redis, Kubernetes, and React
  • Desire to be hands-on and contribute to shaping the foundation of a new enterprise ML product
  • Bachelor’s degree or equivalent in a field related to the requirements of the offered position
Job Responsibility:
  • Build and maintain backend services to support LLM integration, inference orchestration, and data flow
  • Write clean, reliable Python code for experimentation, model integration, and production systems
  • Collaborate closely with ML researchers to rapidly iterate on product ideas and deploy features
  • Design and implement infrastructure to handle scalable inference workloads and enterprise-level use cases
  • Own system components and ensure reliability, observability, and maintainability from day one
What we offer:
  • Diverse medical, dental and vision options
  • 401k matching program
  • Unlimited paid time off
  • Parental leave and flexibility for all parents and caregivers
  • Support of country-specific visa needs for international employees living in the Bay Area
  • Competitive stock options

Member of Technical Staff, Infrastructure Data & Analytics

We are seeking experienced Infrastructure Data & Analytics Engineers to join our...
Location:
United States, Multiple Locations; Mountain View; San Francisco Bay area; New York City metropolitan area
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in computer science or a related technical field AND 8+ years of technical engineering experience with data engineering, analytics, or data science, with increasing technical ownership in a startup environment, AND 6+ years of experience with distributed data processing frameworks and large-scale data systems, OR equivalent experience
  • Master's degree in computer science or a related technical field AND 12+ years of technical engineering experience with data engineering, analytics, or data science, with increasing technical ownership in a startup environment, AND 10+ years of experience with distributed data processing frameworks and large-scale data systems, OR equivalent experience
  • Proven technical leadership in data engineering, analytics platforms, or large-scale telemetry systems
  • Hands-on experience with ETL orchestration frameworks such as Airflow, Dagster, or similar
  • Strong communication skills; can explain complex systems clearly to senior leaders
Job Responsibility:
  • Act as the technical lead and owner for infrastructure analytics across compute, storage, and networking
  • Design and build durable, scalable data pipelines that ingest telemetry from clusters, schedulers, health systems, and capacity trackers into Data Warehouse
  • Define and standardize core metrics and semantics (e.g., utilization, occupancy, MFU, goodput, capacity readiness, delivery-to-production)
  • Architect and maintain self-service dashboards and APIs for fleet, cluster, and squad-level visibility
  • Partner closely with stakeholders across Supercomputing Infra, Researchers, Strategy and Executives to ensure metrics reflect operational and business reality
  • Implement robust and fault-tolerant systems for data ingestion and processing
  • Lead data architecture and engineering decisions, applying strong technical judgment to proactively shape executive-level discussions and decisions
  • Identify data gaps and instrumentation issues; drive fixes by influencing upstream engineering teams
  • Establish data quality, validation, documentation, and governance so metrics are trusted and repeatable
  • Fulltime

Data Architect

The Data Architect will be responsible for designing, developing, and implementi...
Location:
United States, Andover, Massachusetts
Salary:
155500.00 - 376000.00 USD / Year
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Bachelor's, Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • Strong understanding of data architecture, cloud infrastructure, and data management technologies
  • Proven experience driving innovations in data solutions and the productization of advanced development activities
  • Must have a track record of architecting, building, and deploying mission-critical, highly distributed, data-centric applications and solutions
  • Experience with at least one major IaaS and/or PaaS technology (OpenStack, AWS, Azure, VMware, etc.), including defining and scripting full topologies
  • Must be able to work in a global, complex, and diverse environment
Job Responsibility:
  • Designing, developing, and implementing robust data solutions to support business objectives
  • Collaborating with cloud and data architects to design and set standards for the HPE GreenLake Hybrid Cloud platform and data solution portfolio
  • Driving evaluation of data storage and integration solutions and conducting research on platform behavior under different workloads
  • Designing and implementing data pipelines
  • Optimizing data storage solutions
  • Establishing best practices for data integration and analysis
  • Leading advanced development teams building proof-of-concept implementations for data platforms and solutions
  • Acting as a cross-functional product and technical expert for hybrid cloud and data technologies
  • Providing consultation, design input, and feedback for product development and design reviews across multiple organizations
  • Guiding and mentoring less-experienced staff members
What we offer:
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Specific programs catered to career goals
  • Unconditionally inclusive work culture that celebrates individual uniqueness
  • Flexibility to manage work and personal needs
  • Opportunities for professional growth and development
  • Fulltime

Staff Machine Learning Engineer

Join PagerDuty as a Staff Machine Learning Engineer to tackle complex problems, ...
Location:
Canada, Toronto
Salary:
156000.00 - 232000.00 CAD / Year
PagerDuty
Expiration Date:
Until further notice
Requirements:
  • 8+ years of experience building, designing, and evolving data architecture for large-scale systems
  • Excellent communication skills
  • Experience working with Product teams, ensuring and driving a timely delivery
  • Have a deep understanding of the trade-offs to be considered when designing and delivering machine learning solutions to production
  • Experience leading cross-team architecture discussions, building technical prototypes, and driving the adoption of best practices across diverse teams
  • Demonstrated experience with data engineering processes, working with unstructured data and cloud-based data infrastructures
  • Passionate about ML engineering and interested in driving discussions with stakeholders and executives
Job Responsibility:
  • Build and improve the capabilities of the data platform that enable and accelerate the production of ML/AI-based solutions
  • Drive and define standards for AI/ML across the organization
  • Provide guidance, technical leadership, and mentoring to other members of the team
  • Mentor junior members and participate in scaling up the existing team
  • Proactively recommend improvements and new approaches addressing potential systemic pain points and technical debt
  • Anticipate technical demands on the data platform based on the organization’s roadmap and systematically drive the evolution of the architecture toward those ends
  • Develop a long-term plan for ML/AI investments
What we offer:
  • Competitive salary
  • Comprehensive benefits package from day one
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime

Senior Member of Technical Staff (Infrastructure)

About the Team: The Infrastructure team aims to make it seamless for our researc...
Location:
United Kingdom; France, London; Paris
Salary:
Not provided
H Company
Expiration Date:
Until further notice
Requirements:
  • Infrastructure as code (CDK, Terraform, ...)
  • Experience architecting and deploying distributed systems on public cloud (AWS, Azure, GCP)
  • Observability and monitoring (Datadog, Prometheus, Grafana, …)
  • Good knowledge of a modern programming language (ideally Python or JS/Typescript)
Job Responsibility:
  • Designing and managing the infrastructure to support Research efforts in Model and Agent development, including training infrastructure, data pipelines, and inference
  • Designing and managing the infrastructure to support Product Engineering efforts on H Company’s agent platform including client-facing APIs and agent runtimes within various deployment scenarios (multi-tenant and on-prem)
  • Set up and maintain observability and monitoring strategies
  • Mentor and grow other engineers in infrastructure-related topics as well as general engineering practices
What we offer:
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime

Staff Data Scientist

Neo Financial seeks an experienced and strategic Staff Data Scientist to provide...
Location:
Canada, Calgary
Salary:
Not provided
Neo Financial
Expiration Date:
Until further notice
Requirements:
  • 10+ years of experience deploying impactful ML models in production, driving commercial outcomes, and leading technical teams
  • Profound technical expertise in Python (pandas, scikit-learn), XGBoost (tuning, custom objectives, SHAP), and AWS (SageMaker, ECS)
  • Exceptional ML validation skills, including mitigating data leakage, rigorous OOT testing, and robust backtesting framework design
  • Extensive hands-on experience with Snowflake / Databricks; strong familiarity with MLflow & dbt, with proven ability to architect and optimize data workflows
  • A strategic, business-centric mindset; adept at navigating ambiguity, prioritizing in a fast-paced environment, and delivering high-value solutions on tight timelines
  • Demonstrated experience managing the technical development of data science teams of 5+ members, including mentoring staff and fostering a culture of technical excellence
  • Excellent communication and stakeholder management skills, able to articulate complex technical concepts to diverse audiences, including executive leadership
Job Responsibility:
  • Spearheading technical strategy and end-to-end delivery of sophisticated ML models across marketing and loyalty
  • Managing the technical development of data scientists, guiding complex projects, fostering skill growth, and ensuring high-quality model implementation
  • Championing and evolving model explainability and business trust via advanced SHAP insights, validation reports, and clear cross-functional communication with senior stakeholders
  • Architecting and enhancing MLOps infrastructure, including automating model pipelines, implementing advanced versioning/drift detection, and streamlining auto-retraining
  • Establishing and enforcing rigorous model validation frameworks (e.g., advanced OOT validation, sophisticated temporal splits, comprehensive cross-validation) for exceptional model quality, generalization, and compliance
  • Mentoring and developing data scientists at all levels, leading technical design, reviewing code/model logic, and spearheading knowledge-sharing
  • Collaborating with executive and business leaders to identify and prioritize high-value data science initiatives, ensuring models address strategic problems and deliver commercial impact
  • Staying current with data science, ML, and MLOps advancements, and driving adoption of innovative technologies within the team
What we offer:
  • All team members have a stake in Neo’s success and earn meaningful equity through stock options
  • Fulltime