CrawlJobs Logo

AI Research Engineer, Data Infrastructure

1x.tech Logo

1X Technologies

Location Icon

Location:
United States, Palo Alto

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

180000.00 - 250000.00 USD / Year

Job Description:

As a Research Engineer in Infrastructure, you will design and implement a robust data engine to manage the data collected by our humanoid robot fleet. You’ll be responsible for making this data easily accessible for querying and training purposes. Your work will support high-quality data pipelines, enabling efficient model development, large-scale annotation, and integration across robotic, on-premise, and cloud systems.

Job Responsibility:

  • Optimize operational efficiency of data collection across the NEO robot fleet
  • Design intelligent triggers to determine when and what data should be uploaded from the robots
  • Automate ETL pipelines to make fleet-wide data easily queryable and training-ready
  • Collaborate with external dataset providers to prepare diverse multi-modal pre-training datasets
  • Build frontend tools for visualizing and automating the labeling of large datasets
  • Develop machine learning models for automatic dataset labeling and organization

Requirements:

  • Strong experience in building data pipelines and ETL systems
  • Ability to design and implement systems for data collection and management from robotic fleets
  • Familiarity with architectures that span on-robot components, on-premise clusters, and cloud infrastructure
  • Experience with data labeling tools or building dataset visualization and annotation tooling
  • Proficiency in creating or applying machine learning models for dataset organization and automated labeling
What we offer:
  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays

Additional Information:

Job Posted:
December 14, 2025

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI Research Engineer, Data Infrastructure

New

AI Research Engineer

We're seeking a Research Engineer to conduct innovative research in key AI areas...
Location
Location
United Kingdom
Salary
Salary:
Not provided
prolific.com Logo
Prolific
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of engineering experience with significant AI/ML focus
  • Demonstrated research experience through publications, open-source contributions, or impactful projects
  • Strong engineering fundamentals and experience implementing AI systems in production environments
  • Deep knowledge of LLM evaluation methodologies, alignment techniques, and model optimization approaches
  • Experience with model fine-tuning, adapters, quantization, and distillation frameworks
  • Self-motivation and ability to define and pursue research directions independently
  • Excellent understanding of current challenges in AI safety, reliability, and alignment
  • Strong communication skills and ability to explain complex research concepts clearly
  • Passion for staying current with the rapidly evolving AI research landscape
Job Responsibility
Job Responsibility
  • Lead independent research projects in AI evaluation methodologies, alignment techniques, and synthetic data generation
  • Design and implement novel evaluation frameworks for LLMs and agent systems that are grounded in human data
  • Contribute to the academic AI community through publications and open-source contributions
  • Stay at the forefront of AI research and pioneer innovative approaches to tackle pressing open challenges in the field
  • Design and conduct rigorous experiments to study AI models and systems with sound methodological approaches
  • Develop scalable frameworks for systematic evaluation of model behaviours and capabilities
  • Create tools and frameworks that transform research insights into practical applications
  • Build infrastructure to support large-scale research experiments when needed
  • Apply knowledge of model fine-tuning, optimization techniques, distillation, and other ML engineering practices to support research goals
  • Work closely with ML engineers, data scientists, and product teams to translate research insights into practical applications
What we offer
What we offer
  • competitive salary
  • benefits
  • remote working
  • impactful, mission-driven culture
Read More
Arrow Right
New

Software Engineer, AI Infrastructure

As a Software Engineer on our AI Infrastructure team, you will help design the c...
Location
Location
United States , New York, NY; San Mateo, CA
Salary
Salary:
Not provided
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
  • 3 years of experience in software engineering, with a focus on infrastructure or machine learning systems
  • Strong programming skills in Python, Go, or a similar language
  • Proven experience in ML infrastructure and tooling (e.g., PyTorch, MLflow, Vertex AI, SageMaker, Kubernetes, etc.)
  • Basic understanding of LLM knowledge (e.g., context length, disaggregated prefill, KV cache memory estimation, etc)
Job Responsibility
Job Responsibility
  • Contribute to the design and development of scalable backend infrastructure that supports distributed training, inference, and data pipelines
  • Build and maintain core backend services such as LLM CI/CD pipeline, control plane, and model serving systems
  • Support performance optimization, cost efficiency, and reliability improvements across compute, storage, and networking layers
  • Building frameworks and safeguards to ensure Fireworks AI has the best model quality in the industry
  • Collaborate with performance, training, and product teams to translate research and product needs into infrastructure solutions
  • Participate in code reviews, technical discussions, and continuous integration and deployment processes
What we offer
What we offer
  • Solve Hard Problems: Tackle challenges at the forefront of AI infrastructure
  • Build What’s Next: Work with bleeding-edge technology that impacts how businesses and developers harness AI globally
  • Ownership & Impact: Join a fast-growing, passionate team where your work directly shapes the future of AI—no bureaucracy, just results
  • Learn from the Best: Collaborate with world-class engineers and AI researchers who thrive on curiosity and innovation
  • Fulltime
Read More
Arrow Right
New

Software Engineer, Data Infrastructure

The Data Infrastructure team at Figma builds and operates the foundational platf...
Location
Location
United States , San Francisco; New York
Salary
Salary:
149000.00 - 350000.00 USD / Year
figma.com Logo
Figma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of Software Engineering experience, specifically in backend or infrastructure engineering
  • Experience designing and building distributed data infrastructure at scale
  • Strong expertise in batch and streaming data processing technologies such as Spark, Flink, Kafka, or Airflow/Dagster
  • A proven track record of impact-driven problem-solving in a fast-paced environment
  • A strong sense of engineering excellence, with a focus on high-quality, reliable, and performant systems
  • Excellent technical communication skills, with experience working across both technical and non-technical counterparts
  • Experience mentoring and supporting engineers, fostering a culture of learning and technical excellence
Job Responsibility
Job Responsibility
  • Design and build large-scale distributed data systems that power analytics, AI/ML, and business intelligence
  • Develop batch and streaming solutions to ensure data is reliable, efficient, and scalable across the company
  • Manage data ingestion, movement, and processing through core platforms like Snowflake, our ML Datalake, and real-time streaming systems
  • Improve data reliability, consistency, and performance, ensuring high-quality data for engineering, research, and business stakeholders
  • Collaborate with AI researchers, data scientists, product engineers, and business teams to understand data needs and build scalable solutions
  • Drive technical decisions and best practices for data ingestion, orchestration, processing, and storage
What we offer
What we offer
  • equity
  • health, dental & vision
  • retirement with company contribution
  • parental leave & reproductive or family planning support
  • mental health & wellness benefits
  • generous PTO
  • company recharge days
  • a learning & development stipend
  • a work from home stipend
  • cell phone reimbursement
  • Fulltime
Read More
Arrow Right

AI Research Infrastructure Engineer

Block is scaling Customer Insights into an AI-powered insights accelerator that ...
Location
Location
United States , Bay Area
Salary
Salary:
168300.00 - 297000.00 USD / Year
block.xyz Logo
Block
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in research, automation implementation, analytics, or related technical fields with hands-on workflow optimization experience
  • 3+ years implementing AI/ML solutions, with experience in automation, LLM integration, or applied AI/analytics workflows
  • Hands-on technical skills in programming languages (Python, R, SQL) for automation development, API/MCP integrations, cloud platforms, and research data pipeline creation
  • Experience with research and analytic platforms and tools (Qualtrics, Snowflake, etc) or transferable experience with analytics and automation platforms
  • Strong technical communication and translation skills with ability to make complex AI/ML concepts, data architecture decisions, and automation workflows accessible and actionable for researchers, product managers, and business stakeholders
  • Proven ability to build stakeholder confidence and alignment during technology transformation
  • Strong project management skills with ability to coordinate multiple complex automation initiatives, manage competing priorities, and deliver measurable operational efficiency gains (reduced cycle times, improved quality outcomes, increased research capacity)
Job Responsibility
Job Responsibility
  • Design, build, and deploy AI agents and agentic workflows that automate research operations from study design through insights delivery, using LLMs, prompt engineering, MCP (Model Context Protocol) integrations, and workflow orchestration integrated with existing research and analytics tech stack
  • Design, build, and maintain automated data pipelines that ingest, transform, and unify research data from diverse sources (surveys, transcripts, analytics, behavioral logs) into AI-ready repositories with RAG capabilities for instant insight access via tools like Goose
  • Architect ETL/ELT frameworks using Python, SQL or equivalent tools to ensure data consistency, traceability, and scalability
  • Develop data models and schemas for research metadata, participant data, and AI-generated insights to support efficient querying and analysis
  • Design and prototype research automation systems using AI/ML techniques, partnering with design & engineering teams to productionize solutions
  • Partner with engineering, design, and platform teams to integrate research automation systems with Block's tech stack (i.e. Goose, GitHub, etc.) and establish governance frameworks for quality, ethics, and compliance
  • Mentor team members on AI agent development, agentic system design, and research automation best practices to build organizational capabilities in intelligent automation
What we offer
What we offer
  • Remote work
  • medical insurance
  • flexible time off
  • retirement savings plans
  • modern family planning
  • Fulltime
Read More
Arrow Right

Research Engineer, Data Infrastructure

As a Research Engineer in Data Infrastructure, you will design and implement a “...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 250000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in building data pipelines and ETL systems
  • Ability to design and implement systems that collect, upload, and manage data from robotic fleets
  • Familiarity with architectures combining on‑robot components, on‑premises clusters, and cloud systems
  • Experience with data labeling tools or building tooling for dataset visualization and annotation
  • Skills in creating or applying machine learning models for dataset organization / automated labeling
Job Responsibility
Job Responsibility
  • Optimize operational efficiency of data collection on the NEO fleet
  • Design triggers on the robot to determine if and when data should be uploaded
  • Automate ETL pipelines so fleet‑wide data is easily queryable and available for training
  • Work with external dataset providers to prepare diverse multi-modal pre-training datasets
  • Build frontend tools for visualizing and automating labeling of very large datasets
  • Develop machine learning models to automatically label and organize datasets
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

AI Data Engineer

The AI Data Engineer role involves designing and implementing cloud platforms fo...
Location
Location
United States , San Juan
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, engineering, information systems, or closely related quantitative discipline
  • 4-7 years’ experience
  • strong programming skills in Python, Java, Golang, or JavaScript
  • good understanding of distributed systems, event-driven programming paradigms, and designing for scale and performance
  • experience with cloud-native applications, developer tools, managed services, and next-generation databases
  • knowledge of DevOps practices like CI/CD, infrastructure as code, containerization, and orchestration using Kubernetes
  • good written and verbal communication skills
  • comfortable with AWS services
  • familiarity with the landscape of big data exploration, visualization, and prototyping platforms
  • familiarity with statistical and machine learning techniques
Job Responsibility
Job Responsibility
  • Research, propose, design, implement, operate and maintain cloud platforms for big data exploration and visualization, in support of a team of data scientists
  • deploy data science solutions into cloud environments
  • work with data scientists to troubleshoot cloud workflows
  • closely collaborate with our datalake team on cloud technologies
  • identify and implement cost-saving strategies to reduce ongoing cloud expenses
  • build CI/CD pipelines
  • deploy and maintain orchestration and monitoring systems for big data processing
  • help build images and containerize applications
What we offer
What we offer
  • Comprehensive suite of benefits that supports physical, financial, and emotional wellbeing
  • specific programs catered to professional development
  • inclusive working environment
  • Fulltime
Read More
Arrow Right

Data Engineer

As a Data Engineer, you’ll build and refine the pipelines, data models, and serv...
Location
Location
United States , Redmond
Salary
Salary:
155000.00 - 175000.00 USD / Year
2a.consulting Logo
2A Consulting
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven ability to design and build end-to-end data systems, from ingestion through cleaning, structuring, storage, and serving
  • Experience building and shipping data products that deliver practical value
  • Demonstrated impact using AI models in data workflows (applied use, not ML research)
  • 5+ years of software or data engineering experience, including at least 2 years of hands-on work with data pipelines
  • Comfortable defining architecture and starting systems from scratch, working independently in a small cross-functional team
  • Proficiency in Python, SQL, or similar languages used in data engineering workflows
Job Responsibility
Job Responsibility
  • Build and maintain core data pipelines
  • Build and maintain end-to-end ingestion pipelines for documents, datasets, code repositories, videos, transcripts, and internal knowledge sources
  • Clean, normalize, structure, and store data in formats that support both web applications and AI-driven use cases
  • Use “out of the box” Microsoft tools—such as Fabric, Azure services, Cosmos DB, or Copilot Studio—to create reliable, maintainable systems
  • Enrich and model research data
  • Use AI models to transform unstructured content into structured metadata and durable knowledge assets
  • Design the architecture and foundational data systems, establishing the patterns and infrastructure for a new, scalable environment
  • Develop and refine embeddings, vector indexes, and retrieval components to support semantic search and grounding scenarios
  • Build backend and data services
  • Build data services, APIs, and backend components that power internal applications and agent-supported workflows
What we offer
What we offer
  • Flexible time-off plan
  • 100% employer-paid medical, dental, and vision insurance
  • Employer-paid life insurance for those enrolled in medical coverage
  • 401(k) plan with company match
  • Fertility, surrogacy, and adoption benefits
  • Fitness and caregiver benefits
  • Employee Assistance Program
  • 100% employer-paid short- and long-term disability coverage
  • Fulltime
Read More
Arrow Right

Director of AI Engineering

We are entering a hyper-growth phase of AI innovation and are hiring a Director ...
Location
Location
Canada; United States
Salary
Salary:
300000.00 - 450000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10–15+ years in software engineering, with significant leadership experience owning AI/ML or applied LLM systems at scale
  • Proven history shipping LLM-powered features, agentic workflows, or AI assistants used by real customers in production
  • Deep understanding of LLM orchestration frameworks (LangChain, LlamaIndex), RAG pipelines, vector search, embeddings, and prompt engineering
  • Expert in backend & distributed systems (Python strongly preferred) and cloud infrastructure (AWS/GCP)
  • Strong experience with telemetry, observability, and cost-aware real-time inference optimizations
  • Demonstrated ability to lead senior engineers, define technical roadmaps, and deliver outcomes aligned to business metrics
  • Experience building or scaling teams working on experimentation, optimization, personalization, or ML-powered growth systems
  • Exceptional ability to simplify complex problems, set clear standards, and drive alignment across Product, Data, Design, and Engineering
  • Strong product sense, ability to weigh novelty vs. impact, focus on user value, and prioritize speed with guardrails
  • Fluent in integrating AI tools into engineering workflows for code generation, debugging, delivery velocity, and operational efficiency
Job Responsibility
Job Responsibility
  • Define the multi-year technical vision for Apollo’s AI stack, spanning agents, orchestration, inference, retrieval, and platformization
  • Prioritize high-impact AI investments by partnering with Product, Design, Research, and Data leaders to align engineering outcomes with business goals
  • Establish technical standards, evaluation criteria, and success metrics for every AI-powered feature shipped
  • Lead the architecture and deployment of long-horizon autonomous agents, multi-agent workflows, and API-driven orchestration frameworks
  • Build reusable, scalable agentic components that power GTM workflows like research, enrichment, sequencing, lead scoring, routing, and personalization
  • Own the evolution of Apollo’s internal LLM platform for high-scale, low-latency, cost-optimized inference
  • Oversee model-driven experiences for natural-language interfaces, RAG pipelines, semantic search, personalized recommendations, and email intelligence
  • Partner with Product & Design to build intuitive conversational UX that hides underlying complexity while elevating user productivity
  • Implement rigorous evaluation frameworks, including offline benchmarking, human-in-the-loop review, and online A/B experimentation
  • Ensure robust observability, monitoring, and safety guardrails for all AI systems in production
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA
  • Fulltime
Read More
Arrow Right
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.