CrawlJobs Logo

Data Engineer + Scientist Hybrid

115000.00 - 150000.00 USD / Year · Job Posted January 20, 2026
Apply Position
Job Link Share

Job Description

Spoak is looking for a hybrid data engineer / data scientist to join us on our mission to build the world’s most loved interior design platform. As a company committed to using data to drive our business and roadmap, we are looking for a talented data engineer/data scientist hybrid who can help us to develop and maintain our data infrastructure and use advanced analytics techniques to uncover insights that will help us grow our business and achieve our mission.

Job Responsibility

  • Design, build and maintain our data infrastructure, including ETL pipelines and databases
  • Develop and implement advanced analytics models and algorithms to uncover insights that can be used to optimize our products and customer experience
  • Work closely with product managers, designers, and engineers to identify data needs and build out new data-driven features
  • Develop and maintain data documentation, ensuring that our data is accurate, consistent, and well-documented
  • Participate in cross-functional projects and collaborate with other teams to share insights and knowledge

Requirements

  • Bachelor's degree in computer science, statistics, mathematics or a related field
  • Strong knowledge of data engineering and data science concepts and techniques, including ETL, data warehousing, statistical modeling, machine learning, and data visualization
  • Proficiency in programming languages such as Python, R, or SQL
  • Experience with cloud platforms such as AWS or GCP
  • Ability to work collaboratively in a fast-paced, startup environment
  • Excellent communication skills and ability to explain technical concepts and insights to non-technical stakeholders

Nice to have

  • Experience with data visualization tools such as Tableau or Power BI
  • Experience with distributed computing systems such as Hadoop or Spark
  • Experience with big data technologies such as Apache Kafka, BigQuery or Cassandra
  • Experience with containerization technologies such as Docker or Kubernetes
  • Experience with machine learning platforms like TensorFlow or PyTorch

What we offer

  • To build an amazing company from scratch
  • To build tools that enable creativity
  • Remote-first team, EST hours
  • Medical, dental, and vision insurance
  • 401K
  • A four-day work week every other week
  • Flexible time-off
  • Monthly virtual team events
  • A close-knit team

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Data Engineer + Scientist Hybrid

8 matching positions

Data Scientist Senior - GenAI Solutions

The ideal candidate as Senior Data Scientist in NTT DATA will have a mix of skil...
Location
Location
Italy , Milano
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience of at least 3-5 years as data scientist
  • Strong Python (mandatory) and SQL coding
  • Knowledge of Java/Scala or C++/R is a plus
  • Tensorflow and Pytorch knowledge with a focus over Computer Vision solutions
  • Knowledge of ML solutions architecture and Generative AI pipelines
  • Cloud and hybrid ML implementations
  • Incorporate new, cutting edge business intelligence, machine learning, and alternative data practices into wider organization to drive efficiencies and generate revenue
  • Own the full stack execution of data science projects from data wrangling, model development, presentation, measurement to deployment
  • Work with data engineers to produce fully productionized pipelines including data ingestion, cleaning, feature engineering and model training/prediction
  • Conduct research and analysis into reinsurance and advisory products to augment industry-standard models and develop new market offerings
Job Responsibility
Job Responsibility
  • Collaborate with data scientists, data engineers, ML Engineers, product managers, business analysts, and stakeholders from internal teams and external clients
  • Manage multiple projects simultaneously
  • Lead and coordinate the Data Scientists involved in the projects
  • Act as the main point of contact with client-side stakeholders
  • Ensure that project tasks are delivered on time and executed to a high standard
Read More
Arrow Right

Machine Learning Data Scientist, Forecasting

The Strategic Finance team at OpenAI plays a critical role in shaping the compan...
Location
Location
United States , San Francisco
Salary
Salary:
230000.00 - 385000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced degree (MS or PhD) in a quantitative field (e.g., Statistics, Computer Science, Economics, Operations Research)
  • 7+ years of experience in applied data science, with deep hands-on exposure to forecasting, predictive modeling, or marketplace systems
  • Expertise in time-series forecasting techniques and practical understanding of model trade-offs across performance, explainability, and scalability
  • Proficiency in Python, SQL, and tools such as scikit-learn, PyTorch/TensorFlow, and forecasting libraries
  • Demonstrated experience with model monitoring, debugging, and long-term maintenance in production environments
  • Strong communication and storytelling skills - able to simplify complexity and influence executive stakeholders
  • Self-directed, intellectually curious, and comfortable leading ambiguous projects from 0→1
Job Responsibility
Job Responsibility
  • Build statistical and machine learning models to solve forecasting needs across product, finance, infrastructure, and GTM domains
  • Own the end-to-end modeling lifecycle, including scoping, feature engineering, model development and prototyping, experimentation, deployment, monitoring, and explainability
  • Develop and productionize scalable, interpretable forecasts for user growth, monetization, compute load, customer lifetime value, and profitability
  • Contribute to self-service forecasting tools and internal platforms, enabling teams across OpenAI to access and act on real-time predictions
  • Research and evaluate emerging tools and techniques in the forecasting space, such as TimeGPT, large language model extensions, causal forecasting, and hybrid approaches
  • Drive strategic insight generation by translating technical outputs into business-aligned recommendations and decision frameworks
  • Collaborate closely with cross-functional teams to ensure forecasts are well-integrated into planning processes, experimentation workflows, and executive decision-making
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right

Senior Data Scientist

At Valtech, you’ll find an environment designed for continuous learning, meaning...
Location
Location
North Macedonia
Salary
Salary:
Not provided
valtech.com Logo
Valtech
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Several years of experience as a Senior Data Scientist / ML Engineer / Software Engineer (or equivalent) with demonstrable productive ML systems
  • In-depth knowledge of ML (Supervised/Unsupervised, Sequences/Time Series, Anomaly Detection, Classification/Regression) and solid understanding of statistics/evaluation
  • First-principles thinking: You can not only "apply" models, but also derive them, question them, and combine them with domain knowledge (hybrid approaches)
  • Scientific curiosity paired with pragmatism: forming hypotheses, testing experimentally, delivering results
Job Responsibility
Job Responsibility
  • Development and validation of models/algorithms for use cases such as: Gait analysis and movement pattern recognition (gait pattern, stability, deviations, trend analyses)
  • New features such as delirium, complex risk indicators, clinical "events"
  • Experiment Design & Measurability: Definition of metrics, offline evaluation, golden sets, reproducibility, performance/robustness
  • Feature Engineering & Representation Learning: Derive meaningful representations from radar data (incl. domain understanding)
  • Evaluation of new approaches for radar-based patient monitoring (classic ML, deep learning, probabilistic models, first principles, and hybrid methods)
What we offer
What we offer
  • Private health insurance
  • Education program with training and certification
  • Wellbeing program
  • Free beverages
  • Events
  • Competitive salary and 24 days of vacation
  • Challenging projects
  • Cool colleagues
  • Honest feedback
Read More
Arrow Right

Research Scientist / Engineer — Video / Audio Generation

This is a rare and foundational opportunity to define the future of creative AI....
Location
Location
United States , Palo Alto
Salary
Salary:
250000.00 - 450000.00 USD / Year
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong foundation in machine learning and generative modeling, with experience in video, audio, or multimodal domains
  • Deep understanding of autoregressive, diffusion/flow-based, or hybrid generative models, and their tradeoffs for long-horizon generation
  • Hands-on experience with PyTorch and large-scale training (distributed, mixed precision, large datasets)
Job Responsibility
Job Responsibility
  • Architect large-scale video and audio generative models, focusing on strong temporal coherence and high perceptual quality
  • Design, implement, and run robust data pipelines for curating, filtering, and captioning massive video and audio datasets
  • Train large-scale video and audio generative models on massive datasets and GPU clusters
  • Define and build novel evaluation frameworks to measure realism, temporal consistency, controllability, and human-aligned creative quality
  • Fulltime
Read More
Arrow Right

Cloud Data Engineer

Join BIP – xTech, BIP's Centre of Excellence specializing in innovative consulti...
Location
Location
Albania , Tirana
Salary
Salary:
Not provided
businessintegrationpartners.com Logo
Business Integration Partners
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in the implementation and management of cloud‑based data platforms
  • Knowledge of one or more cloud platforms (e.g., Google GCP, Amazon AWS, Microsoft Azure) and their services supporting data management, processing, and analysis
  • Knowledge of key concepts of modern data platforms (e.g., Big Data, Data Lake, Data Warehouse, Data Virtualization, Data Mesh, Data Governance, etc.)
  • Experience in designing data models and creating/maintaining data pipelines for data extraction, transformation, and loading
  • Knowledge of one or more programming and data query languages (e.g., SQL, Java, Ruby, Python, R…)
  • In‑depth knowledge of one or more technologies for managing structured and unstructured databases (e.g., BigQuery, MongoDB, Hadoop, Redis, etc.)
Job Responsibility
Job Responsibility
  • Working on Big Data or hybrid Data Platforms, on‑premises or cloud, implementing batch or real‑time processing pipelines, and performing transformation and handling of structured and unstructured data
  • Autonomous in the phases of data acquisition, historization, cleansing, anonymization/crypting/masking, null-value and outlier handling, aggregation, structuring of unstructured data, creation of quality KPIs, and preparation of data for different users
  • Receiving Machine Learning models from Data Scientists, optimizing them for scalable execution—for example by applying computational parallelism—and integrating/scheduling them for efficient automated execution
  • Working with various technologies, chosen based on the specific project and client, favoring cloud‑native and globally scalable architectures
  • Provide Advisory support to select Engineering technologies within emerging Big Data architectures or to improve existing ecosystems
  • Build relationships with clients, vendors, and partners
  • Actively contribute to the Community through research activities, scouting, new solution concepts, and business development
  • Fulltime
Read More
Arrow Right

Data Scientist AI

We are the central Advanced Analytics and Artificial Intelligence team in Airbus...
Location
Location
Portugal , Lisbon; Coimbra
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years industry experience in data science, AI or a similar field with a track record of end-to-end model development and handling large data-sets
  • Strong, fundamental understanding of common ML algorithms
  • Bachelor’s, Master’s or PhD degree in Computer Science, Data Science, Statistics, Engineering, Mathematics or a related quantitative discipline
  • Deepened knowledge in at least one of the following fields: NLP, Computer Vision, Time Series Analysis
  • High Proficiency in Python including relevant open source libraries such as Pandas, Scikit-Learn, Tensorflow and PyTorch
  • Fluent in written and spoken English
  • Experience in customer facing roles focused on business impact and managing expectations
  • Experience in process analysis and process optimization including methodologies such as value stream mapping
  • Development and deployment of models in cloud environments like AWS, GCP or SAP
  • Agile and product-oriented development
Job Responsibility
Job Responsibility
  • Partner with business functions on missions to absorb domain expertise and priorities, build analytical insights and translate business challenges into AI related opportunities that you will tackle with our wider team
  • Carry out hands-on, end-to-end AI / ML model development from early ideation, exploration and feasibility studies on to design, development, deployment as well as monitoring of scaled AI services
  • Support our AI capability teams in defining and delivering on AI capability roadmaps (built around 5 technology pillars: computer vision, knowledge extraction, Time-series anomaly detection, decision making and hybrid modeling) that target reusable services and components which are consumed by a wide range of IT products
  • Representing the central AI organization, evangelize and establish best practices and guidance on delivering and running AI solutions in the rapidly growing Airbus Lisbon office
  • Engaging in the global AI and analytics community as well as growing the local community footprint, through holding project showcases, sharing expertise as well as best practices
What we offer
What we offer
  • Diverse career opportunities within Airbus European core countries or in other regions around the world
  • A hybrid working model, allowing you to combine onsite and offsite work
  • A modern office at Parque das Nacoes, well connected to public transportation
  • A motivated and fun crew to grow and build and shape the GBS together
  • An intense and exciting onboarding experience
  • Fulltime
Read More
Arrow Right

Data Engineer, Solutions Architecture

We are seeking a talented Data Engineer to design, build, and maintain our data ...
Location
Location
United States , Scottsdale
Salary
Salary:
90000.00 - 120000.00 USD / Year
clearwayenergy.com Logo
Clearway Energy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2-4 years of hands-on data engineering experience in production environments
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Proficiency in Dagster or Airflow for pipeline scheduling, dependency management, and workflow automation
  • Advanced-level Snowflake administration, including virtual warehouses, clustering, security, and cost optimization
  • Proficiency in dbt for data modeling, testing, documentation, and version control of analytical transformations
  • Strong Python and SQL skills for data processing and automation
  • 1-2+ years of experience with continuous integration and continuous deployment practices and tools (Git, GitHub Actions, GitLab CI, or similar)
  • Advanced SQL skills, database design principles, and experience with multiple database platforms
  • Proficiency in AWS/Azure/GCP data services, storage solutions (S3, Azure Blob, GCS), and infrastructure as code
  • Experience with APIs, streaming platforms (Kafka, Kinesis), and various data connectors and formats
Job Responsibility
Job Responsibility
  • Design, deploy, and maintain scalable data infrastructure to support enterprise analytics and reporting needs
  • Manage Snowflake instances, including performance tuning, security configuration, and capacity planning for growing data volumes
  • Optimize query performance and resource utilization to control costs and improve processing speed
  • Build and orchestrate complex ETL/ELT workflows using Dagster to ensure reliable, automated data processing for asset management and energy trading
  • Develop robust data pipelines that handle high-volume, time-sensitive energy market data and asset generation and performance metrics
  • Implement workflow automation and dependency management for critical business operations
  • Develop and maintain dbt models to transform raw data into business-ready analytical datasets and dimensional models
  • Create efficient SQL-based transformations for complex energy market calculations and asset performance metrics
  • Support advanced analytics initiatives through proper data preparation and feature engineering
  • Implement comprehensive data validation, testing, and monitoring frameworks to ensure accuracy and consistency across all energy and financial data assets
What we offer
What we offer
  • generous PTO
  • medical, dental & vision care
  • HSAs with company contributions
  • health FSAs
  • dependent daycare FSAs
  • commuter benefits
  • relocation
  • a 401(k) plan with employer match
  • a variety of life & accident insurances
  • fertility programs
  • Fulltime
Read More
Arrow Right

Junior Data Scientist

Aramark Sports + Entertainment is hiring a Junior Data Scientist - Oracle Park, ...
Location
Location
United States , San Francisco
Salary
Salary:
70000.00 - 95000.00 USD / Year
aramark.com Logo
Aramark
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must be legally authorized to work in the United States without the need for current or future employment-based sponsorship from Aramark
  • Bachelor’s degree in Mathematics, Statistics, Computer Science, Data Science, or a related field
  • equivalent practical experience may be considered
  • 1–3 years of experience in an analytical or data science role
  • Proficiency in data manipulation and transformation using Python or R
  • Familiarity with SQL for data querying and analysis
  • Knowledge of data science workflows including data cleaning, feature engineering, and predictive modeling
  • Effective organizational and time management skills, with the ability to manage multiple projects simultaneously
  • Solid understanding of statistics, experimental design, and core data science concepts
  • Strong communication skills with the ability to present findings and recommendations to technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Analyze consumer behavior at Oracle Park by leveraging purchasing and dining data to identify key customer segments. Present insights using statistical methods and engaging visualizations tailored to diverse stakeholder audiences
  • Evaluate operational performance by integrating data from labor tracking, Point of Sale (POS), and inventory systems. Identify inefficiencies and recommend actionable improvements to enhance venue operations
  • Conduct ad-hoc analyses to assess the effectiveness of short-term strategies. Collaborate with cross-functional teams to define success metrics and deliver timely, data-backed evaluations
  • Support the development of automated reporting workflows that deliver key performance metrics to stakeholders, including Oracle Park operations and the San Francisco Giants
  • Assist in building scalable data pipelines using Python, R, and SQL to streamline data access and support analytics and reporting initiatives
  • Perform machine learning experiments and model evaluation tasks under the guidance of the team’s Lead Data Scientist
What we offer
What we offer
  • medical, dental, vision, and work/life resources
  • retirement savings plans like 401(k)
  • paid days off such as parental leave and disability coverage
  • Fulltime
Read More
Arrow Right