CrawlJobs Logo

Data Engineer (Spark)

Poland, Warsaw B2B 15120.00 - 31920.00 PLN / Month · Job Posted January 10, 2026
Apply Position
Job Link Share

Job Description

Addepto is a leading AI consulting and data engineering company that builds scalable, ROI-focused AI solutions for some of the world’s largest enterprises and pioneering startups, including Rolls Royce, Continental, Porsche, ABB, and WGU. With our exclusive focus on Artificial Intelligence and Big Data, we help organizations unlock the full potential of their data through systems designed for measurable business impact and long-term growth. As a Data Engineer, you will have the exciting opportunity to work with a team of technology experts on challenging projects across various industries, leveraging cutting-edge technologies.

Job Responsibility

  • Develop and maintain a high-performance data processing platform for automotive data, ensuring scalability and reliability
  • Design and implement data pipelines that process large volumes of data in both streaming and batch modes
  • Optimize data workflows to ensure efficient data ingestion, processing, and storage using technologies such as Spark, Cloudera, and Airflow
  • Work with data lake technologies (e.g., Iceberg) to manage structured and unstructured data efficiently
  • Collaborate with cross-functional teams to understand data requirements and ensure seamless integration of data sources
  • Monitor and troubleshoot the platform, ensuring high availability, performance, and accuracy of data processing
  • Leverage cloud services (AWS) for infrastructure management and scaling of processing workloads
  • Write and maintain high-quality Python (or Java/Scala) code for data processing tasks and automation

Requirements

  • At least 4 years of commercial experience implementing, developing, or maintaining Big Data systems, data governance and data management processes
  • Strong programming skills in Python (or Java/Scala): writing a clean code, OOP design
  • Hands-on with Big Data technologies like Spark, Cloudera, Data Platform, Kafka, Airflow, NiFi, Docker, and Iceberg
  • Excellent understanding of dimensional data and data modeling techniques
  • Experience implementing and deploying solutions in cloud environments
  • Consulting experience with excellent communication and client management skills, including prior experience directly interacting with clients as a consultant
  • Ability to work independently and take ownership of project deliverables
  • Fluent English (at least C1 level)
  • Bachelor’s degree in technical or mathematical studies

Nice to have

  • Experience with an MLOps framework such as Kubeflow or MLFlow
  • Familiarity with Databricks and/or dbt

What we offer

  • Work in a supportive team of passionate enthusiasts of AI & Big Data
  • Engage with top-tier global enterprises and cutting-edge startups on international projects
  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces
  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications
  • Choose your preferred form of cooperation: B2B or a contract of mandate, and enjoy 20 fully paid days off
  • Participate in team-building events and utilize the integration budget
  • Celebrate work anniversaries, birthdays, and milestones
  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching
  • Get full work equipment for optimal productivity, including a laptop and other necessary devices
  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups
  • Experience a smooth onboarding with a dedicated buddy, and start your journey in our friendly, supportive, and autonomous culture

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Data Engineer (Spark)

8 matching positions

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location
Location
United Arab Emirates , Dubai
Salary
Salary:
Not provided
northbaysolutions.com Logo
NorthBay
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue
  • Amazon EMR (Spark)
  • AWS Lambda
  • Apache Airflow (MWAA)
  • Amazon EC2
  • Amazon CloudWatch
  • Amazon Redshift
  • Amazon DynamoDB
  • AWS DataZone
Job Responsibility
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
  • Fulltime
Read More
Arrow Right

Lead Data Engineer Spark and SQL – Vice President

The Lead Data Engineer Spark and SQL – Vice President is responsible for establi...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years of relevant experience in Apps Development or systems analysis role (JAVA)
  • Experience with Spark and Scala
  • Experience with Ab Initio
  • Experience with ETL and SQL
  • Extensive experience system analysis and in programming of software applications
  • Experience in managing and implementing successful projects
  • Subject Matter Expert (SME) in at least one area of Applications Development
  • Ability to adjust priorities quickly as circumstances dictate
  • Demonstrated leadership and project management skills
  • Consistently demonstrates clear and concise written and verbal communication
Job Responsibility
Job Responsibility
  • Partner with multiple management teams to ensure appropriate integration of functions to meet goals as well as identify and define necessary system enhancements to deploy new products and process improvements
  • Resolve variety of high impact problems/projects through in-depth evaluation of complex business processes, system processes, and industry standards
  • Provide expertise in area and advanced knowledge of applications programming and ensure application design adheres to the overall architecture blueprint
  • Utilize advanced knowledge of system flow and develop standards for coding, testing, debugging, and implementation
  • Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
  • Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
  • Serve as advisor or coach to mid-level developers and analysts, allocating work as necessary
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
  • Fulltime
Read More
Arrow Right

Lead Big Data Spark Engineer

We are seeking an experienced and highly skilled Big Data Engineer to lead the d...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • 8+ years of progressive experience in software development
  • 5+ years focusing on big data technologies
  • 3+ years of experience in a leadership or senior architectural role
  • Extensive hands-on experience with Scala for big data processing
  • Demonstrated expertise with Apache Spark (Spark Core, Spark SQL, Spark Streaming)
  • Strong experience with distributed systems and big data ecosystems (e.g., Hadoop, Kafka, Cassandra, HBase, Delta Lake, Snowflake, Databricks)
  • Proficiency with cloud platforms (AWS, Azure, GCP) and their big data services
  • Experience with containerization technologies (Docker, Kubernetes) and CI/CD pipelines
  • Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling
Job Responsibility
Job Responsibility
  • Lead the architecture, design, and development of high-performance, scalable, and reliable big data processing systems using Scala and Apache Spark
  • Drive technical vision and strategy for big data initiatives
  • Evaluate and recommend new technologies and tools
  • Design, develop, and optimize data pipelines for ingestion, transformation, and storage of massive datasets
  • Implement robust and efficient data processing jobs using Scala and Spark (batch and streaming)
  • Ensure data quality, integrity, and security
  • Promote and enforce best practices in coding, testing, and deployment
  • Mentor and guide a team of talented big data engineers
  • Conduct code reviews, provide constructive feedback
  • Participate in the recruitment and hiring
  • Fulltime
Read More
Arrow Right

Senior Data Engineer (ETL+Spark)

Our client is a leading US‑based insurance company serving more than 25 million ...
Location
Location
Argentina , Buenos Aires
Salary
Salary:
Not provided
eleks.com Logo
ELEKS
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience as a Data Engineer
  • Strong proficiency with Databricks (3+ years), Spark, and ETL pipelines
  • Experience with Delta Lake
  • Familiarity with the Hadoop ecosystem
  • Experience or understanding of on‑prem HDFS
  • Upper-Intermediate level of English
Job Responsibility
Job Responsibility
  • Design, develop, and maintain reliable software in line with technical requirements, focusing on performance and availability
  • Analyze requirements, review designs, and estimate user stories following project methodology (Agile, Waterfall, etc)
  • Proactively propose code refactoring and optimization improvements according to the best software development practices and coding standards
  • Help maintain and improve high-quality standards within the developer community by sharing knowledge, conducting tech talks, and participating in the internal promotion verification process
  • Stay up-to-date with modern technology and obtain professional certifications
  • Support less experienced developers by providing training, distributing, and monitoring tasks
What we offer
What we offer
  • Close cooperation with a customer
  • Challenging tasks
  • Competence development
  • Ability to influence project technologies
  • Team of professionals
  • Dynamic environment with low level of bureaucracy
Read More
Arrow Right

Data Engineer – Java & Spark

We are looking for a skilled Data Engineer with strong expertise in Java and Apa...
Location
Location
India , Bangalore South
Salary
Salary:
Not provided
votredircom.fr Logo
Wissen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4–7 years of strong hands-on experience in Data Engineering and Java development
  • Strong expertise in Apache Spark (Spark Core, Spark SQL, DataFrames, Structured Streaming)
  • Solid experience in data ingestion, ETL/ELT, and building data pipelines
  • Working knowledge on Java
  • Experience handling large-scale data processing and distributed systems
  • Familiarity with Maven/Gradle, Git, and CI/CD practices
  • Strong SQL skills and understanding of data modeling concepts
  • Excellent problem-solving and communication skills
  • Must be open to working from Bangalore location
Job Responsibility
Job Responsibility
  • Design, develop, and maintain scalable data ingestion pipelines using Java and Apache Spark
  • Build and optimize Spark jobs (Spark Core, Spark SQL, DataFrames, Streaming) for large-scale batch and real-time processing
  • Develop reusable ingestion frameworks for structured and semi-structured data from multiple sources (APIs, databases, files, streaming systems)
  • Implement high-performance ETL/ELT solutions with strong focus on data quality, reliability, and scalability
  • Collaborate with data architects, analysts, and cross-functional teams to design robust data workflows
  • Optimize Spark performance (partitioning, caching, tuning, memory management) for production environments
  • Contribute to CI/CD pipelines, code reviews, and best practices in data engineering
  • Troubleshoot data pipeline failures and implement monitoring and alerting mechanisms
  • Document technical designs and mentor junior engineers
  • Fulltime
Read More
Arrow Right

Real-Time Data Engineer (Flink / Spark Streaming)

Bright Vision Technologies is looking for a skilled Real-Time Data Engineer (Fli...
Location
Location
United States
Salary
Salary:
Not provided
bvteck.com Logo
Bright Vision Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Apache Flink
  • Spark Streaming
  • Apache Kafka
  • Event-Driven Architecture
  • Real-Time Data Pipelines
  • Java / Scala / Python
  • Windowing & State Management
  • Stream Processing Semantics
  • SQL
  • NoSQL Databases
What we offer
What we offer
  • H-1B sponsorship for 2026 quota
  • H-1B filing with level 4 prevailing wage
  • Fulltime
Read More
Arrow Right

Backend Data Engineer — Data Platform

We're looking for an engineer to support hands-on implementation and migration w...
Location
Location
Poland
Salary
Salary:
140.00 - 170.00 PLN / Hour
devire.pl Logo
Devire
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Java development skills, with experience in data platform or data engineering contexts
  • Practical experience with at least one JVM-based data processing framework — Flink experience is a plus
  • Beam, Dataflow, or Spark also relevant
  • Comfortable with SQL and cloud data analytics platforms, particularly BigQuery
  • DevOps is part of your day-to-day: you work with cloud infrastructure, containerised applications, and are familiar with Kubernetes basics
  • Experience working with data engineering pipelines in Scala and/or Python
  • You write quality code and understand what it means to ship reliably in a production environment
  • You can work autonomously in an ambiguous environment and move quickly without waiting to be directed
Job Responsibility
Job Responsibility
  • Support hands-on implementation and migration work as we evolve our data processing stack
  • Contribute to company wide migration efforts and platform development
  • Embedded in a team in Data Infrastructure PA, contributing to hands-on engineering work including large-scale pipeline migrations — validating performance and cost outcomes and helping move workloads to our evolving stack
  • Contribute to platform development across our Flink platform, Lakehouse architecture and beyond
What we offer
What we offer
  • Access to benefits package at attractive prices (medical care, Multisport card, life insurance, cafeteria system)
  • Long-term cooperation
  • Fulltime
Read More
Arrow Right

Lead Data Engineer - Data Pipelines

Economic Intelligence is a Mastercard Services program within Economic & Busines...
Location
Location
Czechia , Prague
Salary
Salary:
Not provided
mastercard.com Logo
Mastercard
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong professional experience with Python building production-grade systems
  • Strong SQL skills, including writing readable and well-tuned queries for large-scale datasets
  • Experience leading design and delivery of data pipelines and/or data platform components, with attention to scalability, reliability, and maintainability
  • Familiarity with Databricks (or similar Spark-based data platforms)
  • Comfortable using LLM‑based coding tools, with the discipline to validate, test, and take full ownership of the generated code
  • Strong foundation in object-oriented software design and engineering best practices
  • Comfortable with Linux command line and production troubleshooting
  • Clear communicator with strong written and verbal English
  • Ownership mindset: motivation, creativity, self-direction, and ability to thrive in small, collaborative teams while influencing across boundaries
  • Passion for analytical/quantitative problem solving, quality, and continuous improvement of both platform and team processes
Job Responsibility
Job Responsibility
  • Lead technical execution to deliver end-to-end capabilities from data processing through publishing and customer consumption
  • Design, build, and evolve scalable, maintainable data pipelines that deliver insights from economic data
  • Drive performance, reliability, observability, and readability in the data platform codebase
  • Own and unify the data layer across multiple products, establishing shared patterns, reusable components, and clear interfaces
  • Translate business needs into durable technical designs
  • Tackle technical debt strategically
  • Set engineering direction and standards for pipeline development and cross-team integration
  • Write, test, and review code
  • Provide technical leadership through mentorship, rigorous code reviews, and support for other engineers’ growth and delivery
  • Continuously innovate, evaluating new approaches, tools, and technologies
  • Fulltime
Read More
Arrow Right