CrawlJobs Logo

Staff Software Engineer, Spark (Java)

Hungary, Budapest · Job Posted January 26, 2026
Apply Position
Job Link Share

Job Description

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises. The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. Cloudera is seeking a Staff Software Engineer, Spark (Java) with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark and Livy. The role involves building enterprise-grade systems for customers running Spark on thousands of nodes and processing petabytes of data. We are looking for a passionate engineer eager to enhance a product already supporting major production systems and to drive the next-generation Data Engineering experience. You will collaborate with a distributed team across the United States and Hungary, including multiple Apache Spark committers.

Job Responsibility

  • Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
  • Contribute to Apache Spark, Livy
  • Develop new features in Scala/Java/Python on a modern platforms
  • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems
  • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark
  • Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
  • Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
  • Work on improving internal infrastructure
  • Collaborate with other team members and stakeholders

Requirements

  • 6+ years professional software development
  • Experience leading and delivering complex product enhancements
  • Strong understanding of at least one of the following languages: Java, Scala, Python
  • Experience with systems design, development
  • Passionate about programming, clean coding habits, attention to detail, and focus on quality
  • Strong oral and written communication skills
  • Strong ability to research and solve problems independently without constant supervision
  • Open-minded, desire to learn new things and build great products
  • Experience with distributed systems

Nice to have

  • Experience with SQL planners
  • Experience with using/developing Apache Spark, Livy or other related technologies
  • Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling
  • Solid experience with at least one cloud

What we offer

  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Software Engineer, Spark (Java)

8 matching positions

Staff Software Engineer, Spark (Java)

At Cloudera, we empower people to transform complex data into clear and actionab...
Location
Location
Hungary , Budapest; Szeged
Salary
Salary:
Not provided
cloudera.com Logo
Cloudera
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years professional software development
  • Experience leading and delivering complex product enhancements
  • Strong understanding of at least one of the following languages: Java, Scala, Python
  • Experience with systems design, development
  • Passionate about programming, clean coding habits, attention to detail, and focus on quality
  • Strong oral and written communication skills
  • Strong ability to research and solve problems independently without constant supervision
  • Open-minded, desire to learn new things and build great products
  • Experience with distributed systems
Job Responsibility
Job Responsibility
  • Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
  • Contribute to Apache Spark, Livy
  • Develop new features in Scala/Java/Python on a modern platforms
  • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems
  • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark
  • Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
  • Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
  • Work on improving internal infrastructure
  • Collaborate with other team members and stakeholders
What we offer
What we offer
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
  • Fulltime
Read More
Arrow Right
New

Staff Software Engineer (L4)

As a Staff Engineer on the Twilio Segment Data platform/ pipelines team, you’ll ...
Location
Location
India
Salary
Salary:
Not provided
stytch.com Logo
Stytch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • Hands-on experience with high-scale messaging/streaming systems (several thousand events/sec) and processing engines ( 1M+ events/sec).
  • 8+ years of experience writing production-grade code in a modern programming language
  • Strong theoretical fundamentals and hands-on experience designing and implementing highly available and performant fault-tolerant distributed systems.
  • Experience programming in one or more of the following: Go, Java, Scala, or similar languages
  • Well-versed in concurrent programming, along with a solid grasp of Linux systems and networking concepts.
  • Experience operating large-scale, distributed systems on top of cloud infrastructure such as Amazon Web Services (AWS) or Google Cloud Platform (GCP)
  • Experience in message passing systems (e.g., Kafka, AWS Kinesis) and/or modern stream processing systems (e.g., Spark, Flink).
  • Have hands-on experience with container orchestration frameworks (e.g. Kubernetes, EKS, ECS)
  • Leverage best-in-class development productivity practices including AI tooling.
Job Responsibility
Job Responsibility
  • Design and deliver robust, high-scale routing experiences for the Data platform/ pipelines team for Twilio Segment.
  • Ship features that opt for high availability and throughput with eventual consistency
  • Collaborate with engineering and product leads, as well as teams across Twilio Segment
  • Support the reliability and security of the platform
  • Build and optimize globally available and highly scalable distributed systems
  • Be able to act as a team Tech Lead as needed
  • Mentor other engineers on the team in technical architecture and design
  • Partner with application teams to deliver end to end customer success.
What we offer
What we offer
  • Competitive pay
  • generous time off
  • ample parental and wellness leave
  • healthcare
  • retirement savings program
  • and much more.
Read More
Arrow Right

Staff Software Engineer - Data

Uber's mission is to reimagine the way the world moves for the better. Here, bol...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Education: Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field
  • Experience Level: 10+ years of hands-on experience in Data Engineering, with a proven track record of delivering results at a Staff Engineer level (or equivalent scope) at a premier technology company
  • Expert SQL Competency: 10+ years of hands-on, expert-level SQL experience
  • Data Modeling & Warehousing: Extensive experience designing dimensional data models (Star/Snowflake schemas) and data warehouses
  • Software Engineering Fundamentals: Proficiency in at least one high-level programming language (Java, Scala, Python, or Go)
  • Big Data Ecosystem: 10+ years of experience working with distributed data systems (Hadoop, Hive, Spark) and MPP databases (Vertica, Redshift, etc.)
  • End-to-End Architecture: Experience designing full-lifecycle data systems, including logging, ingestion (Batch/Stream), quality frameworks, and monitoring
  • Technical Leadership: Excellent written and verbal communication skills
  • Mentorship & Growth: A strong passion for driving engineering excellence and mentoring engineers
Job Responsibility
Job Responsibility
  • Own the Technical Vision: You will own and drive the technical roadmap for the Payments data ecosystem, balancing long-term architectural scalability with short-term business critical deliveries
  • Navigate Ambiguity: Actively identify strategically important problems and inefficiencies without waiting for instruction
  • Drive Alignment: See the big picture and drive consensus on complex technical decisions across the organization
  • Architect at Scale: Design and implement resilient, cost-effective, and high-scale batch and streaming pipelines that power critical support operations and financial analytics
  • Elevate Data Standards: Define and enforce robust data modeling standards, data contracts, and governance frameworks
  • Optimize & Automate: Identify opportunities to automate manual workflows (like SLA tracking and issue detection) and optimize infrastructure efficiency to lower TCO
  • Raise the Bar: Champion sustainable engineering practices
  • Be a Trusted Mentor: Serve as a humble mentor and technical advisor to both junior engineers and peer leaders
  • Force Multiplier: Act as a role model for judgment and responsibility
  • Fulltime
Read More
Arrow Right

Staff Software Engineer (L4) Data Platform

We are seeking an experienced Staff Engineer to join our Data Substrate team. In...
Location
Location
United States
Salary
Salary:
171120.00 - 213900.00 USD / Year
stytch.com Logo
Stytch
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • 8+ years of experience in software development, or a related field
  • Proven track record of architecting and delivering complex data projects at scale, with a deep understanding of data infrastructure and distributed systems
  • Expertise in big data technologies such as Hadoop, Spark, Kafka, and other distributed computing systems
  • Experience designing, building, and operating large-scale systems using AWS technologies
  • Proficiency in programming languages such as Python, Java, or Scala, with strong problem-solving skills and attention to detail
  • Experience designing or working with Data Lakehouse architectures, including hands-on experience with Hudi, Iceberg, or Delta data formats
  • Excellent communication and collaboration skills, with the ability to influence technical decisions and drive alignment across teams
  • Strong leadership skills, with a track record of mentoring and developing junior engineers
  • Demonstrated ability to thrive in a fast-paced, dynamic environment and deliver results under tight timelines
Job Responsibility
Job Responsibility
  • Serve as a subject matter expert in distributed systems, data technologies, with strong software engineering skills
  • Architect and implement scalable and efficient data systems, storage solutions, and processing frameworks using state-of-the-art technologies
  • Drive technical innovation and research to stay at the forefront of emerging data technologies and best practices
  • Mentor and coach a team of talented engineers, fostering a culture of technical excellence, collaboration, and continuous learning
  • Collaborate closely with cross-functional teams to understand business requirements and translate them into scalable and efficient technical solutions
  • Ensure data quality, integrity, and security throughout the data lifecycle, adhering to industry best practices and compliance standards
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - ETL Platform

Fivetran is building data pipelines to power the modern data stack for thousands...
Location
Location
Canada , Toronto
Salary
Salary:
167548.00 - 209435.00 CAD / Year
fivetran.com Logo
Fivetran
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in at least one interpreted language such as Python, Ruby on Rails, etc.
  • Expertise in at least one compiled language such as C++, Kotlin, Scala, Go, Java, etc.
  • Solid understanding of relational databases (experience with data warehousing solutions (e.g., Snowflake) is a bonus)
  • Proven experience in system integration, including API consumption and development (experience with infrastructure management is a bonus)
  • Familiarity with leading cloud providers, such as GCP, AWS, or Azure
  • Experience being on-call
  • Ownership - Experience being on-call, driving system quality higher, managing costs and constantly recognizing areas to simplify while always improving composability
  • Autonomous and Self-Motivated - thrives in a high-trust environment, driving projects forward and making independent decisions with confidence
  • Adaptable and Resilient - comfortably navigates ambiguity and views challenges as opportunities for growth
  • Driven and Persistent - maintains a strong focus on goals, confidently overcoming obstacles and eagerly seeking innovative solutions to exceed expectations
Job Responsibility
Job Responsibility
  • Make valuable contributions to the deepest recesses of our tech stack – including writing typesafe Ruby and Rails with Sorbet, building performant Kotlin microservices, managing highly available Spark clusters, working on deterministic Temporal workflows, and Terraforming all the things in AWS
  • Operate in hostile territory on all sides–building and managing a data diffing ETL engine that operates across every type of data warehouse and SQL dialect imaginable on data you don’t own or control, and send it to countless permutations of destinations that could respond any number of unpredictable ways
  • Be a reliable and steadfast operator, participating in the team on-call rotation to help diagnose, triage and fix performance bottlenecks and gnarly edge-cases our customers discover
  • Work closely with our product team to help design and develop new features to improve data pipeline observability and debuggability
What we offer
What we offer
  • 100% employer-paid medical insurance*
  • Generous paid time-off policy (PTO), plus paid sick time, inclusive parental leave policy, holidays, and volunteer days off
  • RSU stock grants*
  • Professional development and training opportunities
  • Company virtual happy hours, free food, and fun team-building activities
  • Monthly cell phone stipend
  • Access to an innovative mental health support platform that offers personalized care and resources in areas such as: therapy, coaching, and self-guided mindfulness exercises for all covered employees and their covered dependents
  • Fulltime
Read More
Arrow Right

Senior Staff Software Engineer: Data & Storage Platform

Uber’s Data Platform is the heart of the company’s critical decision-making and ...
Location
Location
United States , Seattle; San Francisco; Sunnyvale
Salary
Salary:
267000.00 - 297000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 14+ Years of Engineering Excellence: Proven experience designing and operating world-class distributed data and storage systems
  • Mastery of Storage Internals: Extensive storage experience is a must
  • Deep expertise in: Batch & Object Storage: HDFS, Cloud Object Storage (S3/GCS/OCI), and Blobstore metadata management
  • Storage Optimization: Practical experience with Apache Hudi or Apache Iceberg for lakehouse architectures
  • Transactional Systems: Experience with distributed transactional storage (e.g., Docstore, Google Spanner, TiDB)
  • NoSQL & Cache: Cassandra, Redis, and high-throughput Key-Value stores
  • Data + AI Convergence: Deep understanding of how compute fabrics (Spark, Flink, Ray) integrate with vector databases and model-serving platforms
  • Query Engine Proficiency: Architect-level knowledge of Presto, Trino, or Hive for large-scale analytical processing
  • Systems Programming: Expert-level command of Java, Go, Scala, or C++ with a focus on performance tuning and distributed consensus
Job Responsibility
Job Responsibility
  • Architect the Multi-Modal Fabric: Unify batch, streaming, and AI compute into one intelligent fabric, enabling real-time insights and trustworthy AI agents at a global scale
  • Revolutionize Storage & Catalog: Drive the architecture for a unified catalog and metadata management service for unstructured data, leveraging native cloud object store capabilities
  • Operationalize AI Intelligence: Partner with teams like QueryCopilot and DataIQ to bridge human validation with autonomous reasoning through agentic workflows
  • Lead Storage Modernization: Evolve our massive-scale persistence layers—including Docstore (Transactional Distributed Storage) and Distributed MySQL—to increase resiliency and reduce operational overhead
  • Open Source & Act as a force multiplier by contributing to the community (Hudi, Iceberg, Presto)
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right

Staff Software Engineer

The Staff Software Engineer on the Engineering team is responsible for the imple...
Location
Location
India , Pune
Salary
Salary:
Not provided
logicmonitor.com Logo
LogicMonitor
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of software development experience in commercial or enterprise applications
  • 6+ years of full-time experience as a Java developer on Linux platform
  • BS or above degree in computer science or related field
  • Expertise with latest Java development framework and open-source tools
  • Extensive experience and knowledge with inner workings of JVM
  • Strong understanding of web application architectures, specifically Apache Tomcat
  • Experience in SaaS Product Development dealing with large volumes of data
  • Deep SQL / NoSQL database knowledge, including following databases: MySQL, Cassandra, and ElasticSearch
  • Extensive experience with one of the following Big Data technologies: Apache Spark, Kafka Streams, AWS Kinesis/Firehose
  • Experience designing large, complex distributed systems
Job Responsibility
Job Responsibility
  • Prioritize and plan for deliverables in an iterative development strategy, according to our 2 week scrum schedule & 1 week regression testing
  • Design, document, code, and test technical solution for new systems or enhancements to existing systems
  • Follow agile software development methodologies for implementation
  • Working with various teams in LogicMonitor to deliver software products that support LogicMonitor's business growth
  • Provides technical leadership mentoring and guidance at senior engineering levels and below
  • Trusted to represent the team to other functional teams
  • Coordination, Communication, and Collaboration between management, product, techops, support, developers
  • Envision system features and functionalities by analyzing business requirements
  • Troubleshoot and resolve product/application issues for escalated support cases
  • Collaborate with a diverse, distributed development organization
  • Fulltime
Read More
Arrow Right

Staff Software Engineer - IPO (Identity & Performance Optimizations)

Adyen is seeking a highly skilled and enthusiastic Staff Software Engineer with ...
Location
Location
Netherlands , Amsterdam
Salary
Salary:
Not provided
adyen.com Logo
Adyen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 10-12 years of extensive industry hands-on experience in Software Engineering and building data intensive applications, including system design, architecture, and development
  • Experience with building scalable, reliable, and low latency systems
  • Strong programming skills in Java
  • Experience in incident management
  • Experience working with and using SQL and NoSQL databases like Postgres and Cassandra
  • Knowledge of streaming & queuing frameworks
  • Deep curiosity and drive to continually learn new technologies and techniques
  • Familiarity with data engineering concepts, big data tools (like Spark), and distributed data architectures
  • Passion for applying technology to solve real business problems in a pragmatic and impactful way
  • A good product sense and the ability to understand how research can translate into valuable applications
Job Responsibility
Job Responsibility
  • Design and develop robust, scalable software systems that integrate Machine Learning models into production environments
  • Collaborate closely with product managers, data scientists, and business stakeholders to understand business problems and translate them into technical solutions
  • Evaluate new technologies, frameworks, and tools, and drive their adoption where appropriate to solve real business challenges
  • Mentor and guide junior engineers and peers through technical leadership and knowledge sharing
  • Advocate for best practices in software engineering, and system architecture
  • Continuously learn and stay up-to-date with advancements in software engineering, infrastructure, and AI technologies
  • Lead technical design discussions and contribute to the strategic direction of the technology roadmap
  • Close cross functional collaboration with different, non-technical functions like Account Management, Compliance, Operations etc
What we offer
What we offer
  • Direct access to massive global datasets (e.g., payments, identity data) and the ability to see your work have an immediate impact at scale
  • Environment of ownership and speed, where a focused research team can make a significant difference
  • Be at the forefront of bridging AI and fintech, shaping how these two critical areas intersect
Read More
Arrow Right