CrawlJobs Logo

Staff Software Engineer, Spark (Java)

cloudera.com Logo

Cloudera

Location Icon

Location:
Hungary , Budapest

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises. The Data Platform Pillar is the bedrock of Cloudera’s technology, where we design and build the core components that let our customers store, manage, and process data with unmatched scalability, security, and performance. Cloudera is seeking a Staff Software Engineer, Spark (Java) with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark and Livy. The role involves building enterprise-grade systems for customers running Spark on thousands of nodes and processing petabytes of data. We are looking for a passionate engineer eager to enhance a product already supporting major production systems and to drive the next-generation Data Engineering experience. You will collaborate with a distributed team across the United States and Hungary, including multiple Apache Spark committers.

Job Responsibility:

  • Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
  • Contribute to Apache Spark, Livy
  • Develop new features in Scala/Java/Python on a modern platforms
  • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems
  • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark
  • Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
  • Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
  • Work on improving internal infrastructure
  • Collaborate with other team members and stakeholders

Requirements:

  • 6+ years professional software development
  • Experience leading and delivering complex product enhancements
  • Strong understanding of at least one of the following languages: Java, Scala, Python
  • Experience with systems design, development
  • Passionate about programming, clean coding habits, attention to detail, and focus on quality
  • Strong oral and written communication skills
  • Strong ability to research and solve problems independently without constant supervision
  • Open-minded, desire to learn new things and build great products
  • Experience with distributed systems

Nice to have:

  • Experience with SQL planners
  • Experience with using/developing Apache Spark, Livy or other related technologies
  • Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling
  • Solid experience with at least one cloud
What we offer:
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Staff Software Engineer, Spark (Java)

Senior Staff Software Engineer

As a Senior Staff Software Engineer, you will join a highly performing team of e...
Location
Location
United States , New York
Salary
Salary:
156400.00 - 225000.00 USD / Year
siriusxm.com Logo
SiriusXM
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science/Mathematics or a similar field
  • 12+ years of software engineering experience in Java programming language (preferably JDK17 or higher)
  • 5+ years of experience developing and designing data applications and data pipelines
  • 7+ years of experience crafting microservices and scalable products, utilizing diverse cloud platforms (ideally AWS)
  • Strong understanding of engineering software processes, lifecycle methodology, configuration management, release management, and system verification and testing
  • Ability to work independently and oversee entire projects or significant parts, focusing on completing the tasks on time
  • Proven ability to research and become proficient in new technologies
  • Strong analytical and problem-solving skills, with meticulous attention to detail and a dedication to continuous improvement
  • Proficiency in constructing detailed software architecture diagrams
  • Interpersonal skills and ability to interact and work with staff at all levels
Job Responsibility
Job Responsibility
  • Design and build high-performance, reliable, and scalable reporting APIs and data pipelines
  • Lead high-level architecture discussions and planning sessions work with ad measurement, revenue and addressability data
  • Work with various teams of engineers building software in a collaborative development process
  • Execute tasks with utmost clarity and precision, demonstrating a strong sense of ownership and providing clear direction to drive projects forward effectively
  • Collaborate with the Product team to clarify the scope of the projects by giving thoughtful feedback which challenges and clarifies requirements intent
  • Collaborate with the Global Operations and Cloud Infrastructure teams to ensure flawless production deployments and support the Incident Management team
  • Responsible for conducting technical interviews as needed, ensuring a consistently high bar for engineering excellence and performance standards
  • Responsible for mentoring and guiding junior engineers.
What we offer
What we offer
  • discretionary short-term and long-term incentives
  • Fulltime
Read More
Arrow Right

Staff /Sr Staff/ Principal Engineer - Lakehouse

Balbix is the world's leading platform for cybersecurity posture automation. Usi...
Location
Location
India , Bangalore; Gurgaon
Salary
Salary:
Not provided
balbix.com Logo
Balbix
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in backend software development dealing with large scale applications involving large scale data
  • Proven experience in defining and improving application design and architecture
  • Drive to discover and learn the required new technologies
  • Exposure to state of the art technologies for large scale data systems
  • Proficiency in programming languages such as Python, Scala or Java
  • Hands-on experience with large scale technologies such as Apache Spark, Apache Flink, Cassandra
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud Platform
  • Bachelor's or Master's degree in Computer Science, Engineering, or related field
  • Excellent problem-solving and analytical skills
  • Strong communication and collaboration skills
Job Responsibility
Job Responsibility
  • Collaborate with product managers, data scientists, and other stakeholders to understand requirements and translate them into technical solutions
  • Design, develop, and deploy high scale systems using state of the art technologies
  • Build reliable, consistent and high throughput data services and interfaces
  • Mentor junior developers and contribute to knowledge sharing within the team
  • Help define and ensure the best practices and guidelines across the systems
  • Optimize and tune applications for performance and scalability
  • Troubleshoot and resolve issues in production environments
  • Fulltime
Read More
Arrow Right

Staff Data Engineer

As a Staff Data Engineer, you will be leading the architecture, design and devel...
Location
Location
United States; Canada , Remote
Salary
Salary:
Not provided
https://www.1password.com Logo
1Password
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 8+ years of professional software engineering experience
  • Minimum of 7 years technical engineering experience building data processing applications (batch and streaming) with coding in languages
  • In-depth, hands-on experience on extensible data modeling, query optimizations and work in Java, Scala, Python, and related technologies
  • Experience in data modeling across external facing product insights and business processes, such as revenue/sales operations, finance, and marketing
  • Experience with Big Data query engines such as Hive, Presto, Trino, Spark
  • Experience with data stores such as Redshift, MySQL, Postgres, Snowflake, etc.
  • Experience using Realtime technologies like Apache Kafka, Kinesis, Flink, etc.
  • Experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP with extensive use of datastores like RDBMS, key-value stores, etc.
  • Experience leveraging distributed systems at scale and systems knowledge on infrastructure hardware, resources bare-metal hosts to containers to networking.
Job Responsibility
Job Responsibility
  • Design, develop, and automate large-scale, high-performance batch and streaming data processing systems to drive business growth and enhance product experience
  • Build data engineering strategy that supports a rapidly growing tech company and aligns with the priorities across our product strategy and internal business organizations’ desire to leverage data for more competitive advantages
  • Build scalable data pipelines using best-in-class software engineering practices
  • Develop optimal data models for storage and retrieval, meeting critical product and business requirements
  • Establish and execute short and long-term architectural roadmaps in collaboration with Analytics, Data Platform, Business Systems, Engineering, Privacy and Security
  • Lead efforts on continuous improvement to the efficiency and flexibility of the data, platform, and services
  • Mentor Analytics & Data Engineers on best practices, standards and forward-looking approaches on building robust, extensible and reusable data solutions
  • Influence and evangelize high standard of code quality, system reliability, and performance.
What we offer
What we offer
  • Maternity and parental leave top-up programs
  • Generous PTO policy
  • Four company-wide wellness days
  • Company equity for all full-time employees
  • Retirement matching program
  • Free 1Password account
  • Paid volunteer days
  • Employee-led inclusion and belonging programs and ERGs
  • Peer-to-peer recognition through Bonusly
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Spark (Java)

At Cloudera, we empower people to transform complex data into clear and actionab...
Location
Location
Hungary , Budapest; Szeged
Salary
Salary:
Not provided
cloudera.com Logo
Cloudera
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years professional software development
  • Experience leading and delivering complex product enhancements
  • Strong understanding of at least one of the following languages: Java, Scala, Python
  • Experience with systems design, development
  • Passionate about programming, clean coding habits, attention to detail, and focus on quality
  • Strong oral and written communication skills
  • Strong ability to research and solve problems independently without constant supervision
  • Open-minded, desire to learn new things and build great products
  • Experience with distributed systems
Job Responsibility
Job Responsibility
  • Design new features for Cloudera’s data engineering experience, and take them from prototypes to leading a team to deliver the feature in production at scale
  • Contribute to Apache Spark, Livy
  • Develop new features in Scala/Java/Python on a modern platforms
  • Gain expertise in distributed data processing, from SQL planners and optimizers, to data layout and table formats like Apache Parquet and Iceberg, to fault tolerance in distributed systems
  • Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, but focusing on Iceberg and Spark
  • Get to work on large scale distributed systems, from 100s to 1000s of nodes, in production clusters
  • Debug system level deployment issues, root cause analysis, perform system test analysis and resolve failures
  • Work on improving internal infrastructure
  • Collaborate with other team members and stakeholders
What we offer
What we offer
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Data Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for i...
Location
Location
United States , New York
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • 4+ years technical engineering experience building data processing applications (batch and streaming) with coding in languages including, but not limited to, Python, Java, Spark, SQL
  • Experience working with Apache Hadoop eco system, Kafka, NoSQL, etc
  • 3+ years experience with data governance, data compliance and/or data security
  • 2+ years' experience building scalable services on top of public cloud infrastructure like Azure, AWS, or GCP. Extensive use datastores like RDBMS, key-value stores, etc
  • 2+ years' experience building distributed systems at scale and extensive systems knowledge that spans bare-metal hosts to containers to networking
  • Ability to identify, analyze, and resolve complex technical issues, ensuring optimal performance, scalability, and user experience
  • Dedication to writing clean, maintainable, and well-documented code with a focus on application quality, performance, and security
Job Responsibility
Job Responsibility
  • Build scalable data pipelines for sourcing, transforming and publishing data assets for AI use cases
  • Work collaboratively with other Platform, infrastructure, application engineers as well as AI Researchers to build next generation data platform products and services
  • Ship high-quality, well-tested, secure, and maintainable code
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff - Data Platform

If you are excited by the challenge of designing distributed systems that proces...
Location
Location
United States , Mountain View; Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 3+ years experience in business analytics, data science, software development, data modeling, or data engineering OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, data modeling, or data engineering OR equivalent experience
  • Proficiency in Python, Scala, Java, or Go
  • Deep Distributed Systems Knowledge: Demonstrated technical understanding of massive-scale compute engines (e.g., Apache Spark, Flink, Ray, Trino, or Snowflake)
  • Experience architecting Lakehouse environments at scale (using Delta Lake, Iceberg, or Hudi)
  • Experience building internal developer platforms or "Data-as-a-Service" APIs
  • Strong background in streaming technologies (Kafka, Azure EventHubs, Pulsar) and stateful stream processing
  • Experience with container orchestration (Kubernetes) for deploying data applications
  • Experience enabling AI/ML workloads (Feature Stores, Vector Databases)
Job Responsibility
Job Responsibility
  • Core Platform Engineering: Design and build the underlying frameworks (based on Spark/Databricks) that allow internal teams to process massive datasets efficiently
  • Distributed Systems Architecture: Modernize our data stack by moving from batch-heavy patterns to event-driven architectures
  • Unstructured AI Data Pipelines: Architect high-throughput pipelines capable of processing complex, non-tabular data (documents, code repositories, chat logs) for LLM pre-training, fine-tuning and evaluations datasets
  • AI Feedback Loops: Engineer the high-throughput telemetry systems that capture user interactions with Copilot
  • Infrastructure as Code: Treat the data platform as software. Define and deploy all storage, compute, and networking resources using IaC (Bicep/Terraform)
  • Data Reliability Engineering: Move beyond simple "validation checks" to build automated governance and observability systems
  • Compute Optimization: Deep-dive into query execution plans and cluster performance. Optimize shuffle operations, partition strategies, and resource allocation
  • Fulltime
Read More
Arrow Right
New

Senior Software Engineer

Wells Fargo is seeking a Senior Software Engineer (Data Engineering/Generative A...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
February 22, 2026
Flip Icon
Requirements
Requirements
  • 4+ years of Software Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • Demonstrate in-depth understanding of Data Warehousing (DWH) concepts
  • Strong experience in large data transformation projects
  • Experience designing and optimizing solutions for high-volume batch processing
  • Good experience on Unix Shell scripting
  • At least 4 years of experience working with any RDBMS
  • At least 4 years of experience in building big data pipelines
  • At least 3 years of experience working with Apache Spark, Java, Hive, Hadoop and GCP
  • At least 2+ years of experience in using LLMs for automation
  • Strong experience with programming in Python, Java, SQL and good understanding of bash scripting for data processing and automation
Job Responsibility
Job Responsibility
  • Lead moderately complex initiatives and deliverables within technical domain environments
  • Contribute to large scale planning of strategies
  • Design, code, test, debug, and document for projects and programs associated with technology domain, including upgrades and deployments
  • Review moderately complex technical challenges that require an in-depth evaluation of technologies and procedures
  • Resolve moderately complex issues and lead a team to meet existing client needs or potential new clients needs while leveraging solid understanding of the function, policies, procedures, or compliance requirements
  • Collaborate and consult with peers, colleagues, and mid-level managers to resolve technical challenges and achieve goals
  • Lead projects and act as an escalation point, provide guidance and direction to less experienced staff
  • Fulltime
!
Read More
Arrow Right

Senior Staff Software Engineer

As a Senior Staff Software Engineer, you will join a highly performing team of e...
Location
Location
United States , Oakland; Washington; Atlanta; New York; Irving
Salary
Salary:
156400.00 - 225000.00 USD / Year
siriusxm.com Logo
SiriusXM
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science/Mathematics or a similar field
  • 8+ years of software engineering experience in Java programming language (preferably JDK17 or higher)
  • 5+ years of experience developing and designing data applications and data pipelines
  • 8+ years of experience crafting microservices and scalable products, utilizing diverse cloud platforms (ideally AWS)
  • Strong understanding of engineering software processes, lifecycle methodology, configuration management, release management, and system verification and testing
  • Ability to work independently and oversee entire projects or significant parts, focusing on completing the tasks on time
  • Proven ability to research and become proficient in new technologies
  • Strong analytical and problem-solving skills, with meticulous attention to detail and a dedication to continuous improvement
  • Proficiency in constructing detailed software architecture diagrams
  • Interpersonal skills and ability to interact and work with staff at all levels
Job Responsibility
Job Responsibility
  • Design and build high-performance, reliable, and scalable reporting APIs and data pipelines
  • Lead high-level architecture discussions and planning sessions work with ad measurement, revenue and addressability data
  • Work with various teams of engineers building software in a collaborative development process
  • Execute tasks with utmost clarity and precision, demonstrating a strong sense of ownership and providing clear direction to drive projects forward effectively
  • Collaborate with the Product team to clarify the scope of the projects by giving thoughtful feedback which challenges and clarifies requirements intent
  • Collaborate with the Global Operations and Cloud Infrastructure teams to ensure flawless production deployments and support the Incident Management team
  • Responsible for conducting technical interviews as needed, ensuring a consistently high bar for engineering excellence and performance standards
  • Responsible for mentoring and guiding junior engineers.
What we offer
What we offer
  • Discretionary short-term and long-term incentives
  • Fulltime
Read More
Arrow Right