Starburst Data Engineer Job at Realign (Charlotte, NC)

Senior PySpark Data Engineer

We are seeking a highly skilled and experienced Senior PySpark Data Engineer to ...

Location

India , Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field
6+ years of professional experience in a data engineering role
Extensive hands-on experience with PySpark and advanced Python programming skills
Proven experience with Big Data ecosystems, including Cloudera and/or DataBricks
Hands-on experience with distributed query engines like Starburst (Trino/Presto)
Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
Strong expertise in SQL and experience with relational and non-relational databases
Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling techniques
Experience working in a Linux/Unix environment
GIT HUB, CI/CD Pipeline

Job Responsibility

Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark
Develop, schedule, and monitor complex data workflows using orchestration tools like Apache Airflow
Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions
Optimize and tune Spark jobs for performance and efficiency
Implement data quality checks and ensure data integrity across all data pipelines
Design and implement data models for optimal storage and retrieval
Mentor junior data engineers and promote best practices in data engineering
Ensure compliance with data governance and security policies
Troubleshoot and resolve data-related issues in a timely manner

Fulltime

Big Data / PySpark Engineering Lead - Vice President

The Applications Development Technology Lead Analyst is a senior level position ...

Location

India , Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Highly experienced and skilled technical lead with 12+years of experience with software building and platform engineering
Experience in Data Engineering, focused on Big Data ecosystems
Knowledge in Hadoop, YARN, Hive, Impala, Spark, and Spark SQL with extensive high volume of data processing pipeline development
Programming Expert level and hand on experience in Python
Familiarity with data formats like Avro, Parquet, CSV, JSON
Hands-on experience in writing SQL queries
Highly experienced with Unix based operating systems and shell scripting
Experience with source code management tools such as Bitbucket, Git etc
Big Data Tech Proficiency and hands-on in Hadoop, Spark, Hive, Kafka, and NoSQL databases (MongoDB, HBase)
Experience working with query engines like Trino, Presto, Starburst

Job Responsibility

Design and implement scalable, fault-tolerant batch and real-time data processing pipelines
Develop robust data models and schema designs optimized for both performance and storage efficiency
Evaluate and integrate emerging tools and frameworks (e.g., Spark, Flink, Kafka) into the existing stack
Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
Legacy Systems Decommissioning: Lead the strategic migration of data and logic from legacy platforms (e.g. on-premises SQL Servers) to a modern Data Lakehouse environment
ETL/ELT Transformation: Re-engineer existing stored procedures and complex legacy ETL jobs into scalable, distributed processing frameworks using Spark (Python) and Starburst/Trino
Validation & Parity Testing: Design and implement automated frameworks for Data Parity Testing to ensure 100% accuracy and consistency between legacy outputs and new big data results
Schema Evolution: Map and transform rigid, legacy relational schemas into flexible, high-performance formats optimized for the cloud (e.g., Parquet, Avro, or Iceberg)
Phased Cutover Management: Orchestrate a phased migration strategy (Parallel Run, Shadow Execution) to ensure zero downtime for downstream business applications and reporting tools

Fulltime

New

Senior PySpark Data Engineer

We are seeking a highly skilled and experienced Senior PySpark Data Engineer to ...

Location

India , Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

6+ years of professional relevant experience in a data engineering role
Extensive hands-on experience with PySpark and advanced Python programming skills
Proven experience with Big Data ecosystems, including Cloudera and/or DataBricks
Hands-on experience with distributed query engines like Starburst (Trino/Presto)
Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
Strong expertise in SQL and experience with relational and non-relational databases
Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling techniques
Experience working in a Linux/Unix environment
GIT HUB, CI/CD Pipeline
Bachelor’s degree/University degree or equivalent experience

Job Responsibility

Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark
Develop, schedule, and monitor complex data workflows using orchestration tools like Apache Airflow
Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions
Optimize and tune Spark jobs for performance and efficiency
Implement data quality checks and ensure data integrity across all data pipelines
Design and implement data models for optimal storage and retrieval
Mentor junior data engineers and promote best practices in data engineering
Ensure compliance with data governance and security policies
Troubleshoot and resolve data-related issues in a timely manner

Fulltime

Business Intelligence Developer

The CTI Enterprise Analytical Services (EAS) organization is actively recruiting...

Location

India , Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5+ years of experience with Business Intelligence tool or data federation tools such as Starburst developer or administrator
3+ years of Linux, Shell scripting, Ansible experience
8+ years overall IT experience
Knowledge of the Hadoop ecosystem with experience in Hive, Spark, etc. is a plus
Knowledge of Java or any programming language is a plus
Good interpersonal skills with excellent communication skills - written and spoken English
Able to interact with client projects in cross-functional teams
Good team player interested in sharing knowledge and cross-training other team members and shows interest in learning new technologies and products
5+ years of hands-on experience in setting up security (authentication and authorization) for Business Intelligence or data federation products
Experience with container technologies, Kubernetes, and cloud architectures, including some exposure to public cloud platforms such as AWS, GCP

Job Responsibility

Deliver the tooling and capabilities needed to enable data & analytics services such as Starburst, Tableau on massive, distributed data sets
Understand Engineering needs including those required to build, maintain, and operate the system through all phases of its life
Create and maintain continuous integration and deployment processes including testing and monitoring to ensure the solution is reliable and measurable
Take full ownership of designing solutions, and building blueprints, prototypes, and frameworks to drive enablement of new capabilities
Collaborate with cross-functional teams to build a portfolio of capabilities for recommendation and use in new product developments
Publish best practices, configuration recommendations, design patterns, tool/technology selection methodologies, and playbooks for Engineering and user communities
Collaborate with cross-functional Engineering teams to build a portfolio of capabilities to recommend and use in analytical product development across Citi lines of Businesses
Enable Hybrid cloud implementation along with security for Business Intelligence products
Enable Business Intelligence products on external Cloud platforms as SaaS or PaaS solutions and integrate with various Cloud and on-prem data sources
Build reusable security and deployment framework for Business Intelligence services enabled on Cloud and on-prem

Fulltime

Equities Quant Platform Engineering Lead – Python

Citi's Equities Technology team is undergoing significant growth and investment,...

Location

United Kingdom , London

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Extensive background in delivering production-grade, data-centric applications for quantitative trading and analytics
Demonstrated expertise in the python data engineering stack (Polars, Parquet, FastAPI, Jupyter, Airflow, Streamlit, Ray)
Demonstrated expertise in high-performance data stores and query engines (Starburst, Snowflake)
Demonstrated expertise in real-time streaming analytics technologies (Kafka, Flink)
Demonstrated expertise in cloud container technologies (AWS, Azure, GCP, Docker, Kubernetes)
Proven success in enhancing developer experience that reduces friction in coding, building and deploying APIs and client libraries
Real-world application of generative AI prompt engineering and RAG pipelines
Full-stack HTML5 web development skills

Job Responsibility

Guide the technical direction and implementation of the platform
Championing engineering excellence through hands-on feature creation, rigorous code quality via pull request reviews, and by mentoring junior engineers to establish robust coding standards and guardrails
Architecting scalable, secure re-usable components
Drive the design process, aligning with or challenging existing blueprints, seeking consensus from senior leads and stakeholders
Staying up to date with open-source solutions and latest trends to accelerate business outcomes

What we offer

27 days annual leave (plus bank holidays)
A discretional annual performance related bonus
Private Medical Care & Life Insurance
Employee Assistance Program
Pension Plan
Paid Parental Leave
Special discounts for employees, family, and friends
Access to an array of learning and development resources

Fulltime

Apps Dev Tech Lead Analyst - Vice President

As a key member of our global development team, you will: Innovate & Develop: Pa...

Location

United States , Irving

Salary:

125760.00 - 188640.00 USD / Year

Citi

Expiration Date

Until further notice

Requirements

6-10 years of progressive experience in systems analysis and programming of software applications
Strong proficiency in Java application technologies, including deep experience with TDD (Test-Driven Development), Spring framework, and Microservices architecture
Extensive hands-on experience with PySpark and advanced Python programming skills
Proven experience with Big Data ecosystems, including Cloudera and/or Data Bricks
Hands-on experience with distributed query engines like Starburst (Trino/Presto)
Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
Strong expertise in SQL and experience with relational and non-relational databases
Excellent knowledge of algorithms and data structures, design patterns
Strong Java experience: Java core, collections, concurrency, streams
Frameworks and APIs: Spring (Core, Batch, Integration, MVC, Boot, Data), Hibernate, Jackson, JAX RS, JPA, JAXB

Job Responsibility

Innovate & Develop: Partner closely with project managers, business stakeholders, and senior managers to translate complex business requirements into well-architected technical solutions
Drive cross-functional collaboration with diverse management teams
Proactively identify, define, and implement necessary system enhancements
Complex Problem Resolution: Lead the resolution of high-impact problems and critical projects
Consult with users, clients, and other technology groups on issues
Technical Architecture & Standards Leadership: Serve as a subject matter expert in application programming
Leverage an advanced understanding of system flow to develop and enforce robust standards for coding, testing, debugging, and implementation
Mentorship & Talent Development: Act as a trusted advisor and coach for mid-level developers and analysts
Provide technical guidance, mentorship, and code reviews to junior data engineers
Operational Excellence: Ensure adherence to best practices and essential procedures

What we offer

medical, dental & vision coverage
401(k)
life, accident, and disability insurance
wellness programs
paid time off packages including planned time off (vacation), unplanned time off (sick leave), and paid holidays

Fulltime

Staff Data Engineer

The Staff Data Engineer role is part of the Bamboo Health Engineering Team. You ...

Location

United States

Salary:

Not provided

Bamboo Health

Expiration Date

Until further notice

Requirements

Bachelor’s degree in computer science, Analytics, a related field, or equivalent experience
8+ years total software and relational database development experience
3+ years with a strong demonstrated ability to develop and maintain ETL solutions, ideally using Python and various Application Programming Interfaces (API)
3+ years’ experience with AWS Cloud Solutions/Services
Experience working in an Agile environment include using ticketing software such as JIRA
Strong technical problem-solving abilities
Hands on experience maintaining databases on Redshift, PostgreSQL, MySQL or Oracle relational database systems.
Experience with software development using Python, Ruby, or other modern scripting languages, ideally in container solutions such as Docker or Kubernetes.
Proficiency with modern data stack tools such as dbt, Starburst, AWS Glue.
The ability to travel periodically for work.

Job Responsibility

Develop, debug and support ETL processes utilizing AWS services
Lead ideation and development of data models used for data science and analytics
Meet the delivery expectations of the Agile Project Management methodology
Maintain and optimize reports and extracts that serve as lifesaving information sources to customers
Create clear and concise documentation regarding technical solutions, while sharing knowledge and documentation with teammates via “Lunch and Learns”
Collaborate with internal and external customers to deliver modern data products
Explore opportunities to enhance workflows through AI or automation tools (e.g., document summarization, task routing, or data parsing).
Identify repetitive tasks and partner with team leads to implement scalable automation solutions.

What we offer

Receive competitive compensation, including health, dental, vision and other benefits

Database Development Engineer

The Database Development Engineer is an intermediate level position responsible ...

Location

Canada , Mississauga

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5-8 years of relevant experience
Experience in systems analysis and programming of software applications
Experience in managing and implementing successful projects
Oracle SQL & PL/SQL Expertise – Strong knowledge of writing queries, stored procedures, triggers, and performance tuning
Database Migration & ETL – Experience in moving data between Oracle and other databases (e.g., PostgreSQL, SQL Server)
Python for Data Migration – Proficiency in using Python libraries like cx_Oracle, SQLAlchemy, and pandas for data extraction, transformation, and loading (ETL)
Data Transformation & Cleansing – Hands-on experience with data validation, transformation, and error handling
Shell Scripting & Automation – Writing scripts to automate database tasks and migrations
Performance Optimization – Indexing, query tuning, and bulk data loading techniques (e.g., SQL*Loader, DBMS_DATAPUMP)
Experience with StarBurst Data is an added advantage

Job Responsibility

Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas
Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgement
Recommend and develop security measures in post implementation analysis of business usage to ensure successful system design and functionality
Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems
Ensure essential procedures are followed and help define operating standards and processes
Serve as advisor or coach to new or lower level analysts
Has the ability to operate with a limited level of direct supervision
Can exercise independence of judgement and autonomy
Acts as SME to senior stakeholders and /or other team members

Fulltime

Starburst Data Engineer

Realign

Location:
United States , Charlotte, NC ▼
Plano, TX

Category:
IT - Software Development

Contract Type:
Employment contract

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
March 21, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Starburst Data Engineer

Senior PySpark Data Engineer

Big Data / PySpark Engineering Lead - Vice President

Senior PySpark Data Engineer

Business Intelligence Developer

Equities Quant Platform Engineering Lead – Python

Apps Dev Tech Lead Analyst - Vice President

Staff Data Engineer

Database Development Engineer

Starburst Data Engineer

Realign

Location:United States , Charlotte, NC ▼Plano, TX

Category:IT - Software Development

Contract Type:Employment contract

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:March 21, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Starburst Data Engineer

Senior PySpark Data Engineer

Big Data / PySpark Engineering Lead - Vice President

Senior PySpark Data Engineer

Business Intelligence Developer

Equities Quant Platform Engineering Lead – Python

Apps Dev Tech Lead Analyst - Vice President

Staff Data Engineer

Database Development Engineer

Location:
United States , Charlotte, NC ▼
Plano, TX

Category:
IT - Software Development

Contract Type:
Employment contract

Job Posted:
March 21, 2026