CrawlJobs Logo

Starburst Data Engineer

realign-llc.com Logo

Realign

Location Icon

Location:
United States , Charlotte, NC

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

130000.00 USD / Year

Job Description:

Job Title: Starburst Data Engineer. Location – Charlotte ,NC & Plano , TX. FTE Only.

Job Responsibility:

  • Demonstrated expertise in Starburst Data Virtualization, including designing and implementing data virtualization solutions, optimizing query performance across distributed data sources, and integrating Starburst with enterprise data architectures
  • Analyze data mapping documents and business requirements to design comprehensive test plans and cases
  • Perform source-to-target data reconciliation, check data loading, and ensure transformation rules are applied correctly
  • Write complex SQL scripts for validation (count, data completeness, data consistency, data truncation)
  • Identify, log, and track data defects using tools like JIRA or HP ALM or Octane
  • Automate test scripts and validate data volume, performance, and scalability
  • Validate HiveQL, HDFS file structures, and data processing within the Hadoop cluster

Requirements:

  • Minimum 10 years experience
  • Primary Skill: Starburst
  • Secondary: Data Virtualization Engineer, Dremio, Presto, SQL Performance Tuning, Shell Scripting, Autosys
  • Expert-level knowledge of SQL for data analysis
  • Experience with tools such as Informatica and IDMC
  • Understanding of data warehouse concepts and architectures (e.g., star/snowflake schema)
  • Familiarity with Hadoop or Spark is often preferred
  • Strong analytical and troubleshooting skills
  • Excellent communication for collaborating with developers and stakeholders
  • Domain: Banking knowledge, Payments knowledge preferred
  • Concept: Data Virtualization, Data Warehousing, Data Transformation, ETL/ELT, Data Quality

Nice to have:

  • Familiarity with Hadoop or Spark is often preferred
  • Domain: Banking knowledge, Payments knowledge preferred

Additional Information:

Job Posted:
March 21, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Starburst Data Engineer

Senior PySpark Data Engineer

We are seeking a highly skilled and experienced Senior PySpark Data Engineer to ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field
  • 6+ years of professional experience in a data engineering role
  • Extensive hands-on experience with PySpark and advanced Python programming skills
  • Proven experience with Big Data ecosystems, including Cloudera and/or DataBricks
  • Hands-on experience with distributed query engines like Starburst (Trino/Presto)
  • Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
  • Strong expertise in SQL and experience with relational and non-relational databases
  • Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling techniques
  • Experience working in a Linux/Unix environment
  • GIT HUB, CI/CD Pipeline
Job Responsibility
Job Responsibility
  • Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark
  • Develop, schedule, and monitor complex data workflows using orchestration tools like Apache Airflow
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions
  • Optimize and tune Spark jobs for performance and efficiency
  • Implement data quality checks and ensure data integrity across all data pipelines
  • Design and implement data models for optimal storage and retrieval
  • Mentor junior data engineers and promote best practices in data engineering
  • Ensure compliance with data governance and security policies
  • Troubleshoot and resolve data-related issues in a timely manner
  • Fulltime
Read More
Arrow Right

Big Data / PySpark Engineering Lead - Vice President

The Applications Development Technology Lead Analyst is a senior level position ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Highly experienced and skilled technical lead with 12+years of experience with software building and platform engineering
  • Experience in Data Engineering, focused on Big Data ecosystems
  • Knowledge in Hadoop, YARN, Hive, Impala, Spark, and Spark SQL with extensive high volume of data processing pipeline development
  • Programming Expert level and hand on experience in Python
  • Familiarity with data formats like Avro, Parquet, CSV, JSON
  • Hands-on experience in writing SQL queries
  • Highly experienced with Unix based operating systems and shell scripting
  • Experience with source code management tools such as Bitbucket, Git etc
  • Big Data Tech Proficiency and hands-on in Hadoop, Spark, Hive, Kafka, and NoSQL databases (MongoDB, HBase)
  • Experience working with query engines like Trino, Presto, Starburst
Job Responsibility
Job Responsibility
  • Design and implement scalable, fault-tolerant batch and real-time data processing pipelines
  • Develop robust data models and schema designs optimized for both performance and storage efficiency
  • Evaluate and integrate emerging tools and frameworks (e.g., Spark, Flink, Kafka) into the existing stack
  • Provide in-depth analysis with interpretive thinking to define issues and develop innovative solutions
  • Develop comprehensive knowledge of how areas of business, such as architecture and infrastructure, integrate to accomplish business goals
  • Legacy Systems Decommissioning: Lead the strategic migration of data and logic from legacy platforms (e.g. on-premises SQL Servers) to a modern Data Lakehouse environment
  • ETL/ELT Transformation: Re-engineer existing stored procedures and complex legacy ETL jobs into scalable, distributed processing frameworks using Spark (Python) and Starburst/Trino
  • Validation & Parity Testing: Design and implement automated frameworks for Data Parity Testing to ensure 100% accuracy and consistency between legacy outputs and new big data results
  • Schema Evolution: Map and transform rigid, legacy relational schemas into flexible, high-performance formats optimized for the cloud (e.g., Parquet, Avro, or Iceberg)
  • Phased Cutover Management: Orchestrate a phased migration strategy (Parallel Run, Shadow Execution) to ensure zero downtime for downstream business applications and reporting tools
  • Fulltime
Read More
Arrow Right
New

Senior PySpark Data Engineer

We are seeking a highly skilled and experienced Senior PySpark Data Engineer to ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional relevant experience in a data engineering role
  • Extensive hands-on experience with PySpark and advanced Python programming skills
  • Proven experience with Big Data ecosystems, including Cloudera and/or DataBricks
  • Hands-on experience with distributed query engines like Starburst (Trino/Presto)
  • Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
  • Strong expertise in SQL and experience with relational and non-relational databases
  • Solid understanding of data warehousing concepts, ETL/ELT processes, and data modeling techniques
  • Experience working in a Linux/Unix environment
  • GIT HUB, CI/CD Pipeline
  • Bachelor’s degree/University degree or equivalent experience
Job Responsibility
Job Responsibility
  • Design, develop, and maintain robust, scalable, and high-performance data pipelines using PySpark
  • Develop, schedule, and monitor complex data workflows using orchestration tools like Apache Airflow
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver high-quality data solutions
  • Optimize and tune Spark jobs for performance and efficiency
  • Implement data quality checks and ensure data integrity across all data pipelines
  • Design and implement data models for optimal storage and retrieval
  • Mentor junior data engineers and promote best practices in data engineering
  • Ensure compliance with data governance and security policies
  • Troubleshoot and resolve data-related issues in a timely manner
  • Fulltime
Read More
Arrow Right

Business Intelligence Developer

The CTI Enterprise Analytical Services (EAS) organization is actively recruiting...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience with Business Intelligence tool or data federation tools such as Starburst developer or administrator
  • 3+ years of Linux, Shell scripting, Ansible experience
  • 8+ years overall IT experience
  • Knowledge of the Hadoop ecosystem with experience in Hive, Spark, etc. is a plus
  • Knowledge of Java or any programming language is a plus
  • Good interpersonal skills with excellent communication skills - written and spoken English
  • Able to interact with client projects in cross-functional teams
  • Good team player interested in sharing knowledge and cross-training other team members and shows interest in learning new technologies and products
  • 5+ years of hands-on experience in setting up security (authentication and authorization) for Business Intelligence or data federation products
  • Experience with container technologies, Kubernetes, and cloud architectures, including some exposure to public cloud platforms such as AWS, GCP
Job Responsibility
Job Responsibility
  • Deliver the tooling and capabilities needed to enable data & analytics services such as Starburst, Tableau on massive, distributed data sets
  • Understand Engineering needs including those required to build, maintain, and operate the system through all phases of its life
  • Create and maintain continuous integration and deployment processes including testing and monitoring to ensure the solution is reliable and measurable
  • Take full ownership of designing solutions, and building blueprints, prototypes, and frameworks to drive enablement of new capabilities
  • Collaborate with cross-functional teams to build a portfolio of capabilities for recommendation and use in new product developments
  • Publish best practices, configuration recommendations, design patterns, tool/technology selection methodologies, and playbooks for Engineering and user communities
  • Collaborate with cross-functional Engineering teams to build a portfolio of capabilities to recommend and use in analytical product development across Citi lines of Businesses
  • Enable Hybrid cloud implementation along with security for Business Intelligence products
  • Enable Business Intelligence products on external Cloud platforms as SaaS or PaaS solutions and integrate with various Cloud and on-prem data sources
  • Build reusable security and deployment framework for Business Intelligence services enabled on Cloud and on-prem
  • Fulltime
Read More
Arrow Right

Equities Quant Platform Engineering Lead – Python

Citi's Equities Technology team is undergoing significant growth and investment,...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive background in delivering production-grade, data-centric applications for quantitative trading and analytics
  • Demonstrated expertise in the python data engineering stack (Polars, Parquet, FastAPI, Jupyter, Airflow, Streamlit, Ray)
  • Demonstrated expertise in high-performance data stores and query engines (Starburst, Snowflake)
  • Demonstrated expertise in real-time streaming analytics technologies (Kafka, Flink)
  • Demonstrated expertise in cloud container technologies (AWS, Azure, GCP, Docker, Kubernetes)
  • Proven success in enhancing developer experience that reduces friction in coding, building and deploying APIs and client libraries
  • Real-world application of generative AI prompt engineering and RAG pipelines
  • Full-stack HTML5 web development skills
Job Responsibility
Job Responsibility
  • Guide the technical direction and implementation of the platform
  • Championing engineering excellence through hands-on feature creation, rigorous code quality via pull request reviews, and by mentoring junior engineers to establish robust coding standards and guardrails
  • Architecting scalable, secure re-usable components
  • Drive the design process, aligning with or challenging existing blueprints, seeking consensus from senior leads and stakeholders
  • Staying up to date with open-source solutions and latest trends to accelerate business outcomes
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Apps Dev Tech Lead Analyst - Vice President

As a key member of our global development team, you will: Innovate & Develop: Pa...
Location
Location
United States , Irving
Salary
Salary:
125760.00 - 188640.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years of progressive experience in systems analysis and programming of software applications
  • Strong proficiency in Java application technologies, including deep experience with TDD (Test-Driven Development), Spring framework, and Microservices architecture
  • Extensive hands-on experience with PySpark and advanced Python programming skills
  • Proven experience with Big Data ecosystems, including Cloudera and/or Data Bricks
  • Hands-on experience with distributed query engines like Starburst (Trino/Presto)
  • Proficient in designing and managing complex workflows using scheduling tools, particularly Apache Airflow
  • Strong expertise in SQL and experience with relational and non-relational databases
  • Excellent knowledge of algorithms and data structures, design patterns
  • Strong Java experience: Java core, collections, concurrency, streams
  • Frameworks and APIs: Spring (Core, Batch, Integration, MVC, Boot, Data), Hibernate, Jackson, JAX RS, JPA, JAXB
Job Responsibility
Job Responsibility
  • Innovate & Develop: Partner closely with project managers, business stakeholders, and senior managers to translate complex business requirements into well-architected technical solutions
  • Drive cross-functional collaboration with diverse management teams
  • Proactively identify, define, and implement necessary system enhancements
  • Complex Problem Resolution: Lead the resolution of high-impact problems and critical projects
  • Consult with users, clients, and other technology groups on issues
  • Technical Architecture & Standards Leadership: Serve as a subject matter expert in application programming
  • Leverage an advanced understanding of system flow to develop and enforce robust standards for coding, testing, debugging, and implementation
  • Mentorship & Talent Development: Act as a trusted advisor and coach for mid-level developers and analysts
  • Provide technical guidance, mentorship, and code reviews to junior data engineers
  • Operational Excellence: Ensure adherence to best practices and essential procedures
What we offer
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • Fulltime
Read More
Arrow Right

Staff Data Engineer

The Staff Data Engineer role is part of the Bamboo Health Engineering Team. You ...
Location
Location
United States
Salary
Salary:
Not provided
bamboohealth.com Logo
Bamboo Health
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Analytics, a related field, or equivalent experience
  • 8+ years total software and relational database development experience
  • 3+ years with a strong demonstrated ability to develop and maintain ETL solutions, ideally using Python and various Application Programming Interfaces (API)
  • 3+ years’ experience with AWS Cloud Solutions/Services
  • Experience working in an Agile environment include using ticketing software such as JIRA
  • Strong technical problem-solving abilities
  • Hands on experience maintaining databases on Redshift, PostgreSQL, MySQL or Oracle relational database systems.
  • Experience with software development using Python, Ruby, or other modern scripting languages, ideally in container solutions such as Docker or Kubernetes.
  • Proficiency with modern data stack tools such as dbt, Starburst, AWS Glue.
  • The ability to travel periodically for work.
Job Responsibility
Job Responsibility
  • Develop, debug and support ETL processes utilizing AWS services
  • Lead ideation and development of data models used for data science and analytics
  • Meet the delivery expectations of the Agile Project Management methodology
  • Maintain and optimize reports and extracts that serve as lifesaving information sources to customers
  • Create clear and concise documentation regarding technical solutions, while sharing knowledge and documentation with teammates via “Lunch and Learns”
  • Collaborate with internal and external customers to deliver modern data products
  • Explore opportunities to enhance workflows through AI or automation tools (e.g., document summarization, task routing, or data parsing).
  • Identify repetitive tasks and partner with team leads to implement scalable automation solutions.
What we offer
What we offer
  • Receive competitive compensation, including health, dental, vision and other benefits
Read More
Arrow Right

Database Development Engineer

The Database Development Engineer is an intermediate level position responsible ...
Location
Location
Canada , Mississauga
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-8 years of relevant experience
  • Experience in systems analysis and programming of software applications
  • Experience in managing and implementing successful projects
  • Oracle SQL & PL/SQL Expertise – Strong knowledge of writing queries, stored procedures, triggers, and performance tuning
  • Database Migration & ETL – Experience in moving data between Oracle and other databases (e.g., PostgreSQL, SQL Server)
  • Python for Data Migration – Proficiency in using Python libraries like cx_Oracle, SQLAlchemy, and pandas for data extraction, transformation, and loading (ETL)
  • Data Transformation & Cleansing – Hands-on experience with data validation, transformation, and error handling
  • Shell Scripting & Automation – Writing scripts to automate database tasks and migrations
  • Performance Optimization – Indexing, query tuning, and bulk data loading techniques (e.g., SQL*Loader, DBMS_DATAPUMP)
  • Experience with StarBurst Data is an added advantage
Job Responsibility
Job Responsibility
  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas
  • Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
  • Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgement
  • Recommend and develop security measures in post implementation analysis of business usage to ensure successful system design and functionality
  • Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems
  • Ensure essential procedures are followed and help define operating standards and processes
  • Serve as advisor or coach to new or lower level analysts
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and /or other team members
  • Fulltime
Read More
Arrow Right