This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
This Data Solutions Engineer (Applications Development Senior Programmer Analyst - C12) is responsible for building next-generation Data Engineering solutions. This intermediate-level position involves active participation in the establishment and implementation of new or revised application systems and programs in coordination with the Technology team. A key aspect of this role is liaising between business users and technologists to facilitate the exchange of information regarding solutions, including requirements and usage.
Job Responsibility:
Serve as an integral team member of our Data Engineering team, responsible for the design and development of Big Data solutions
Partner with domain experts, product managers, analysts, and data scientists to develop robust Big Data pipelines in Hadoop or Snowflake environments
Responsible for delivering a data-as-a-service framework
Responsible for moving all legacy workloads to cloud platform
Lead the migration of all legacy workloads to cloud platforms
Engage with key stakeholders to elicit and document requirements, including detailed data flow specifications
Assess appropriate solutions and collaborate with relevant teams to drive optimal implementations
Work with data scientists to build client pipelines using heterogeneous sources and provide essential engineering services for data science applications
Research and evaluate open-source technologies and components, recommending and integrating them into design and implementation efforts
Act as a technical expert, mentoring other team members on Big Data and Cloud technology stacks
Define comprehensive requirements for maintainability, testability, performance, security, quality, and usability across the data platform
Drive the implementation of consistent patterns, reusable components, and coding standards for all data engineering processes
Convert SAS-based pipelines into modern languages like PySpark and Scala for execution on Hadoop and non-Hadoop ecosystems
Optimize Big Data applications on both Hadoop and non-Hadoop platforms for peak performance
Evaluate new IT developments and evolving business requirements, recommending appropriate system alternatives and/or enhancements to current systems through analysis of business processes, systems, and industry standards
Appropriately assess risk when making business decisions, demonstrating consideration for the firm's reputation and safeguarding Citigroup, its clients, and assets
Requirements:
5+ years of experience with Hadoop and Big Data technologies
Demonstrated proficiency in Python, PySpark, and Scala, including practical experience with fundamental machine learning libraries
Experience in developing robust data solutions leveraging Google Cloud or AWS platforms
Experience with SAS
Experience with containerization and related technologies (e.g., Docker, Kubernetes)
Comprehensive understanding of software engineering and data analytics
In-depth knowledge and hands-on experience with the Hadoop ecosystem and Big Data technologies (e.g., HDFS, MapReduce, Hive, Pig, Impala, Kafka, Kudu, Solr)
Knowledge of Agile (Scrum) development methodologies
Strong development and automation skills
System-level understanding of data structures, algorithms, distributed storage, and compute
A proactive approach to solving complex business problems, complemented by strong interpersonal and teamwork skills
Bachelor’s degree/University degree or equivalent experience
Applicants must be authorized to work in the U.S for this position
Candidate must be located within commuting distance or be willing to relocate to the area
Nice to have:
Familiarity with Hadoop administration and Snowflake
Proficiency in Java or additional experience with Apache Beam
What we offer:
medical, dental & vision coverage
401(k)
life, accident, and disability insurance
wellness programs
paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
discretionary and formulaic incentive and retention awards