This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Citi is looking for a Senior Big Data Engineer to design, build, and optimize large-scale data pipelines and distributed data systems that power critical business intelligence across the organisation.
Job Responsibility
Build and maintain scalable data pipelines using PySpark
Design and develop solutions across the Hadoop ecosystem
Develop and manage real-time and batch data workflows using streaming data platforms
Write complex SQL queries
Design and implement data models and data architecture patterns
Automate pipeline scheduling and orchestration using shell scripting and Autosys
Independently identify, assess, and resolve technical risks and data issues
Requirements
Hands-on expertise in PySpark and Big Data processing
Practical knowledge of the Hadoop ecosystem, including Hive, HDFS, Sqoop, Spark, Impala, and Scala
Proficiency in complex SQL query development
Solid understanding of distributed systems architecture
Demonstrated knowledge of data modelling and data design
Competence in shell scripting and job scheduling using Autosys
Strong analytical and problem-solving ability
Clear and effective communication skills
Nice to have
Familiarity with streaming data platforms such as Apache Kafka
Exposure to cloud-based Big Data environments and modern data lakehouse architectures
Experience working in financial services or regulated industries
What we offer
Hybrid working model
Access to continuous learning and development programmes
Exposure to large-scale, complex data systems
A collaborative and inclusive team environment
Competitive compensation and a comprehensive benefits package