This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Lead moderately complex initiatives within Technology and contribute to large scale data processing framework initiatives related to enterprise strategy deliverables
Build and maintain optimized and highly available data pipelines that facilitate deeper analysis and reporting
Review and analyze moderately complex business, operational or technical challenges that require an in-depth evaluation of variable factors
Oversee the data integration work, including developing a data model, maintaining a data warehouse and analytics environment, and writing scripts for data integration and analysis
Resolve moderately complex issues and lead teams to meet data engineering deliverables while leveraging solid understanding of data information policies, procedures and compliance requirements
Collaborate and consult with colleagues and managers to resolve data engineering issues and achieve strategic goals
Requirements
4+ years of Data Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
4+ years of Data Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
4+ years of Strong Experience of Spark/Scala of scala:map,flatMap,option,future,lazy,implicits,fold,immutability, Spark Core, Spark SQL, Data Frames, Datasets
3+ Experience on Python or other scripting languages
3+ years of data warehouse experience, SQL experience (Oracle, SQL, PL/SQL). Understand and write the complex SQL queries and have in-depth understanding of oracle database
Strong experience with Hadoop, Hive, Autosys
Good knowledge of non-relational DBMS platforms, (MongoDB or Neo4j), real-time data streaming via Kafka
Working knowledge in developing end to end ML/AI pipeline in Apache Spark, Sparkling Waters, H2O, experience in working on LLMs. Exposer to Agentic AI frameworks
Ability to work with Data Engineers in discovering and optimizing bottlenecks in the AI/ML pipeline for real-time or near-real-time applications that consumes large throughput of data
Perform various complex activities related to statistical/machine learning. Provide analytical support for developing, evaluating, implementing, monitoring and executing models across business verticals using emerging technologies including but not limited to Python, Agentic AI frameworks like ADK, GCP vertex AI and NLP
Good understanding of distributed system and parallel processing, spark architecture in depth
Prior knowledge on Ab initio ETL or any other ETL tools would be beneficial but not mandatory
Proficient in large scala data processing, Hands-on experience with different file formats, very good understanding of Unix command & Shell scripting
Hands-on experience with GCP or other cloud platforms (AWS/Azure)
Familiarity with CI/CD pipelines, infrastructure as code and cloud deployment best practices
Ability to conduct code reviews with focus on testability and code coverage
Should have executed end to end projects in data engineering
Should have experience working in an Agile environment using Scrum or Kanban
Fluent in communication
Nice to have
Prior knowledge on Ab initio ETL or any other ETL tools would be beneficial but not mandatory