This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As part of the Data Management & Analytics department, this internship offers a unique opportunity to contribute to the setup of a centralized clinical Data Lake integrating both structured and unstructured data. This Data Lake will serve as the foundation for future analytics and AI initiatives, study design modelling and AI-driven photo evaluation. The intern will participate in defining the data architecture, building ingestion workflows, and supporting downstream analyses. This internship is intended for students completing their final academic requirements.
Job Responsibility:
Participate in the design and implementation of the Data Lake (Azure / Microsoft Fabric / OneLake)
Develop ingestion pipelines for structured and unstructured data
Define a metadata tagging process for unstructured data
Collaborate to define data modelling and standardization strategies
Envision and develop an AI-powered agent
Document all implementation steps to ensure project continuity
Requirements:
Master in IT Engineering, Data Engineering/Science, or a related field
Strong Python, SQL, and Spark skills
experience with ETL/ELT and data quality controls
Familiarity with Azure / Microsoft Fabric / OneLake (or similar cloud data platforms)
Curious, rigorous, autonomous
strong analytical and problem‑solving abilities
Ability to work independently and collaboratively in a cross-functional environment