This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Head of the Data Center will lead the planning, design, implementation, and ongoing development of the Data Center, a newly established central unit at the Max Planck Institute of Geoanthropology. This facility will support researchers in the collection, integration, and analysis of diverse datasets across disciplines such as geography, anthropology, history, archaeology, ecology, sociology, climatology, and biology.
Job Responsibility:
Lead the planning, design, implementation, and ongoing development of the Data Center
Conceptualize, develop, and execute management workflows
Design, implement, maintain, and integrate databases
Manage a team of five dedicated staff members
Liaise with the Department Directors, independent research group leaders, and other Core Unit heads
Develop and implement efficient data acquisition and compilation from diverse field, lab, archival and published sources
Develop management strategies to support in-depth exploration of anthropogenic factors in the evolution of Earth systems including diverse modelling approaches
Collaborate with researchers across the Institute to support data availability and storage through models, web databases, and scientific publications
Work closely with IT service units to ensure that curated datasets are accessible and meet scientific standards for security, compliance, and usability
Represent the MPI-GEA Data Center within the Max Planck Society and at international forums, fostering partnerships for data-sharing, responding to emerging technologies, and guiding interdisciplinary initiatives for geoanthropological research and Anthropocene studies
Requirements:
Experience in managing multidisciplinary teams in an international and intercultural environment, including technical and scientific staff
PhD in Computer Science or a related field, with several years of experience in distributed databases, large-scale data management, and data archiving
Proven expertise in handling large datasets, particularly in geospatial and diachronic applications
Strong knowledge of data management processes, leadership, and implementation of large-scale systems
Experience in planning, designing, and deploying scalable infrastructure for large datasets and databases, including containerization and orchestration technologies such as Docker, Kubernetes, MariaDB, PostgreSQL, Apache Cassandra, Neo4J, etc.
Proficiency in semantic data modelling, utilizing technologies such as RDF, XML, JSON-LD, and Linked Open Data (LOD), etc.
Experience with data integration and API technologies such as RESTful API, GraphQL, Apache Camel, etc.
Experience with data processing & ETL tools such as Apache Hadoop, Apache Spark, Apache Nifi, etc.
Background in scientific research with a track record of high-quality publications
Nice to have:
Ability to communicate complex technical concepts effectively to both scientific and public audiences
Knowledge of data protection, security protocols, and compliance standards
Experience with AI/LLM integration for research applications, including RAG systems and vector databases
Familiarity with multiple database platforms
Experience in data archiving and long-term preservation concepts
Experience in data processing and data visualization