This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We are seeking a Senior Software Engineer (Java) who combines strong backend engineering skills with a deep interest in data quality and large-scale data correction. This is a highly technical position focused on investigating data issues, debugging complex systems, and writing substantial Java code to repair and improve massive datasets. You’ll work in an iterative, fast-moving environment, diagnosing issues across distributed systems, building safe and repeatable repair workflows, and ensuring data correctness across millions of records.
Job Responsibility:
Investigate, diagnose, and resolve data-quality issues in large-scale backend systems
Design and implement Java-based repair and correction pipelines for high-volume datasets
Debug and fix issues in existing Java codebases supporting distributed systems
Build safe, idempotent jobs to correct data without introducing regressions
Leverage AWS services (such as S3, SQS, and Lambda) to support queue-driven and batch workflows
Analyze system behavior related to performance, concurrency, and memory usage
Clearly communicate findings, risks, and recommendations to technical and non-technical stakeholders
Requirements:
6+ years of professional experience developing Java in large-scale backend systems
Strong debugging, triage, and data analysis skills
Experience working with distributed systems and high-volume processing pipelines
Hands-on experience with AWS, including services such as S3, SQS, and Lambda
Solid understanding of performance optimization, concurrency, and memory-efficient design patterns
Ability to design reliable, repeatable data repair workflows
Comfort operating in ambiguous, evolving problem spaces and deriving practical solutions
Strong communication skills, with the ability to summarize complex technical findings clearly
Nice to have:
Experience working with genealogical, historical, or highly interrelated datasets
Familiarity with queue-driven architectures and event-based processing
Experience improving data integrity in legacy or long-lived systems