This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Junior Data Scientist, you will get involved on projects that encompass the foundational principles of data science, such as Machine Learning, Experimentation, Statistical Modeling and Causal Inference. Most of your actions will hinge on these core and essential activities: aligning on business objectives, formulating problem statements, collecting, cleaning and structuring data, conducting analytical explorations, applying techniques for inferential and predictive modeling and effectively communicating outcomes. You will work on projects of varying sizes and time horizons, aligning your prioritizations with the Operations Coordinator. You will translate business questions into testable hypotheses and rigorous experimental designs, or into mathematical models and ML algorithms. You will deliver your work iteratively and incrementally, ensuring i) your code follows the best practices in terms of quality, reusability, optimization, ii) your analyses are robust, valid, reliable, scalable and impactful, iii) your deliverables are properly maintained and adapted to evolving requirements. You will investigate opportunities in various topics, such as: Market Expansion with randomized controlled trials (RCTs) or ML models for product launch; Market Health models or DnD models for policy evaluation; Audits with regression discontinuity design (RDD) or reinforcement control learning for auditing threshold; Document Processing with natural language processing (NLP) or optical character recognition (OCR) for document classification.
Job Responsibility:
Formulate business questions into testable hypotheses and rigorous experimental designs
Collect and clean data, analyze experiments, infer causality and estimate effects
Use Statistical Modeling and ML to find patterns and make predictions from large datasets
Engineer features, deploy models and improve them as needed
Visualize and report your findings to Stakeholders, adapting your communication to a variety of audiences
Communicate findings to Stakeholders, adapting your communication to a variety of audiences
Requirements:
Knowledge of causal inference, experimentation and econometrics statistics methods for analytical problems
Ability to design, conduct and analyze experiments to infer causality and estimate effects
Understanding of statistical modeling and ML to find patterns and make predictions from large datasets
Skilled in SQL and Python (Pandas Library, basics of OOP and API endpoints)
Hands-on experience in common ML frameworks, tools & libraries (scikit-learn, scipy)
Knowledge of ML methods (neural networks, naive Bayes, SVM, decision forests, etc.)
Ability to engineer features, deploy models and improve them as needed
Ability to visualize and report your findings to Stakeholders, adapting your communication to a variety of audiences
Motivation to learn all along the way and improve continuously