This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
A 6 month remote internship opportunity offering you a glimpse into the real world applications of machine learning and data engineering to build scalable solutions for enterprises. Our aim - creating an experience that allows final year college students to learn how fast growing startups work, gain practical skills, build real world experience, develop a greater understanding of the GenAI and data engineering industry and form valuable connections. At the end of their stints, top performing residents will get an opportunity to meet the co- founders and the team in our Bangalore office and shall be awarded full time positions at Ema.
Job Responsibility:
Research, design, implement, optimize and deploy deep learning models that advance the state of the art in perception and control for autonomous driving
A typical day to day includes reading deep learning papers, implementing described models and algorithms, adapting them to our setting and driving up internal metrics
Play a pivotal role in developing and maintaining sophisticated enterprise-level software applications, with a focus on back-end systems, API development, and the integration of language models and NLP technologies
Train machine learning models at Ema
Develop state-of-the-art algorithms in one or all of the following areas: Prompt engineering for LLM models, Fine tuning models, Training open source models, large-scale distributed training
Comparing and benchmarking performance of different models
Optimize deep neural networks and the associated preprocessing/postprocessing code to run efficiently on an embedded device
Conduct analysis that includes data gathering, data transformation, data processing and analysis
Work with large complex data sets, solve difficult non-routine analysis problems, and apply advanced analytical methods
Build and prototype analysis pipelines iteratively to provide insights at scale
Requirements:
Currently enrolled in a full-time degree program in Computer Science or related field graduating by June 2026
Top rankers in JEE Advanced and consistent high academic record (GPA) preferred
Strong interest in either: Machine Learning or Natural Language Processing (NLP), ideally in one or more of these areas: Generative AI, Natural Language Understanding (NLU), Natural Language Generation (NLG), Structured Prediction, Unsupervised Learning & Representation Learning
AND/OR Systems programming, data engineering, data processing frameworks, streaming databases and data visualization
Strong in coding skills and be ready to pick up any task and run with it
Experience with SQL and other programming languages like Python, Cala, or R
Experience with programming languages like Python and familiarity with frameworks like PyTorch, Tensorflow, Keras, etc.