CrawlJobs Logo

Sr Data Scientist

Genzeon

Location Icon

Location:
India, Hyderabad

Category Icon
Category:
IT - Software Development

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Sr Data Scientist

Job Responsibility:

  • LLM Architecture: Good understanding of the architecture underlying large language models, such as Transformer-based models and their variants. Design and implement deep learning model architectures using PyTorch
  • Language Model Training and Fine-Tuning: Experience in training large-scale language models from scratch, as well as fine-tuning pre-trained models on domain data
  • Data Preprocessing for NLP: Skilled in preprocessing textual data, including tokenization, stemming, lemmatization, and handling of different text encoding
  • Transfer Learning and Adaptation: Proficiency in applying transfer learning techniques to adapt existing LLMs to new languages, domains, or specific business needs
  • Data Annotation and Evaluation: Skills in designing and implementing data annotation strategies for training LLMs and evaluating their performance using appropriate metrics
  • Scalability and Deployment: Experience in scaling LLMs for production environments, ensuring efficiency and robustness in deployment
  • Model Training, Optimization, and Evaluation: Evaluate the performance of PyTorch models using appropriate metrics and techniques like cross-validation, holdout sets, or online evaluation. This encompasses the complete cycle of training, fine-tuning, and validating language models. You will be designing and adapting LLMs for use in virtual assistants, Information retrieval and extraction etc
  • Experimentation with Emerging Technologies and Methods: Actively exploring new technologies and methodologies in language model development, including experimental frameworks and software tools
  • LLM Alignment: Understanding of algorithms like DPO, PPO, KPO, RLHF and using it for guardrails
  • AI Data Retrieval: Data retrieval from unstructured data, extract key value pairs using techniques like donut, layoutLM, table transformers
  • Analyze data and build EDAs to identify data patterns
  • Hands-on and strong understanding of concepts in Deep Learning and NLP
  • Proficient in TensorFlow and similar libraries

Requirements:

  • 5+ years of hands-on experience in developing and deploying Large Language Models, and Machine learning and working with Pytorch
  • A thorough understanding of machine learning, particularly deep learning techniques, including knowledge of neural network architectures, training methods, and optimization algorithms
  • Proficiency in AI technology, Python, including experience with NLP libraries (e.g., Hugging Face Transformers, NLTK, spaCy), text classification
  • Experience with frameworks: PyTorch, or Tensorflow
  • Experience with cloud services (AWS, Azure) and ML deployment tool Docker
  • Familiarity with model fine-tuning and optimization techniques for LLMs
  • Proven track record of innovative solutions in the field of LLMs
  • Strong communication skills, with the ability to explain complex AI concepts to non-expert audiences

Nice to have:

  • 4+ years' experience in data analytics, data science, quantitative analysis using statistical computer languages to draw insights from large data sets
  • 3+ years' experience in Python development, preferably delivering production code for data applications
  • Experience with unstructured data or computer vision models is a plus
  • Experience with SQL is a big plus
  • Extensive model implementation experience using Scikit
  • Experience designing and developing for security critical applications
  • experience with the specifics for HIPAA/PHI/PII/GDPR a big plus
  • Basic experience with Linux, Git, Jupyter Notebooks is must
  • Knowledge of Agile development practices
  • Flexibility and adaptability to respond to a rapidly changing environment
  • Experience with distributed computational techniques and job orchestration tools and platforms is very valuable: airflow, etc

Additional Information:

Job Posted:
December 08, 2025

Job Link Share:
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.