This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Senior Gen AI & Prompt Engineer @ Deloitte Competence Center. Are you looking for an opportunity in Thessaloniki, Greece that offers you a hybrid working model? Are you ready to work on international projects for industry-leading clients around the world and elevate your tech consulting career to an international level?
Job Responsibility:
GenAI-Native Platform Implementation: We specialize in deploying GenAI-powered applications across software development, integration, and testing, enabling organizations to accelerate the adoption of Generative AI solutions while ensuring robustness, security, and scalability
Accelerated AI Capabilities: Our services drive the rapid adoption of Generative AI by integrating advanced AI models into enterprise environments, unlocking new opportunities for automation, content generation, knowledge management, and enhanced decision-making
Advanced AI Architectures: Our expertise lies in designing architectures that seamlessly integrate Generative AI models with enterprise systems, enabling a secure, scalable, and compliant AI ecosystem that maximizes business value
Requirements:
Proven experience working with Large Language Models (LLMs) and Generative AI architectures for tasks such as text generation, summarization, translation, code generation, and multimodal AI applications
Mastery in designing effective and optimized prompts, leveraging Few-shot learning, Chain-of-Thought (CoT), Retrieval-Augmented Generation (RAG), and Self-Consistency prompting to enhance model performance and contextual accuracy
Hands-on experience integrating Generative AI models into enterprise applications, chatbots, virtual assistants, and knowledge management systems using LangChain, LlamaIndex, or similar orchestration tools
Strong understanding of LLM inference optimization, model distillation, parameter-efficient fine-tuning (LoRA, QLoRA), and deployment strategies on cloud (AWS, Azure, GCP), on-premises, and edge environments
Expertise in embedding models, vector databases, and RAG pipelines to enhance model relevance and contextual understanding
Strong experience with LLM APIs (e.g. OpenAI, Anthropic) and open-source model hosting, including running private AI models in secure environments for enterprises
Nice to have:
Experience in fine-tuning and adapting foundation models (e.g., GPT, Claude, Gemini, LLaMA) using domain-specific datasets, reinforcement learning (RLHF), and instruction tuning techniques
Hands-on experience with Django and FastAPI for building scalable backend services and APIs for Generative AI applications. Experience with React.js for developing interactive AI-powered user interfaces and front-end experiences
Familiarity and use of any VCS (e.g., git)
Experience with Docker and Kubernetes
What we offer:
Modern hybrid workplace, characterized by flexibility and Smart Working
Empowered well-being: We provide multiple program offerings to support your well-being needs (flexible working arrangements, extra days of leave, parental allowances)
Engagement within international large-scale teams and projects, with opportunities to travel for training or client purposes
Constant opportunities for learning with unlimited access to internal and external learning platforms and sponsored certificates aligned with business needs and technology trends
Challenging and innovating environment where personal development and growth are encouraged, always with transparency and trust
Diverse culture and active communities that enable you to bring yourself to work
Team Building and Corporate Social Responsibility Activities
A buddy to support you with your onboarding
Private medical health insurance plan
Ticket restaurant card
Exclusive Discounts to several retail providers, restaurants and others
Mobile phone
Fresh fruits and unlimited coffee every day at our offices