CrawlJobs Logo

Data Engineer with AI Skills

India, Bangalore · Job Posted June 14, 2026
Apply Position
Job Link Share

Job Responsibility

  • Data Platform Migration: Design and execute migration from traditional Data Lake architecture to Lakehouse architecture
  • Modernize data platforms using Apache Spark, Apache Iceberg / Open Table formats, Snowflake / Cloud Data Warehouse
  • Refactor and migrate existing data pipelines to AWS cloud-native services
  • Code Migration & Modernization: Analyze and migrate legacy codebases (Java / ETL pipelines) to cloud-native architectures
  • Optimize existing workflows for performance, scalability, and cost
  • Ensure adherence to best practices in distributed data processing
  • AI-Assisted Engineering: Use AI tools (e.g., Copilot, ChatGPT, or similar) to accelerate code migration and transformation, automate repetitive development tasks, improve developer productivity
  • Apply Prompt Engineering techniques to generate optimized code solutions, automate documentation, streamline data pipeline transformations
  • Testing & Validation Automation: Design and implement automated testing frameworks for migrated pipelines
  • Use AI-assisted tools to generate test cases, validate data integrity and consistency, ensure high-quality delivery with robust CI/CD and testing practices
  • Cloud & Infrastructure Engineering: Build and manage cloud-native data architectures on AWS, including data storage and compute services, scalable processing pipelines
  • Implement infrastructure following DevOps and IaC (Infrastructure as Code) methodologies

Requirements

  • Strong programming expertise in Java
  • Hands-on experience with Apache Spark, Apache Iceberg / Delta Lake or similar table formats, Snowflake or equivalent data warehouse technologies
  • Deep understanding of Lakehouse architecture principles
  • Strong experience with AWS cloud services (S3, EMR, Glue, Lambda, etc.)
  • Experience building cloud-native, distributed data solutions
  • Proven experience using AI tools for development and productivity
  • Strong Prompt Engineering skills for code generation, migration acceleration, automated testing
  • Experience with large-scale data migration projects
  • Knowledge of data modeling, ETL/ELT, and data pipeline orchestration
  • Familiarity with data governance, lineage, and quality frameworks
  • Experience in automated testing frameworks for data pipelines
  • Knowledge of CI/CD pipelines and DevOps practices
  • Hands-on coding skills (Java / Spark)
  • Use of tools and platforms (AWS, Snowflake, Iceberg)
  • Real-time code migration scenarios
  • Application of AI tools & prompt engineering
  • Testing strategies and automation of data pipelines

Nice to have

  • Experience with large-scale enterprise data migrations
  • Exposure to modern data architectures (Lakehouse, Data Mesh)
  • Certifications in AWS / Data Engineering / Cloud Platforms

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Data Engineer with AI Skills

8 matching positions

Senior Data Engineer (with AI)

Provectus is a leading AI consultancy and AWS Premier Consulting Partner with 15...
Location
Location
Poland , Wroclaw Metropolitan Area
Salary
Salary:
Not provided
provectus.com Logo
Provectus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of hands-on engineering experience with production systems
  • Full-stack mindset, comfortable across AI, Backend development, Data, and cloud infrastructure
  • Autonomous working style
  • Experience adopting AI tools in day-to-day workflows (e.g. Claude Code, GitHub Copilot, or similar)
  • Strong sense of ownership and proactivity
  • Openness to broadening skills into adjacent areas
  • B2+ English, comfortable collaborating across distributed, multicultural teams
  • Strong Python and SQL skills and solid software engineering fundamentals
  • Hands-on experience with Apache Spark for large-scale data processing
  • Proficiency with cloud data warehouse technologies: Snowflake, Redshift, or ClickHouse
Job Responsibility
Job Responsibility
  • Design, build, and maintain robust data pipelines and ML systems for production environments
  • Develop and deploy ML and LLM-based solutions addressing real client business challenges
  • Build and maintain ETL/ELT workflows using modern orchestration and distributed computing tools
  • Implement MLOps practices: CI/CD, automated testing, model monitoring, and experiment tracking
  • Architect and implement cloud-native data and AI/ML solutions, primarily on AWS
  • Collaborate closely with Data Scientists, AI/ML Engineers, Backend Engineers, and client stakeholders
  • Participate in code reviews, contribute to technical documentation, and share knowledge within the team
  • Engage in client-facing discussions to understand requirements and propose technical solutions
What we offer
What we offer
  • Impactful work: projects span GenAI, MLOps, and NextGen data platforms for global enterprises across multiple industries
  • Senior-calibre peers: collaborate with top ML and Data professionals across North America, LATAM, and EMEA
  • Career growth: a clear path toward Tech Lead if you have the ambition — we actively develop our engineers
  • Recognised expertise: AWS Premier Consulting Partner featured in Forrester’s AI Technical Services Landscape
Read More
Arrow Right

Data Engineer with Generative AI Expertise

We are looking for a skilled Data Engineer with expertise in Generative AI to jo...
Location
Location
India , Jaipur
Salary
Salary:
Not provided
infoobjects.com Logo
InfoObjects
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related fields
  • 2-6 years of hands-on experience in Data Engineering
  • Proficiency in Generative AI frameworks (e.g., GPT, DALL-E, Stable Diffusion)
  • Strong programming skills in Python, SQL, and familiarity with Java or Scala
  • Experience with data tools and platforms such as Apache Spark, Hadoop, or similar
  • Knowledge of cloud platforms like AWS, Azure, or GCP
  • Familiarity with MLOps practices and AI model deployment
  • Excellent problem-solving and communication skills
Job Responsibility
Job Responsibility
  • Design, develop, and maintain robust data pipelines and workflows
  • Integrate Generative AI models into existing data systems to enhance functionality
  • Collaborate with cross-functional teams to understand business needs and translate them into scalable data and AI solutions
  • Optimize data storage, processing, and retrieval systems for performance and scalability
  • Ensure data security, quality, and governance across all processes
  • Stay updated with the latest advancements in Generative AI and data engineering practices
Read More
Arrow Right

Senior Data Engineer – Data Engineering & AI Platforms

We are looking for a highly skilled Senior Data Engineer (L2) who can design, bu...
Location
Location
India , Chennai, Madurai, Coimbatore
Salary
Salary:
Not provided
optisolbusiness.com Logo
OptiSol Business Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on expertise in cloud ecosystems (Azure / AWS / GCP)
  • Excellent Python programming skills with data engineering libraries and frameworks
  • Advanced SQL capabilities including window functions, CTEs, and performance tuning
  • Solid understanding of distributed processing using Spark/PySpark
  • Experience designing and implementing scalable ETL/ELT workflows
  • Good understanding of data modeling concepts (dimensional, star, snowflake)
  • Familiarity with GenAI/LLM-based integration for data workflows
  • Experience working with Git, CI/CD, and Agile delivery frameworks
  • Strong communication skills for interacting with clients, stakeholders, and internal teams
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable ETL/ELT pipelines across cloud and big data platforms
  • Contribute to architectural discussions by translating business needs into data solutions spanning ingestion, transformation, and consumption layers
  • Work closely with solutioning and pre-sales teams for technical evaluations and client-facing discussions
  • Lead squads of L0/L1 engineers—ensuring delivery quality, mentoring, and guiding career growth
  • Develop cloud-native data engineering solutions using Python, SQL, PySpark, and modern data frameworks
  • Ensure data reliability, performance, and maintainability across the pipeline lifecycle—from development to deployment
  • Support long-term ODC/T&M projects by demonstrating expertise during technical discussions and interviews
  • Integrate emerging GenAI tools where applicable to enhance data enrichment, automation, and transformations
What we offer
What we offer
  • Opportunity to work at the intersection of Data Engineering, Cloud, and Generative AI
  • Hands-on exposure to modern data stacks and emerging AI technologies
  • Collaboration with experts across Data, AI/ML, and cloud practices
  • Access to structured learning, certifications, and leadership mentoring
  • Competitive compensation with fast-track career growth and visibility
  • Fulltime
Read More
Arrow Right

Sr Staff Software Engineer (Data & Ai Platform)

We are looking for a talented and experienced Sr Staff Software Engineer to join...
Location
Location
Israel , Petah Tikva
Salary
Salary:
Not provided
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • At least 5 years of experience with Python
  • Minimum of 2 years of hands-on experience with AWS
  • Proven experience with the full Software Development Life Cycle (SDLC), including design, development, testing, and deployment
  • Excellent problem-solving skills and strong attention to detail
  • Ability to manage multiple projects simultaneously and strong organizational skills
Job Responsibility
Job Responsibility
  • Define and implement best practices for the R&D platform
  • Collaborate with internal and external stakeholders to understand requirements and deliver high-quality solutions
  • Work on edge technologies and cloud platforms to enhance our services
  • Participate in all phases of the SDLC, including design, development, testing, and deployment
  • Ensure high ROI on deliverables by optimizing processes and solutions
  • Share knowledge and best practices across the company to promote a culture of continuous improvement
  • Fulltime
Read More
Arrow Right

Senior Engineer - AI & Data Science

Location
Location
Poland
Salary
Salary:
15025.00 - 22500.00 PLN / Month
https://www.hsbc.com Logo
HSBC
Expiration Date
July 11, 2026
Flip Icon
Requirements
Requirements
  • Strong experience in delivering Data Science and similar solutions
  • Knowledge of ingestion and data preparation techniques, especially for use in AI solutions
  • Experience working with Cloud based APIs for Large Language Models is highly desirable
  • Very good Python skills and strong object-oriented programming concepts
  • Strong knowledge of software engineering architecture concepts for the build of multi-tier solutions that include user interfaces, data, workflow, and controls
  • Strong knowledge of technology and support practices to maintain a production grade user system, including testing practices
  • Strong experience working with developer tools and frameworks such as GitHub, Agile practices, Source control and branching concepts
  • Understanding of how Large Language Models work
  • Knowledge of web UI coding frameworks such as React, NodeJS, Angular, is desirable
  • Strong knowledge and experience of modern Agile and project management practices
Job Responsibility
Job Responsibility
  • Develop Python-based solutions for Global Finance business teams, including but not limited to Generative AI
  • Research and develop new techniques in data science, Generative AI and related areas to improve the quality of outputs
  • Perform detailed testing of prompts and data inputs and outputs
  • Maintain the codebase including frontend, backend, and algorithms in a engineering global team environment
  • Understand and implement appropriate governance and controls required
  • Face off to the business and other stakeholders, including but not limited to the business line teams, Global Finance Analytics, Transformation, Change/Project Management, IT, Governance teams
  • Contribute towards overall strategy development for the team
  • Work closely with the Quantitative Analytics teams to build software solutions that are practical, workable and comply with accounting, regulatory or other requirements
What we offer
What we offer
  • Additional bonuses for recognition awards
  • Multisport card
  • Private medical care
  • Life insurance
  • One-time reimbursement of home office set-up (up to 800 PLN)
  • Cafeteria platform
  • Employee assistance program
  • Additional contributions to PPK scheme
  • Corporate parties & events
  • CSR initiatives
  • Fulltime
Read More
Arrow Right

Lead Data Engineer - AI Search

We are looking for a Lead Data Engineer — AI Search (all genders) to join Valtec...
Location
Location
France , Paris
Salary
Salary:
Not provided
valtech.com Logo
Valtech
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of professional software engineering experience in enterprise environments
  • Strong expertise in Google Cloud Platform (GCP) and AI ecosystems
  • Experience with AI Search, conversational AI, and generative AI integrations
  • Strong application architecture, API design, and enterprise integration experience
  • Solid understanding of cloud-native, microservices, and event-driven architectures
  • Experience with CI/CD pipelines, software quality, observability, and cloud security best practices
  • Experience working with Kubernetes and Infrastructure as Code
  • Technical leadership experience, including mentoring engineers and guiding technical decisions
  • Ability to collaborate across architecture, engineering, cloud, and data teams
  • Strong communication and stakeholder management skills
Job Responsibility
Job Responsibility
  • Define detailed technical design, components, interfaces, and integration patterns
  • Lead development teams and enforce engineering standards and best practices
  • Coordinate architecture, cloud/platform, data, QA, and business stakeholders
  • Ensure delivery quality through testing, observability, and application security practices
  • Guide teams on GCP and AI Search implementation patterns
  • Lead technical discovery, estimation, and solution design workshops
  • Identify technical risks and propose mitigation strategies early
  • Ensure scalable, reliable, and production-ready AI solutions
  • Contribute hands-on in complex and high-impact technical areas
  • Support industrialization and operationalization of AI and search use cases
What we offer
What we offer
  • Flexibility, with remote and hybrid work options (country-dependent)
  • Career advancement, with international mobility and professional development programs
  • Learning and development, with access to cutting-edge tools, training and industry experts
  • Fulltime
Read More
Arrow Right

AI Data Engineer

We are looking for a technically sharp and detail-oriented Data Engineer to join...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, Data Engineering, Mathematics, or a related discipline
  • 4 – 5 years of hands-on experience in data engineering, ETL development, or analytics engineering roles
  • Demonstrable experience with Databricks and/or Microsoft Fabric in a production environment
  • Proficiency in Power BI report and semantic model development
  • Exposure to Collibra or equivalent data governance / cataloguing platforms is strongly preferred
  • Strong SQL and Python skills
  • PySpark experience is required
  • Familiarity with Azure cloud services and DevOps practices for data pipeline deployment
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable ETL/ELT pipelines using Azure Data Factory, Databricks (PySpark / Delta Live Tables), and Microsoft Fabric Data Factory
  • Transform raw, multi-source data into clean, conformed, and analytics-ready datasets following Medallion Architecture principles (Bronze → Silver → Gold)
  • Develop and optimize SQL and PySpark-based transformation logic for structured, semi-structured, and unstructured data
  • Implement incremental load patterns, merge/upsert logic, and slowly changing dimension (SCD) strategies to support historical data tracking
  • Collaborate with the AI Engineers to prepare high-quality feature datasets for ML and LLM use cases
  • Define, implement, and monitor data quality rules including completeness, accuracy, consistency, timeliness, and uniqueness checks
  • Administer and extend the Collibra data governance platform — including business glossary management, data lineage documentation, and stewardship workflows
  • Build automated data quality validation frameworks using tools such as Great Expectations, dbt tests, or Unity Catalog data quality constraints in Databricks
  • Triage and resolve data quality incidents, root-cause data anomalies, and communicate impact to stakeholders proactively
  • Maintain metadata catalogues and ensure all critical datasets have documented ownership, lineage, and classification
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Senior AI & Data Engineer

The Senior AI & Data Engineer is an individual contributor role that acts as the...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Data Science, AI/ML, Engineering, Mathematics, or a related technical discipline
  • PhD is a plus
  • 7 – 10 years of hands-on experience in AI/ML engineering, applied data science, or LLM engineering roles
  • Proven track record of delivering production AI systems
  • Deep expertise with at least two major LLM platforms (Claude, GPT, Gemini, or equivalent)
  • Significant experience with Collibra or an equivalent enterprise data governance platform
  • Demonstrated experience leading cross-functional AI initiatives and mentoring junior engineers
  • Strong ML fundamentals alongside modern generative AI skills
  • Experience with responsible AI practices, including fairness auditing, explainability, and content safety, is strongly preferred
Job Responsibility
Job Responsibility
  • Serve as the dual AI & data SME for the team and organization
  • Define and uphold engineering standards, design patterns, and best practices across both AI and data engineering disciplines
  • Lead technical discovery for new AI and data use cases
  • Participate in and lead cross-functional initiatives where AI and data strategy intersect
  • Mentor and upskill the Applied AI Engineer and AI Data Engineer
  • Architect and deliver complex agentic AI systems
  • Design and implement advanced RAG architectures
  • Lead LLM evaluation frameworks
  • Assess and implement LLM fine-tuning and alignment strategies
  • Own LLM integration architecture
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Read More
Arrow Right