CrawlJobs Logo

Data Provisioning Engineer

Czechia, Prague · Job Posted March 01, 2026
Apply Position
Job Link Share

Job Description

Join Barclays as a Data Provisioning Engineer within the Client Analytics programme, a newly established initiative delivering analytics applications for the Investment Bank by removing manual, report-driven processes. In this role, you will be a dedicated resource supporting the implementation of new market data feeds, working directly with internal and external providers to onboard, model, and integrate data into Barclays’ strategic data platforms. You’ll enable reliable data delivery via APIs and batch feeds, maintain end-to-end pipelines, configure AWS storage, and collaborate closely with a wider engineering and analytics team to ensure high-quality, scalable data solutions that underpin emerging client analytics capabilities.

Job Responsibility

  • Build and maintenance of data architectures pipelines that enable the transfer and processing of durable, complete and consistent data
  • Design and implementation of data warehoused and data lakes that manage the appropriate data volumes and velocity and adhere to the required security measures
  • Development of processing and analysis algorithms fit for the intended data complexity and volumes
  • Collaboration with data scientist to build and deploy machine learning models

Requirements

  • Experience with market data ingestion and provisioning mechanisms, such as SFTP, REST APIs, vendor SDKs, scheduled file drops, and polling processes
  • Strong AWS cloud engineering expertise
  • Proven experience in database modelling and data persistence
  • Experience with data quality controls, validation, and monitoring
  • DevOps experience, including setting up and maintaining data pipelines for ingesting, processing, and delivering external data across platforms and systems
  • Solid knowledge of market data providers, such as FactSet, Dealogic, LSEG, and Bloomberg

Nice to have

  • Knowledge of Python
  • Advanced SQL scripting and query optimisation skills

What we offer

  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Data Provisioning Engineer

8 matching positions

New

Full-Stack Data Engineer – Data & ML Automation (Databricks)

We are seeking a Fullstack Data Engineer who can operate at the intersection of ...
Location
Location
India , Pune
Salary
Salary:
Not provided
Codvo AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with CLI tools, scripts, and utilities for automating data platform workflows
  • Experience with Databricks APIs, Terraform, Databricks SDK
  • Experience designing integration tests, end-to-end pipeline tests, validation frameworks for Databricks ETL/ELT pipelines and ML inference workflows
  • Experience building internal applications using React, Streamlit, or similar frameworks
  • Experience with spec-driven development, coding agents and automation patterns, CI/CD workflows for data/ML systems
Job Responsibility
Job Responsibility
  • Develop CLI tools, scripts, and utilities to automate repetitive workflows across the data platform
  • Automate Databricks workflows, job deployments, environment provisioning, and MLOps operations using Databricks APIs, Terraform, Databricks SDK
  • Design and implement integration tests, end-to-end pipeline tests, validation frameworks for Databricks ETL/ELT pipelines and ML inference workflows
  • Improve reliability, observability, and overall engineering productivity across the data & ML team
  • Build quick internal applications using React, Streamlit, or similar frameworks to visualize data flows, provide model inference demos, enable operational or configuration controls
  • Develop internal productivity and monitoring dashboards
  • Apply best practices around spec-driven development, coding agents and automation patterns, CI/CD workflows for data/ML systems
  • Fulltime
Read More
Arrow Right

Senior Data Engineer - Data Platform

We are looking for a Senior Data Engineer - Data Platform to join our Data & AI ...
Location
Location
France , Paris
Salary
Salary:
Not provided
doctolib.fr Logo
Doctolib
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • More than 7 years of experience as Site Reliability Engineer, Data Ops, Data Platform Engineer or in a similar role, with a proven track record of building and maintaining complex data infrastructures
  • Strong proficiency in data engineering and infrastructure tools and technologies, such as stream and events processing (Kafka, PubSub, Firehose) and Kubernetes
  • Expertise in programming languages like Python
  • Familiar with cloud infrastructure and services, preferably AWS, Azure, or GCP, and have experience with infrastructure-as-code tools such as Terraform
  • Excellent problem-solving skills with a focus on identifying and resolving data infrastructure bottlenecks and performance issues
Job Responsibility
Job Responsibility
  • Design and implement a scalable and reliable data infrastructure that supports the collection, processing, storage, and analysis of large-scale datasets while pushing security and privacy best practices
  • Build and maintain data pipelines that efficiently extract, transform, and load data from various sources into our data warehouse
  • Implement automation and orchestration tools to streamline infrastructure provisioning, data workflows, reduce manual effort, and improve operational efficiency
  • Monitor data platform for performance and reliability, identify and troubleshoot issues, and implement proactive solutions to ensure data quality and availability
  • Streamline and monitor platform costs, identify optimizations and saving opportunities while collaborating with data engineers, data scientists, and other stakeholders
What we offer
What we offer
  • Free comprehensive health insurance for you and your children
  • Parent Care Program: receive one additional month of leave on top of the legal parental leave
  • Free mental health and coaching services through our partner Moka.care
  • For caregivers and workers with disabilities, a package including an adaptation of the remote policy, extra days off for medical reasons, and psychological support
  • Work from EU countries and the UK for up to 10 days per year, thanks to our flexibility days policy
  • Up to 14 days of RTT
  • A subsidy from the work council to refund part of the membership to a sport club or a creative class
  • Lunch voucher with Swile card
  • Fulltime
Read More
Arrow Right

Senior Solutions Engineer – Big Data & Data Infrastructure

This is a great opportunity to be part of one of the fastest-growing infrastruct...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
vastdata.com Logo
VAST Data
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2–4 years in software / solution or infrastructure engineering
  • 2–4 years focused on building / maintaining large-scale data pipelines / storage & database solutions
  • Proficiency in Trino, Spark (Structured Streaming & batch) and solid working knowledge of Apache Kafka
  • Coding background in Python (must-have)
  • Deep understanding of data storage architectures including SQL, NoSQL, and HDFS
  • Solid grasp of DevOps practices, including containerization (Docker), orchestration (Kubernetes), and infrastructure provisioning (Terraform)
  • Experience with distributed systems, stream processing, and event-driven architecture
  • Hands-on familiarity with benchmarking and performance profiling for storage systems, databases, and analytics engines
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Build distributed data pipelines using technologies like Kafka, Spark (batch & streaming), Python, Trino, Airflow, and S3-compatible data lakes
  • Design, deploy, and troubleshoot hybrid cloud/on-prem environments using Terraform, Docker, Kubernetes, and CI/CD automation tools
  • Implement event-driven and serverless workflows
  • Create technical guides, architecture docs, and demo pipelines
  • Integrate data validation, observability tools, and governance directly into the pipeline lifecycle
  • Own end-to-end platform lifecycle: ingestion → transformation → storage (Parquet/ORC on S3) → compute layer (Trino/Spark)
  • Benchmark and tune storage backends (S3/NFS/SMB) and compute layers for throughput, latency, and scalability using production datasets
  • Work cross-functionally with R&D to push performance limits across interactive, streaming, and ML-ready analytics workloads
  • Operate and debug object store–backed data lake infrastructure
Read More
Arrow Right

Senior Data Engineer

We are seeking a Senior Data Engineer to support a high-impact Salesforce initia...
Location
Location
United States , Philadelphia
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7–9 years of experience in data engineering / data integration projects
  • Strong hands-on experience with: Spark / PySpark (required)
  • Python for ETL development (required)
  • Databricks (required)
  • Experience building data pipelines in AWS environments
  • Knowledge of data migration and integration frameworks
  • Experience with Terraform (infrastructure as code)
  • Familiarity with CI/CD tools (Concourse, GitHub Actions)
  • Experience working in Agile/Scrum environments
  • Strong troubleshooting and problem-solving skills
Job Responsibility
Job Responsibility
  • Develop ETL pipelines to migrate and integrate data into Databricks
  • Support Salesforce to Databricks data migration and integration
  • Design, build, test, and maintain data pipelines on AWS + Databricks
  • Translate business requirements into scalable data engineering solutions
  • Contribute to data warehousing components for Salesforce Phase 2
  • Utilize Terraform for infrastructure provisioning and management
  • Implement and maintain CI/CD pipelines (Concourse or GitHub Actions)
  • Perform testing, QA, and documentation for all data engineering work
  • Provide post-deployment support and troubleshooting
  • Identify and mitigate risks, including single points of failure (SPOF)
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
  • Fulltime
Read More
Arrow Right

Senior Data Engineer

We are seeking a highly technical, forward-thinking Senior Data Engineer / MLOps...
Location
Location
Canada , Vancouver
Salary
Salary:
46.22 - 61.63 USD / Hour
https://www.randstad.com Logo
Randstad
Expiration Date
July 27, 2026
Flip Icon
Requirements
Requirements
  • 4+ years of hands-on experience building, scaling, and maintaining production-grade MLOps pipelines using the Azure data ecosystem
  • 3+ years of proven success building AI workflows specifically utilizing Databricks, Unity Catalog (for governance), and MLflow (for model tracking)
  • 3+ years of documented experience in core Azure infrastructure management, networking boundaries, and secure enterprise provisioning
  • Expert-level proficiency in SQL, Spark SQL, Python, and PySpark data manipulation scripts
  • Proficient with Terraform (IaC), Apache Airflow, Azure Data Factory, Azure Functions, Snowflake, and Fabric OneLake environments
  • Demonstrated capability in active crisis management, handling customer escalations, and troubleshooting distributed runtime system failures
  • Bachelor’s degree in Computer Science, Software Engineering, or an equivalent technical field
Job Responsibility
Job Responsibility
  • Define the enterprise standard architecture for MLOps, focusing on infrastructure scaling, automated continuous training (CT), and deployment observability
  • Consolidate and simplify disparate machine learning workflows across varied global data science teams into a unified platform
  • Build and scale robust ML/AI orchestration pipelines utilizing Databricks, Unity Catalog, and MLflow for model tracking, lineage tracking, and governance
  • Architect and manage secure enterprise cloud environments natively within Azure using Terraform for Infrastructure as Code (IaC)
  • Automate the provisioning of complex network configurations, cloud resources, IAM security privileges, and containerized configurations
  • Monitor cloud environment footprint performance, guaranteeing high availability, structural reliability, and cost optimization
  • Oversee and manage large-scale production deployments of batch and real-time machine learning models
  • Standardize the continuous integration and continuous delivery (CI/CD) pipelines utilizing Azure DevOps, Jenkins, or GitLab
  • Implement containerization and deployment orchestration frameworks across critical corporate data domains
  • Design and implement advanced telemetry, monitoring metrics, and proactive alerting frameworks for distributed cloud infrastructure and data apps
What we offer
What we offer
  • Strategic Architectural Influence: Build and define the net-new global standard for machine learning infrastructure for a premium corporate brand
  • Advanced Tech Spectrum: Work natively with cutting-edge data tech: Unity Catalog, serverless Azure frameworks, and modern IaC toolsets
  • Elite Collaboration Hub: Form part of an energetic, values-driven onsite workspace in Vancouver that fosters innovation and entrepreneurial spirit
  • Long-Term Program Depth: Secure an initial 6-month contract with highly probable rolling extensions as the global platform scales
  • Fulltime
Read More
Arrow Right

Data Engineer

ASML Customer Support (CS) Diagnostics is at the core of ASML’s ambition to sign...
Location
Location
China , Shanghai
Salary
Salary:
Not provided
asml.com Logo
ASML
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's degree in Data Science, Computer Science, Engineering, Applied Mathematics, or a related field
  • 5+ years of relevant experience in data science, data engineering, or advanced analytics roles
  • Strong proficiency in Python and experience with analytical and ML libraries
  • Scripting skill such as PERL, Bash, Power Shell
  • Proven experience developing and deploying machine learning / deep learning models in production environments
  • Strong experience with cloud-based data platforms (Azure preferred), including Databricks, Spark, SQL / Kusto
  • Experience with SQL ETL processes
  • Solid understanding of statistics, data analysis, SPC/FDC concepts, and analytical problem solving
  • Experience working with large-scale, high-frequency data streams
Job Responsibility
Job Responsibility
  • Design, develop, deploy, and maintain machine learning and deep learning models for Predictive Maintenance (PdM), Fault Detection & Classification, and root-cause identification and observability improvement
  • Own the end-to-end model lifecycle
  • Continuously improve model performance based on field feedback, diagnostic outcomes, and new data availability
  • Design and implement scalable, cloud-native data pipelines to ingest, transform, and provision large volumes of structured and unstructured machine data
  • Work with platforms such as Azure, Databricks, Spark, and Kusto to ensure reliable, performant, and secure data access
  • Ensure data quality, traceability, and reproducibility for downstream analytics and AI applications
  • Enable early access to data through proof-of-concept pipelines
  • Improve observability through machine data by identifying gaps, defining required signals, and translating diagnostic needs into data and model requirements
  • Identify structural improvements in diagnostic services
  • Define and follow standards, policies, and protocols for data, models, and analytics solutions
  • Fulltime
Read More
Arrow Right

Senior Data Engineer

This role is categorized as hybrid. This means the successful candidate is expec...
Location
Location
United States , Austin
Salary
Salary:
125000.00 - 173150.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Software Engineering, Information Systems, or related field, or equivalent practical experience
  • 6 - 8+ years of professional experience in software engineering and/or data engineering, with a strong track record of delivering production systems
  • Strong proficiency in Java and object-oriented design, with experience applying design patterns and clean architecture principles
  • Hands-on experience building Quarkus and Spring Boot applications, including configuration management, dependency injection, and integration with external services
  • Demonstrated experience designing and consuming REST APIs and building microservices architectures, including service contracts, versioning, and backward compatibility
  • Strong knowledge of event-driven architectures and real-time data processing using Kafka or Azure Event Hub (topics, partitions, consumer groups, schema evolution)
  • Deep experience with relational databases, especially PostgreSQL, including schema design, performance tuning, query optimization, and monitoring
  • Hands-on experience with Azure cloud services, especially AKS, networking (ingress, load balancers), identity, and managed data/services
  • Experience implementing and maintaining CI/CD pipelines using GitHub Actions/Workflows, including build, test, quality gates, and deployment automation
  • Solid Infrastructure-as-Code experience with Terraform, including modules, environment strategy, state management, and authoring Datadog monitors via code
Job Responsibility
Job Responsibility
  • Own the end-to-end design, development, and operation of scalable data engineering pipelines and backend services using Java, Quarkus, Spring Boot ensuring reliability, observability, and maintainability
  • Lead the design and implementation of cron-based and event-driven orchestration services that retrieve and process data from multiple enterprise systems via REST APIs and messaging platforms
  • Architect and implement real-time data processing solutions using Kafka and Azure Event Hub, including schema design, consumer group strategy, and resiliency patterns
  • Design and optimize relational data models and database solutions using PostgreSQL and other relational data stores, including indexing strategies, query optimization, and performance tuning at scale
  • Drive the deployment, scaling, and lifecycle management of services on Azure Kubernetes Service (AKS), including workload identity, networking, and security configuration
  • Define and implement CI/CD pipelines using GitHub Actions/Workflows, and manage automated, GitOps-based deployments using ArgoCD across multiple environments
  • Lead infrastructure automation using Terraform, establishing reusable modules, environment standards, and best practices for cloud resource provisioning and governance, including Datadog monitor creation and management
  • Design and implement end-to-end observability using Prometheus, Datadog, and related tooling, including metrics, logs, traces, dashboards, and alerting with clear SLOs/SLIs
  • Build and maintain data processing workflows using Databricks and distributed data frameworks, including batch and streaming jobs, job orchestration, and cost-optimized compute
  • Collaborate closely with product, architecture, and cross-functional engineering teams to refine requirements, define technical roadmaps, and translate business outcomes into robust technical designs
What we offer
What we offer
  • medical
  • dental
  • vision
  • Health Savings Account
  • Flexible Spending Accounts
  • retirement savings plan
  • sickness and accident benefits
  • life insurance
  • paid vacation & holidays
  • tuition assistance programs
  • Fulltime
Read More
Arrow Right

Data Engineer

The primary responsibility of this role is to design, build, and maintain the da...
Location
Location
Croatia , Zagreb
Salary
Salary:
Not provided
museumofillusions.com Logo
Museum of Illusions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 2 years of professional experience in data engineering or a closely related role
  • Familiarity with Power BI or comparable business intelligence tools to support data visualization and reporting needs
  • Familiarity with programming languages like Python for scripting, automating tasks, managing data pipelines, and troubleshooting issues
  • Understanding of Microsoft Azure cloud or comparable cloud data platforms and its data management services
  • Experience in managing and administering data management tools and infrastructure, including user provisioning, access control, monitoring system performance, and troubleshooting technical issues
  • Ability to schedule data refreshes, pipeline executions, and other routine tasks to ensure data accuracy and timeliness. Experience with scripting or workflow automation tools is a plus
  • Experience setting up and managing data quality checks and alerts to identify and address potential issues within data pipelines or data storage
  • Understanding of data security best practices and implementing access controls to safeguard sensitive data within the data warehouse solution
  • Familiarity with data warehouse technologies, including data storage formats, data loading procedures, and querying capabilities
  • Knowledge of data integration tools and how to extract, transform, and load data from various sources into the data warehouse
Job Responsibility
Job Responsibility
  • Using Azure tools and services to design and scale cloud-based data solutions
  • Collaborating with business teams to deliver clean and reliable reports in Power BI
  • Working with Python to connect to APIs and pull in external data
  • Assuring operational efficiency of MoI data warehouse solution
  • Ensuring system uptime and performance for data storage and access
  • Monitoring data pipelines for errors and ensuring smooth data flow
  • Scheduling data refreshes and updates within the data warehouse
  • Performing routine data quality checks and addressing any data quality issues
  • System administration for data management tools and infrastructure
  • User provisioning and access management within the data platform
What we offer
What we offer
  • Full-time salaried position with a bonus structure
  • Supplementary and additional health insurance
  • A young, vibrant and ambitious team to work with
  • A fun, exciting and industry-leading concept to manage and develop in Zagreb
  • Reimbursement of travel expenses and provision of the warm meal
  • Fulltime
Read More
Arrow Right