Junior Data Infrastructure Engineer

United Kingdom, Brighton · Job Posted December 06, 2025

Job Description

As part of the Data Infrastructure team, you will support mission-critical big data platforms to ensure they are fully performant, reliable, available and secure. We call it data infrastructure engineering; it is also known as DataOps, Database Administration or SRE. The role is a mixture of tooling development and operational support for our platforms, two aspects that go hand in hand. It requires attention to detail and curiosity about how the systems work under the hood, and it gives you a wide base of skills, from low-level system tuning to general coding.

We manage four main storage platforms: Apache Solr (~2.2 PB), Apache HBase (~450 TB), PostgreSQL (~15 TB) and Kafka (~60 TB). These platforms are all open source, written in Java, Scala or C, and we maintain in-house builds and patches of them. We use a variety of open-source and in-house developed tooling to manage these services, mostly written in Rust and Python, which run primarily on hundreds of servers in multiple data centres and in the cloud.

We maintain a balance between project work and operational/ad-hoc work for all members of the team, whether they are senior or recent graduates, and your day-to-day work will be a mix of these.

Job Responsibility

  • Supporting mission-critical big data platforms to ensure they are fully performant, reliable, available and secure
  • Development of tooling and operational support for our platforms
  • Help with staging support
  • Join the team supporting the production systems
  • Take a full part in the life of the team
  • Start designing the infrastructure we run

Requirements

  • An interest in how computer infrastructure actually works, and a passion for learning
  • Interest in, and ideally production experience of, running storage systems, e.g. as part of a self-hosted service, a home lab or academic studies
  • Experience with Linux systems administration, including troubleshooting
  • Fluency with one or more scripting languages, ideally Bash or Python
  • Experience helping your peers
  • Pride in the quality of your work

Nice to have

  • Development experience in Python, Java, Rust, C/C++ or Golang
  • Upstream open-source contributions
  • Academic research in scalability, distributed systems or storage infrastructure
  • Kubernetes experience, ideally running or writing Operators
  • Experience with Docker and with CI/CD pipelines

Similar Jobs for Junior Data Infrastructure Engineer

8 matching positions

Senior AWS Data Engineer / Data Platform Engineer

We are seeking a highly experienced Senior AWS Data Engineer to design, build, a...
Location: United Arab Emirates, Dubai
Salary: Not provided
Company: NorthBay
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in data engineering and data platform development
  • Strong hands-on experience with: AWS Glue, Amazon EMR (Spark), AWS Lambda, Apache Airflow (MWAA), Amazon EC2, Amazon CloudWatch, Amazon Redshift, Amazon DynamoDB, AWS DataZone
Job Responsibility
  • Design, develop, and optimize scalable data pipelines using AWS native services
  • Lead the implementation of batch and near-real-time data processing solutions
  • Architect and manage data ingestion, transformation, and storage layers
  • Build and maintain ETL/ELT workflows using AWS Glue and Apache Spark on EMR
  • Orchestrate complex data workflows using Apache Airflow (MWAA)
  • Develop and manage serverless data processing using AWS Lambda
  • Design and optimize data warehouses using Amazon Redshift
  • Implement and manage NoSQL data models using Amazon DynamoDB
  • Utilize AWS DataZone for data governance, cataloging, and access management
  • Monitor, log, and troubleshoot data pipelines using Amazon CloudWatch
  • Fulltime

Junior Data Engineer

Are you looking for a new challenge? Fancy helping us shape the future of motor ...
Location: Spain, Madrid
Salary: Not provided
Company: Prima
Expiration Date: Until further notice
Requirements
  • Strong academic background in STEM disciplines
  • Ability to break down problems, learn quickly, and test different approaches
  • Programming foundations (language is not important. We value clean, maintainable code above all)
  • Curiosity for data and software — whether from coursework, projects, or personal initiatives
  • Interest in building systems end-to-end: from ingesting and transforming data, to creating models, to deploying services
  • Motivation and eagerness to learn from more experienced teammates.
Job Responsibility
  • Shaping the architecture of data products designed for data analytics and data science specifically focusing on use cases like forecasting, feature engineering, customer behaviour, and integration of new data sources
  • Support the data transformation by setting up best practices in areas like Data modelling, performance optimisation, Data Governance etc, ensuring that the data used within Prima is consistent, available and reliable
  • Build reusable technology that enables teams to ingest, store, transform, and serve their own data products
  • Engaging with data scientists and machine learning engineers to explore the product landscape and refine data requirements for enhanced data infrastructure.
What we offer
  • Work Your Way: Enjoy full flexibility – work from home, the office or a mix of both. Plus, work from anywhere for up to 30 days a year
  • Grow with us: We may move fast at Prima, but we move together. Get access to learning resources, mentorship and a growth plan tailored to you
  • Thrive and perform: Your best work begins when you feel your best. Enjoy private healthcare, gym discounts, wellbeing programs and mental health support.
  • Fulltime

Junior Infrastructure Engineer

The Junior Infrastructure Engineer supports the Infrastructure and Asset Managem...
Location: Australia, Parramatta
Salary: Not provided
Company: Transdev
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Electrical, Civil, Mechatronic, or Systems Engineering
  • Exposure or experience in rail infrastructure maintenance (signalling, power, communications, electrical, track, or civil)
  • Understanding of maintenance processes and engineering change management
  • Familiarity with rail technical standards and specifications
  • Strong analytical, troubleshooting, and problem-solving skills
  • Ability to interpret technical documentation and drawings
  • Good communication and interpersonal skills
  • Proficiency with MS Office and computer-based systems
  • Strong time management and prioritisation abilities
Job Responsibility
  • Comply with all relevant rail, safety, and environmental legislation and GRCLR procedures
  • Investigate asset performance, maintenance needs, and technical changes under the guidance of the Senior Infrastructure Manager
  • Prepare and support engineering change proposals and participate in FRACAS, defect management, and engineering change control processes
  • Manage configuration change activities, including requirements definition, performance specifications, documentation updates (drawings, manuals, specifications), liaison with suppliers, OEMs and contractors, and technical investigations including root cause analysis
  • Maintain and update key systems and records including EAM Hexagon Asset Management System, Integrated Management System (IMS) documentation, Engineering Change Register, FRACAS records, and asset configuration and performance data
  • Support maintenance, incident recovery, and fault-finding activities to ensure safe, efficient, and reliable operation of infrastructure assets
  • Collaborate with maintenance teams, contractors, and other disciplines, manage documentation in line with GRCLR guidelines, and apply sound engineering practices within competency limits
  • Maintain awareness of plant, tools, and equipment security, ensure clear understanding of job scope, and perform other duties as reasonably directed by the Senior Infrastructure Manager
What we offer
  • We support the development, work–life balance, and well-being of our employees
  • We foster a caring company culture that values diversity and enables everyone to thrive
  • We empower our teams to make a positive societal impact by delivering sustainable mobility solutions
  • We offer opportunities to build meaningful career experiences within an international Group rooted in local operations
  • Fulltime

Staff Software Engineer, Data Infrastructure

At Docker, we make app development easier so developers can focus on what matter...
Location: United States, Seattle
Salary: 195400.00 - 275550.00 USD / Year
Company: Docker
Expiration Date: Until further notice
Requirements
  • 8+ years of software engineering experience with 3+ years focused on data engineering and analytics systems
  • Expert-level experience with Snowflake including advanced SQL, performance optimization, and cost management
  • Deep proficiency in DBT for data modeling, transformation, and testing with experience in large-scale implementations
  • Strong expertise with Apache Airflow for complex workflow orchestration and pipeline management
  • Hands-on experience with Sigma or similar modern BI platforms for self-service analytics
  • Extensive AWS experience including data services (S3, Redshift, EMR, Glue, Lambda, Kinesis) and infrastructure management
  • Proficiency in Python, SQL, and other programming languages commonly used in data engineering
  • Experience with infrastructure-as-code, CI/CD practices, and modern DevOps tools
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • Proven track record designing and implementing large-scale distributed data systems
Job Responsibility
  • Define and drive the technical strategy for Docker's data platform architecture, establishing long-term vision for scalable data systems
  • Lead design and implementation of highly scalable data infrastructure leveraging Snowflake, AWS, Airflow, DBT, and Sigma
  • Architect end-to-end data pipelines supporting real-time and batch analytics across Docker's product ecosystem
  • Drive technical decision-making around data platform technologies, architectural patterns, and engineering best practices
  • Establish technical standards for data quality, testing, monitoring, and operational excellence
  • Design and build robust, scalable data systems that process petabytes of data and support millions of user interactions
  • Implement complex data transformations and modeling using DBT for analytics and business intelligence use cases
  • Develop and maintain sophisticated data orchestration workflows using Apache Airflow
  • Optimize Snowflake performance and cost efficiency while ensuring reliability and scalability
  • Build data APIs and services that enable self-service analytics and integration with downstream systems
What we offer
  • Freedom & flexibility: fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup: we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
  • Fulltime

Senior Software Engineer - Data Infrastructure

We build the data and machine learning infrastructure to enable Plaid engineers ...
Location: United States, San Francisco
Salary: 180000.00 - 270000.00 USD / Year
Company: Plaid
Expiration Date: Until further notice
Requirements
  • 5+ years of software engineering experience
  • Extensive hands-on software engineering experience, with a strong track record of delivering successful projects within the Data Infrastructure or Platform domain at similar or larger companies
  • Deep understanding of one of: ML Infrastructure systems, including Feature Stores, Training Infrastructure, Serving Infrastructure, and Model Monitoring OR Data Infrastructure systems, including Data Warehouses, Data Lakehouses, Apache Spark, Streaming Infrastructure, Workflow Orchestration
  • Strong cross-functional collaboration, communication, and project management skills, with proven ability to coordinate effectively
  • Proficiency in coding, testing, and system design, ensuring reliable and scalable solutions
  • Demonstrated leadership abilities, including experience mentoring and guiding junior engineers
Job Responsibility
  • Contribute towards the long-term technical roadmap for data-driven and machine learning iteration at Plaid
  • Leading key data infrastructure projects such as improving ML development golden paths, implementing offline streaming solutions for data freshness, building net new ETL pipeline infrastructure, and evolving data warehouse or data lakehouse capabilities
  • Working with stakeholders in other teams and functions to define technical roadmaps for key backend systems and abstractions across Plaid
  • Debugging, troubleshooting, and reducing operational burden for our Data Platform
  • Growing the team via mentorship and leadership, reviewing technical documents and code changes
What we offer
  • medical, dental, vision, and 401(k)
  • equity and/or commission
  • Fulltime

Junior Data Engineer

As a Junior Data Engineer, you will have the exciting opportunity to work with a...
Location: Poland, Warsaw; Cracow; Wroclaw; Bialystok
Salary: 8400.00 - 15120.00 PLN / Month
Company: Addepto sp. z o.o.
Expiration Date: Until further notice
Requirements
  • At least 1 year of proven commercial experience developing or maintaining Big Data systems
  • Hands-on experience with Big Data technologies, including Databricks, Apache Spark, Airflow, and DBT
  • Strong programming skills in Python: writing clean code, OOP design
  • Experience in designing and implementing data governance and data management processes
  • Experience implementing and deploying solutions in cloud environments (with a preference for Azure)
  • Practical knowledge of DevOps practices, including designing and maintaining CI/CD pipelines for data and ML workflows, and Terraform for Infrastructure as Code
  • Knowledge of how to build and deploy Power BI reports and dashboards for data visualization
  • Excellent understanding of dimensional data and data modeling techniques
  • Excellent communication skills and consulting experience with direct interaction with clients
  • Ability to work independently and take ownership of project deliverables
Job Responsibility
  • Design scalable data processing pipelines for streaming and batch processing using Big Data technologies like Databricks, Airflow and/or Dagster
  • Contribute to the development of CI/CD and MLOps processes
  • Develop applications to aggregate, process, and analyze data from diverse sources
  • Collaborate with the Data Science team on Machine Learning projects, including text/image analysis and predictive model building
  • Develop and organize data transformations using Databricks/DBT and Apache Airflow
  • Translate business requirements into technical solutions and ensure optimal performance and quality
What we offer
  • Work in a supportive team of passionate enthusiasts of AI & Big Data
  • Engage with top-tier global enterprises and cutting-edge startups on international projects
  • Enjoy flexible work arrangements, allowing you to work remotely or from modern offices and coworking spaces
  • Accelerate your professional growth through career paths, knowledge-sharing initiatives, language classes, and sponsored training or conferences, including a partnership with Databricks, which offers industry-leading training materials and certifications
  • Choose your preferred form of cooperation: B2B or a contract of mandate, and make use of 20 fully paid days off
  • Participate in team-building events and utilize the integration budget
  • Celebrate work anniversaries, birthdays, and milestones
  • Access medical and sports packages, eye care, and well-being support services, including psychotherapy and coaching
  • Get full work equipment for optimal productivity, including a laptop and other necessary devices
  • With our backing, you can boost your personal brand by speaking at conferences, writing for our blog, or participating in meetups
  • Fulltime

Junior Research Infrastructure Engineer

We are seeking a Product-Minded Junior Research Infrastructure Engineer to join ...
Location: United States, Sunnyvale
Salary: Not provided
Company: Meshy LLC
Expiration Date: Until further notice
Requirements
  • 2+ years of experience in software engineering, backend development, or distributed systems
  • Strong programming skills in Python (Scala/Java/C++ a plus)
  • Familiarity with distributed frameworks (Spark, Dask, Ray) and cloud platforms (AWS/GCP/Azure)
  • Experience with workflow orchestration tools (Temporal, Celery, or Airflow)
  • Proficiency with Infrastructure as Code (Terraform) and CI/CD tools (GitHub Actions)
  • Experience building web applications or internal tools using React or Next.js
  • A 'product-first' mindset: an interest in how users interact with infrastructure and a desire to build clean, functional interfaces
Job Responsibility
  • Participate in the design and implementation of distributed task orchestration systems using Temporal or Celery
  • Architect pipelines across cloud object storage (S3, GCS), data lakes, and metadata catalogs
  • Implement partitioning, sharding, and caching strategies to ensure data processing pipelines are resilient, highly available, and consistent
  • Design, implement, and maintain distributed ingestion pipelines for structured and unstructured data (images, 3D/2D assets, binaries)
  • Build scalable ETL/ELT workflows to transform, validate, and enrich datasets for AI/ML model training and analytics
  • Support preprocessing of unstructured assets (e.g., images, 3D/2D models, video) for training pipelines, including format conversion, normalization, augmentation, and metadata extraction
  • Implement validation and quality checks to ensure datasets meet ML training requirements
  • Collaborate with ML researchers to quickly adapt pipelines to evolving pretraining and evaluation needs
  • Use infrastructure-as-code (Terraform, Kubernetes, etc.) to manage scalable and reproducible environments
  • Manage data assets using Databricks Asset Bundles (DABs) and build rigorous CI/CD pipelines (GitHub Actions)
What we offer
  • Competitive salary, equity, and benefits package
  • Opportunity to work with a talented and passionate team at the forefront of AI and 3D technology
  • Flexible work environment, with options for remote and on-site work
  • Opportunities for fast professional growth and development
  • An inclusive culture that values creativity, innovation, and collaboration
  • Unlimited, flexible time off
  • Stock options available for core team members
  • 401(k) plan for employees
  • Comprehensive health, dental, and vision insurance
  • The latest and best office equipment
  • Fulltime

IT Systems Engineer | Infrastructure Engineer

We are seeking an Adelaide-based Systems Engineer to take ownership of our core ...
Location: Australia, Adelaide
Salary: Not provided
Company: DyFlex Solutions
Expiration Date: Until further notice
Requirements
  • 4+ years of experience in systems engineering / systems administration or infrastructure engineering
  • Deep expertise in the Microsoft ecosystem, including Windows Server 2022, Entra ID (hybrid), Azure, and Microsoft 365
  • Proven ability to automate processes using PowerShell (advanced scripting) and/or Power Automate
  • Strong background in cybersecurity uplift: patching, hardening, vulnerability remediation, and identity/endpoint security
  • Hands‑on experience with ASD Essential Eight, with exposure to ISO 27001 or SOC 2 considered highly advantageous
  • Experience in firewall administration (e.g., Sophos), routing/switching fundamentals, and secure remote access design
  • Experience supporting or administering Linux (SUSE preferred) within a predominantly Windows environment
  • Demonstrated ability to deliver technical upgrades end‑to‑end with high‑quality documentation and handover
  • Experience producing clear technical diagrams and architectural documentation
  • Strong communication, collaboration, and coaching skills, with the ability to guide junior team members
Job Responsibility
  • Manage and optimise our Microsoft ecosystem, including Windows Server, Active Directory, and Microsoft 365
  • Administer and enhance Microsoft Entra ID in a hybrid environment, including Conditional Access, SSO integrations, and identity security controls
  • Lead our cybersecurity uplift, driving vulnerability remediation, system hardening, Essential Eight maturity, and Microsoft Defender improvements
  • Contribute to the implementation and operationalisation of Microsoft Sentinel, including onboarding data sources and alert tuning
  • Architect, manage, and scale our Azure environment (IaaS/PaaS) to support a rapidly growing national team
  • Act as the final Level 3 escalation point for complex server, identity, networking, and endpoint issues
  • Oversee network integrity and security, including firewall management, site‑to‑site VPNs, remote access VPNs, and uplift of network segmentation
  • Drive infrastructure automation and consistency by developing and maintaining advanced PowerShell scripts and automations
  • Support and enhance our SOE, server build patterns, platform standards, and operational processes
  • Maintain and monitor our mixed environment, including SUSE Linux servers used for internal projects
What we offer
  • A flexible and supportive work environment
  • Competitive remuneration and benefits including novated lease, birthday leave, salary packaging, wellbeing programme, additional purchased leave, and company-provided laptop
  • Comprehensive SAP training and certifications
  • Fulltime