CrawlJobs Logo

Infrastructure Kubernetes Specialist

https://www.inetum.com Logo

Inetum

Location Icon

Location:
Portugal , Lisbon

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

As an Infrastructure Kubernetes Specialist, you will define and implement the containerized platform in collaboration with various technical departments (Infrastructure, Production, Security). You will also establish platform standards and support project teams in migrating their applications to the new environment.

Job Responsibility:

  • Design and implement Kubernetes clusters (sizing, high availability, automated installation, IAM, etc.)
  • Manage SDN and CNI technologies (e.g., Calico, Cilium)
  • Implement backup solutions (e.g., Velero)
  • Handle persistent volume management (e.g., CEPH, Portworx)
  • Integrate CI/CD tools (e.g., Kustomize, ArgoCD)
  • Collaborate cross-functionally to define platform standards
  • Support project teams in container migration efforts
  • Participate in strategic planning and technical evaluations

Requirements:

  • Proven expertise in Kubernetes infrastructure
  • Experience with CI/CD tools and container orchestration
  • Proficiency in Python development
  • Familiarity with Agile methodologies
  • Strong communication skills in English (minimum B2 level)
  • Autonomous and proactive mindset
  • Knowledge of Microsoft Azure is a plus

Nice to have:

Knowledge of Microsoft Azure

Additional Information:

Job Posted:
October 07, 2025

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Infrastructure Kubernetes Specialist

Founding Infrastructure Engineer

As the first dedicated Infrastructure Engineer at Reducto, you will influence ev...
Location
Location
United States , San Francisco
Salary
Salary:
150000.00 - 300000.00 USD / Year
reducto.ai Logo
Reducto
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have 5+ years of hands-on experience in building or supporting production-grade infrastructure and reliability processes for high-throughput systems
  • Are comfortable with Python or similar languages
  • Exceptional at working across cloud platforms, container orchestration (e.g., Kubernetes), networking, and storage technologies
  • Build your own tools on the fly to diagnose, experiment, and address reliability problems
  • Bring a quantitative, hands-on approach to system operations, automation, and continuous improvement
  • Are your own worst critic—have an extremely high bar for quality and always aim for robust solutions rather than quick fixes
Job Responsibility
Job Responsibility
  • Designing, building, and maintaining highly available, scalable infrastructure to support intensive AI/ML workloads and real-time model deployments
  • Implementing robust monitoring, alerting, and observability systems to ensure system health, performance, and uptime across cloud and on-prem environments
  • Debugging, optimizing, and automating infrastructure for fast iteration and rapid deployment cycles, focusing on both reliability and developer velocity
  • Proactively identifying, investigating, and resolving incidents to minimize downtime and maintain world-class service levels for enterprise customers
  • Collaborating closely with engineers, ML specialists, and founders to shape product, infrastructure, and security strategies
What we offer
What we offer
  • Unlimited PTO
  • Free lunch daily at the office
  • Reimbursed Transportation
  • Generous health insurance covering medical, dental, and vision
  • Health and Wellness Budget up to $150/mo reimbursement
  • Parental Leave
  • Fulltime
Read More
Arrow Right

Senior Technical Operations Specialist

We're on the hunt for a talented and proactive individual to join our team, some...
Location
Location
Poland , Gdańsk
Salary
Salary:
Not provided
navblue.aero Logo
NAVBLUE Limited
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Successful completion of a post-secondary degree or diploma in computer science or technology (or equivalent)
  • Five or more years of proven experience in a technical role supporting multiple systems
  • Experience using scripting languages to perform and automate tasks
  • Using Infrastructure as Code and Configuration as Code such as Terraform and Ansible
  • Experience deploying production services into cloud environments
  • AWS Cloud Practitioner
  • AWS SysOps Administrator or Solutions Architect Associate preferred
  • Linux Foundation Certified Systems Administrator (LFCS) or similar is a bonus
  • Solid knowledge of Operating Systems & ability to perform troubleshooting required
  • Proven track record building and maintaining infrastructure in cloud environments
Job Responsibility
Job Responsibility
  • Ensuring availability across numerous services, whether they are custom software, commercial software, or free and open source solutions
  • Monitoring system and application performance, and logs
  • Creating and testing backup and recovery procedures
  • Responding to alerts and incidents when they occur
  • Investigating and finding solutions to operational issues at the infrastructure, network, os and application levels
  • Escalating issues to vendors or partners when appropriate
  • Follow and improve the best practices and standards that help us keep services safe, secure, and reliable
  • Improve or create our best practices to ensure the smooth operation of services and execution of procedures
  • Develop and improve SOPs for the maintenance of our services and their underlying systems
  • Develop and improve Infrastructure as Code (IaC) and Configuration as Code (CaC) used to maintain services and systems
What we offer
What we offer
  • Stable employment based on a full-time job contract
  • Flexible working hours and work-from-home opportunities (3 days in office)
  • International working environment in a dynamic company
  • Access to the latest knowledge and technologies enabling professional development
  • Training and development possibilities
  • Participating in international projects and international trips
  • Competitive salary dependent on experience and qualifications
  • Private medical coverage for you and your family
  • Sport card
  • Life insurance for you and your family
  • Fulltime
Read More
Arrow Right
New

IT Infrastructure Specialist

Join our international IT Infrastructure Team and help support the technology th...
Location
Location
Austria , Innsbruck
Salary
Salary:
Not provided
Logifuture
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum of 2 years in IT Support or Infrastructure
  • A degree or A-Level qualification in an IT-related field (or equivalent professional experience)
  • Proficiency in written and verbal English
  • Solid experience with Microsoft Windows Server 2022 and various Linux distributions (Ubuntu, Debian, RedHat/CentOS, Oracle)
  • Strong understanding of networking, IP addressing, and troubleshooting
  • experience managing firewall policies (Juniper, Checkpoint, Cisco)
  • Knowledge of load balancer concepts, ideally with hands-on experience in F5 or HAProxy
  • Familiarity with hypervisors such as VMWare, OpenStack, and Proxmox, alongside a cloud-native mindset using Kubernetes
  • Experience with Dell server hardware and a conceptual understanding of Storage Area Networks (Brocade/Broadcom)
  • Exceptional ability to troubleshoot and resolve complex technical issues
Job Responsibility
Job Responsibility
  • Responsible for maintaining a healthy infrastructure of networks, datacenters, servers, and services
  • Assisting with rollouts for business-critical network infrastructure, products, and services
  • Ensure the integrity and security of data in accordance with best practices and business requirements for regulatory, security and privacy compliance
  • Produce and maintain documentation for IT procedures
  • Report to IT Infrastructure Team Leader
  • Provide, as part of a rota, on-call production support for IT Operations (appropriate compensation)
What we offer
What we offer
  • Private health insurance
  • Bi-Monthly company wide social and team building activities
  • Hybrid & Remote work arrangements
  • Flexible working hours
  • Daily paid meal
  • Wellbeing day
  • Training and Development opportunities
  • Fulltime
Read More
Arrow Right

Infrastructure Engineer

We are working with a Global Professional Services client as they look to add to...
Location
Location
United Kingdom , Manchester
Salary
Salary:
55000.00 - 60000.00 GBP / Year
eutopiaonline.com Logo
Eutopia
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven hands-on commercial experience with Kubernetes in a production environment (preferably Red Hat OpenShift)
  • Experience in automating infrastructure processes using scripting and Infrastructure as Code (IaC) tools
  • Experience building and managing CI/CD pipelines (ie Azure DevOps, ArgoCD)
  • Expertise in Microsoft Azure cloud services and solutions
  • Expertise in the VMware Cloud Foundation suite
  • Experience with automation tools such as Terraform or Ansible
  • Kubernetes certification, such as Red Hat Certified Specialist in OpenShift Administration (RHCSA) or an equivalent (ie CKA/CKAD)
  • Certification in relation to on prem solutions – this may be VMware Certified Professional (VCP), RedHat Certified System Administrator (RHSA), or similar
  • Cloud certification(s) ie Azure Administrator Associate, Azure DevOps Engineer Expert, Azure Security Engineer Associate etc
  • Must live within easy commute of central Manchester
Job Responsibility
Job Responsibility
  • Work with both on prem and cloud infrastructure
  • Design, implement and manage solutions
What we offer
What we offer
  • Annual bonus
  • Healthcare
  • Continual professional development opportunities
  • Supportive environment
  • Fulltime
Read More
Arrow Right

AI Platform Site Reliability Engineering Specialist

The AI Platform Site Reliability Engineering Specialist will operate and maintai...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science or related field, or equivalent job experience
  • 5 years of production experience in SRE / Infrastructure / ops for large-scale systems
  • Strong programming/scripting skills (Python, Go, Java, or equivalent)
  • Deep experience with containerization (Docker), orchestration (Kubernetes, etc.)
  • Infrastructure-as-code (Terraform, Helm, CloudFormation, Ansible, etc.)
  • Familiarity with GPU / AI compute clusters, high-performance data storage, and distributed architectures
  • Experience with monitoring / observability / logging / alerting tools (Prometheus, Grafana, ELK / EFK, Datadog, etc.)
  • Networking and systems engineering knowledge (TCP/IP, DNS, routing, load balancing, distributed storage)
  • Solid experience in capacity planning, performance tuning, scaling, and incident response
  • Demonstrated ability to lead RCAs, deploy fixes, and drive reliability improvements
Job Responsibility
Job Responsibility
  • Operate, monitor, and maintain the infrastructure supporting GenAI applications ( training, inference, feature store, data ingestion, model serving)
  • Design and build automation for core platform capabilities, reducing manual toil
  • Develop and maintain infrastructure-as-code (IaC) for provisioning and managing compute, storage, network, GPU clusters, Kubernetes / container orchestration, etc.
  • Establish, monitor and enforce SLOs/SLIs/LSAs, error budgets, alerting, and dashboards
  • Lead incident response, root cause analysis (RCA), postmortems, and systemic remediation
  • Perform capacity planning, scaling strategies, workload scheduling and resource forecasting
  • Optimize cost vs. performance trade-offs in large-scale compute environments
  • Harden systems for security, compliance, auditability, and data governance
  • Collaborate across teams (cloud engineers, data engineers, infrastructure, security) to ensure safe deployment, rollout, rollback, and integration of new systems
  • Define disaster recover (DR) strategies, back/restore practices, fault tolerance mechanisms
Read More
Arrow Right

Forward Deployed Engineer, Infrastructure Specialist

This role offers a unique opportunity to shape how enterprises harness the power...
Location
Location
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with and enjoy working directly with customers
  • Experience deploying enterprise software in private/hybrid cloud environments
  • Proven experience administering production Kubernetes clusters and expertise with Helm
  • Familiarity with DevOps practices, CI/CD pipelines, and tools like Git for version control
  • Strong expertise in cloud infrastructure (Azure, AWS, GCP), networking, and virtualization
  • Excel in fast-paced environments and can execute while priorities and objectives are a moving target
Job Responsibility
Job Responsibility
  • Lead end-to-end deployment of North in private cloud and on-premises environments, including planning, configuration, testing, and rollout
  • Partner with enterprise IT teams to assess infrastructure, security requirements, and data management practices
  • Experiment at a high velocity and with a high level of quality to engage our customers and ultimately deliver solutions that exceed their expectations
  • Design and implement deployment strategies tailored to client needs, ensuring compliance with data privacy and security standards
  • Troubleshoot and resolve deployment-related technical issues, providing timely solutions to minimize downtime
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Forward Deployed Engineer, Infrastructure Specialist

This role offers a unique opportunity to shape how enterprises harness the power...
Location
Location
United Kingdom
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with and enjoy working directly with customers
  • Experience deploying enterprise software in private/hybrid cloud environments
  • Proven experience administering production Kubernetes clusters and expertise with Helm
  • Familiarity with DevOps practices, CI/CD pipelines, and tools like Git for version control
  • Strong expertise in cloud infrastructure (Azure, AWS, GCP), networking, and virtualization
  • Excel in fast-paced environments and can execute while priorities and objectives are a moving target
Job Responsibility
Job Responsibility
  • Lead end-to-end deployment of North in private cloud and on-premises environments, including planning, configuration, testing, and rollout
  • Partner with enterprise IT teams to assess infrastructure, security requirements, and data management practices
  • Experiment at a high velocity and with a high level of quality to engage our customers and ultimately deliver solutions that exceed their expectations
  • Design and implement deployment strategies tailored to client needs, ensuring compliance with data privacy and security standards
  • Troubleshoot and resolve deployment-related technical issues, providing timely solutions to minimize downtime
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right

Forward Deployed Engineer, Infrastructure Specialist

This role offers a unique opportunity to shape how enterprises harness the power...
Location
Location
Canada , Toronto; Vancouver; Montreal; Ottawa
Salary
Salary:
Not provided
cohere.com Logo
Cohere
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with and enjoy working directly with customers
  • Experience deploying enterprise software in private/hybrid cloud environments
  • Proven experience administering production Kubernetes clusters and expertise with Helm
  • Familiarity with DevOps practices, CI/CD pipelines, and tools like Git for version control
  • Strong expertise in cloud infrastructure (Azure, AWS, GCP), networking, and virtualization
  • Excel in fast-paced environments and can execute while priorities and objectives are a moving target
Job Responsibility
Job Responsibility
  • Lead end-to-end deployment of North in private cloud and on-premises environments, including planning, configuration, testing, and rollout
  • Partner with enterprise IT teams to assess infrastructure, security requirements, and data management practices
  • Experiment at a high velocity and with a high level of quality to engage our customers and ultimately deliver solutions that exceed their expectations
  • Design and implement deployment strategies tailored to client needs, ensuring compliance with data privacy and security standards
  • Troubleshoot and resolve deployment-related technical issues, providing timely solutions to minimize downtime
What we offer
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
  • Fulltime
Read More
Arrow Right