CrawlJobs Logo

Senior Linux Systems Administrator/ SRE

keepit.com Logo

Keepit

Location Icon

Location:
Denmark , Copenhagen

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

You will play a key role in ensuring the reliability, scalability, and performance of our systems. The ideal candidate will bring extensive knowledge of Linux administration, IaC using Ansible, DevOps workflows, Networking and Observability using mainly Grafana LGTM stack. You should be passionate about automation, security, and upholding best practices in reliability engineering.

Job Responsibility:

  • Help ensure the continued health and performance of our systems
  • Work with the extension and development of automation tooling
  • Support hardware and software systems
  • Maintain documentation for daily operations
  • Deploy and operate Linux in a data centre environment

Requirements:

  • Excellent Linux CLI knowledge
  • Extensive Grafana (LGTM) stack and Alloy knowledge
  • Solid Debian/Ubuntu experience
  • Excellent troubleshooting skills and good familiarity with the command line
  • Proficiency in both spoken and written English
  • Infrastructure as Code – mainly Ansible
  • Ability to work in a structured and collaborative manner

Nice to have:

  • Experience with Postgres and Pacemaker/Corosync
  • Familiarity with ISO27001, ISAE3402
  • DevOps CI/CD experience
  • Python
What we offer:
  • Competitive salary
  • Pension scheme
  • A modern, energetic global work environment
  • Flexible work-life balance supported by a hybrid working model
  • Regular team-building activities
  • Opportunities for professional development and career advancement

Additional Information:

Job Posted:
January 18, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Linux Systems Administrator/ SRE

Senior DevOps Engineer

We’re looking for a Senior DevOps Engineer who is passionate about automation, o...
Location
Location
United States , Nashville; McLean; Tampa
Salary
Salary:
Not provided
theinclab.com Logo
TheIncLab
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in DevOps, SRE, or Infrastructure Engineering roles
  • Hands-on experience and proficiency with AWS services (EC2, S3, RDS, VPC, IAM, etc.) and infrastructure automation (Terraform, Ansible, or similar)
  • Experience deploying and managing infrastructure using Terraform and/or Ansible
  • Solid knowledge of Linux system administration
  • Strong skills in Windows system administration environments
  • Proven experience managing and automating GitLab, including CI/CD pipelines
  • Proficiency in at least one programming or scripting language (Python, Bash, etc.)
  • Experience implementing monitoring, logging, and alerting solutions (CloudWatch, Datadog, CloudTrail)
  • Solid understanding of networking, security best practices, and high-availability system design
  • Familiarity with version control systems (Git) and GitLab workflows
Job Responsibility
Job Responsibility
  • Build, maintain, and improve CI/CD pipelines using GitLab CI/CD or similar tools
  • Automate infrastructure provisioning, deployment, and maintenance using Terraform, Ansible, or related technologies
  • Collaborate with developers and QA to create reliable deployment paths from local dev to production
  • Implement infrastructure-as-code practices across environments (e.g., AWS, Kubernetes, bare-metal)
  • Design and implement monitoring, alerting, and observability systems to maintain high availability and performance
  • Respond to incidents, lead root cause analysis, and implement preventive measures
  • Establish and evolve SLOs/SLIs to ensure measurable system reliability
  • Participate in on-call rotation and help build automation to reduce the need for human intervention
  • Drive capacity planning, performance tuning, and cost optimization initiatives
  • Administer Linux (Ubuntu/Debian) and Windows-based infrastructure
What we offer
What we offer
  • Hybrid and flexible work schedules
  • Professional development programs
  • Training and certification reimbursement
  • Extended and floating holiday schedule
  • Paid time off and Paid volunteer time
  • Health and Wellness Benefits include options for Medical, Dental, and Vision insurance along with access to Wellness, Mental Health, and Employee Assistance Programs
  • 100% Company Paid Benefits that include STD, LTD, and Basic Life insurance
  • 401(k) Plan Options with employer matching
  • Incentive bonuses for eligible clearances, performance, and employee referrals
  • A company culture that values your individual strengths, career goals, and contributions to the team.
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer

We are looking for an experienced engineer with strong Linux and system-level ex...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
bumble.com Logo
Bumble Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Linux system administration or SRE roles
  • Proven experience managing large-scale infrastructure environments
  • Strong troubleshooting and performance tuning skills at the infrastructure level
  • Basic scripting/automation experience (Bash, Python)
  • Familiarity with IaC tools (e.g., Ansible, Puppet)
  • Knowledge of distributed systems and container orchestration (Kafka, Kubernetes, etc.)
  • Excellent communication and problem-solving skills
Job Responsibility
Job Responsibility
  • Operate autonomously in complex production environments
  • Independently troubleshoot incidents
  • Lead and support post-incident service recovery
  • Drive improvements to overall system stability, performance, and observability
  • Manage and optimize large-scale environments (5,000+ hosts) running technologies like Kafka, Redis, and Kubernetes
What we offer
What we offer
  • Competitive compensation, equity, and world-class benefits
  • Fulltime
Read More
Arrow Right

Senior Site Reliability Engineer

We are looking for an experienced engineer with strong Linux and system-level ex...
Location
Location
United States , Austin
Salary
Salary:
185000.00 - 225000.00 USD / Year
bumble.com Logo
Bumble Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in Linux system administration or SRE roles
  • Proven experience managing large-scale infrastructure environments
  • Experience with cloud infrastructure (Google Cloud)
  • Strong troubleshooting and performance tuning skills at the infrastructure level
  • Basic scripting/automation experience (Bash, Python)
  • Familiarity with IaC tools (e.g., Ansible, Puppet)
  • Knowledge of distributed systems and container orchestration (Kafka, Kubernetes, etc.)
  • Excellent communication and problem-solving skills
Job Responsibility
Job Responsibility
  • Operate autonomously in complex production environments
  • Independently troubleshoot incidents
  • Lead and support post-incident service recovery
  • Drive improvements to overall system stability, performance, and observability
  • Manage and optimize large-scale environments (5,000+ hosts) running technologies like Kafka, Redis, and Kubernetes
  • On-call rotation: one week every 4–5 weeks (24x7 coverage)
What we offer
What we offer
  • Medical
  • Dental
  • Vision
  • 401(k) match
  • Unlimited Paid Time Off Policy
  • Maven Fertility: $10,000 lifetime benefit for fertility, adoption, abortion care, and more
  • 26 Weeks Parental Leave: For both primary and secondary caregivers
  • Family & Compassionate Leave: Inclusive of domestic violence recovery
  • Unlimited Paid Time Off
  • Company-wide Week Off: Annual collective rest for the entire company
  • Fulltime
Read More
Arrow Right

Manager – AI Infrastructure Operations

As a senior leader on our team, you will be responsible for the overall health, ...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Technical Leadership: 15+ years of experience in managing and operating complex compute infrastructure, with a minimum of 5 years in a senior or leadership role
  • SRE and Operations Expertise: A strong background as a Site Reliability Engineer or in a similar role, with a proven track record of managing large-scale, mission-critical systems
  • Deep Systems Knowledge: Expert-level proficiency in Linux-based systems, Python scripting, and command-line tools for system administration and automation
  • Troubleshooting Acumen: Exceptional ability to lead and resolve complex technical challenges under pressure, especially during customer or engineering escalations
  • On-Call Leadership: Proven experience managing an on-call rotation and responding to 24/7 technical incidents
  • Communication: Excellent communication and leadership skills, with the ability to effectively mentor junior team members and communicate complex technical concepts to a diverse audience
Job Responsibility
Job Responsibility
  • Lead and Manage Infrastructure: Oversee the operation and reliability of our advanced AI compute infrastructure, defining strategy and setting a high bar for operational excellence
  • Drive Technical Ownership: Act as the primary owner for critical infrastructure systems, ensuring uptime, performance, and capacity are consistently optimized
  • Handle High-Stakes Escalations: Serve as the final point of contact for complex customer and engineering escalations, providing expert-level, hands-on support and driving issues to a rapid and complete resolution
  • Champion Reliability and Automation: Leverage your SRE experience to develop and implement robust monitoring, alerting, and automation solutions, reducing manual toil and preventing future issues
  • Collaborate and Strategize: Partner with cross-functional teams, including engineering and product, to align on long-term infrastructure strategy and support future AI initiatives
  • Innovate and Improve: Continuously evaluate and improve existing processes, tools, and technologies to enhance system reliability and operational efficiency
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right

Senior Platform Engineer

We’re looking for a Senior Engineer to join our Core Platform Service team, some...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
justeattakeaway.com Logo
Just Eat Takeaway.com
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Solid understanding of Linux systems administration, networking (DNS, TLS/SSL, HTTP), and container fundamentals
  • Experience in designing multi-cluster EKS architectures or hybrid Kubernetes setups
  • Familiarity with RBAC design, OIDC authentication, and Vault secret injection into workloads
  • Experience designing distributed, event-driven systems and microservice architectures
  • Familiarity with SRE practices, monitoring, automation, release engineering, and incident response
  • Awareness of cloud security best practices and common threat mitigations
  • Proficiency with Terraform and Helm for infrastructure and application automation
  • Scripting or programming experience (Go preferred, Python or Bash also acceptable)
Job Responsibility
Job Responsibility
  • Design, deploy, and manage production workloads on AWS (Mainly Compute - EKS, EC2 , Lambdas)
  • Lead and operate EKS clusters across multiple environments, ensuring scalability, performance, and reliability
  • Implement and maintain automation, monitoring, and alerting using tools like Terraform, Grafana, Prometheus, and Datadog
  • Manage Linux-based infrastructure, including performance tuning, debugging, and kernel-level analysis
  • Roll out and standardize ArgoCD and Argo Workflows as part of our GitOps and automation strategy
  • Collaborate with development teams to design and operate microservices and event-driven architectures at scale
  • Troubleshoot incidents, drive root-cause analysis, and contribute to postmortems
  • Design, deploy, and manage HashiCorp Vault, implementing secret management, access policies, and integrations with workloads on EKS
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer

To reinvent an industry, you need not only outstanding products but also world-c...
Location
Location
Hungary , Budapest
Salary
Salary:
Not provided
formlabs.com Logo
Formlabs GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS degree in Computer Science, Engineering, or equivalent practical experience
  • 5+ years of relevant experience in DevOps, Platform, or SRE roles supporting production SaaS systems
  • Hands-on expertise with: Linux systems administration and troubleshooting
  • Kubernetes
  • Cloud platforms (GCP, AWS)
  • Infrastructure as Code (Terraform, Helm, ArgoCD, etc.)
  • CI/CD tools (Jenkins, GitHub Actions, etc.)
  • Scripting in Bash, Python, or similar
  • Monitoring and observability systems (Prometheus, Grafana, Loki, Sentry, etc.)
  • Strong understanding of modern infrastructure security principles
Job Responsibility
Job Responsibility
  • Design, build, and operate reliable, secure, and scalable infrastructure to support the development, testing, and operation of Formlabs' software and services
  • Drive improvements in developer experience through better CI/CD pipelines, observability, and platform tooling
  • Take ownership of production systems with a proactive mindset towards monitoring, performance, and continuous improvement
  • Lead and contribute to architectural decisions that impact our infrastructure and how engineering teams interact with it
  • Mentor other engineers through technical guidance, code reviews, and knowledge sharing
  • Build and maintain Kubernetes-based platforms, cloud infrastructure (GCP, AWS), and automation tooling
  • Ensure that monitoring, logging, and alerting systems are robust, actionable, and scalable
  • Work cross-functionally with Software, Test, and Data teams to align platform improvements with organizational needs
What we offer
What we offer
  • Shares in the company in the form of RSUs
  • Catered lunch at the office 3 days per week
  • Private health insurance with Medicover (Blue package + hospital coverage)
  • A monthly or quarterly public transportation pass for Budapest
  • Free beverages and snacks at the office
  • All You Can Move sports pass with 7000 HUF monthly allowance
  • Free 3D prints
  • An inclusive, dog-friendly office with diverse and inspiring colleagues
  • Development opportunities both in-house and off-site
Read More
Arrow Right

Senior DevOps / Systems Engineer

We are seeking a talented DevOps / Systems Engineer to join our team. You'll own...
Location
Location
Mexico
Salary
Salary:
Not provided
techholding.co Logo
Tech Holding
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of DevOps, Systems, or SRE experience
  • Strong experience with AWS (EC2, S3, RDS, VPC, IAM at minimum)
  • Experience with Docker and container orchestration
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, or similar)
  • Experience with PostgreSQL administration (backups, monitoring, basic tuning)
  • Experience with Linux server administration
  • Understanding of networking and security fundamentals
  • Experience with infrastructure as code (Terraform, Pulumi, CloudFormation, or CDK)
  • Comfortable communicating in English (written and verbal async-first team)
  • Available to overlap 4+ hours with US Eastern time
Job Responsibility
Job Responsibility
  • Infrastructure (40%) Design and maintain AWS infrastructure
  • Manage containerized deployments (Docker, potentially ECS or Kubernetes)
  • Set up and maintain PostgreSQL and Redis
  • Configure networking, security groups, and access controls
  • Implement infrastructure as code (Terraform or Pulumi)
  • Manage secrets and environment configuration
  • CI/CD & Automation (30%) Build and maintain GitHub Actions pipelines
  • Automate testing, building, and deployment
  • Implement staging and production environments
  • Set up automated database migrations
What we offer
What we offer
  • Fully remote engagement across MX
  • Opportunity to work on high-impact client systems with real operational ownership
Read More
Arrow Right

Senior Site Reliability Engineer

SREs at Optimizely are focused on making us the most reliable, performant, and t...
Location
Location
Vietnam , Hanoi
Salary
Salary:
Not provided
optimizely.com Logo
Optimizely
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in Linux Systems Administration in cloud or virtualized environments
  • Proficiency in infrastructure-as-code tools such as Terraform
  • Hands-on experience with configuration management tools like Ansible or SaltStack
  • Skilled in scripting and automation using Python and Bash
  • Experience deploying and maintaining services in public cloud environments (Azure, AWS, or GCP)
  • Solid understanding of observability tooling, especially Datadog, ELK Stack (Elasticsearch, Logstash, Kibana), or similar
  • Experience building and maintaining CI/CD pipelines (e.g., GitHub Actions, Azure DevOps, Octopus)
  • Familiarity with Kubernetes and Docker
  • production experience is a strong plus
  • Experience operating and scaling distributed systems across multiple regions
Job Responsibility
Job Responsibility
  • Champion a Site Reliability Engineering culture across the organization by sharing best practices, tools, documentation, and code
  • Identify and automate manual operational tasks using scripting, infrastructure-as-code, and CI/CD pipelines
  • Build and maintain observability (monitoring, logging, tracing) for all production systems to ensure reliability, availability, and performance
  • Proactively monitor alerts across all platforms and coordinate with SRE, Operations, Engineering, and Support teams to ensure quick detection and resolution of incidents—minimizing MTTA/MTTR
  • Lead and manage on-call rotations, driving a blameless incident management and postmortem culture
  • Collaborate with development teams to define and implement SLOs, SLIs, and error budgets
  • Ensure uptime SLAs are met through robust automation, testing, monitoring, and operational best practices
  • Create and maintain runbooks, playbooks, and system documentation to ensure operational readiness and knowledge sharing
Read More
Arrow Right