GCP DevOps HPC Engineer

Spain · 70000.00 - 80000.00 EUR / Year · Job Posted January 20, 2026

Job Responsibility

  • Lead end-to-end migrations of SLURM-based HPC clusters from on-prem to GCP
  • Design, build, and operate secure, scalable HPC architectures in the cloud
  • Optimise SLURM scheduling, workload performance, and resource utilisation
  • Automate cluster deployment and operations using Terraform, Ansible, Python, and Bash
  • Manage HPC software stacks using Spack
  • Deploy and support parallel workloads using MPI, OpenMP, and related frameworks
  • Troubleshoot performance issues and drive continuous optimisation
  • Collaborate with engineering teams and stakeholders in a fully remote environment

Requirements

  • 5+ years’ experience in HPC environments (SLURM, MPI, parallel workloads)
  • Strong Linux systems expertise in performance-critical environments
  • Hands-on experience running or migrating HPC workloads in the cloud (GCP preferred)
  • Solid experience with Terraform and Ansible
  • Strong scripting skills (Python, Bash)
  • Deep understanding of GCP services (GCE, VPC, Cloud Storage)

Nice to have

  • GCP certifications (DevOps / Cloud Engineer)
  • Experience with Preemptible VMs and cloud cost-optimisation strategies
  • HPC performance profiling and debugging tools
  • Containers in HPC (Singularity, Docker)
  • Exposure to Spark or big data tooling

Similar Jobs for GCP DevOps HPC Engineer

8 matching positions

Developer Experience Engineer

We are looking for a Developer Experience Engineer to enhance developer producti...
Location: United States, San Jose
Salary: Not provided
Company: Etched
Expiration Date: Until further notice
Requirements
  • Strong Python skills for automation, scripting, and infrastructure development
  • Experience with Slurm job scheduling in an HPC or hybrid environment
  • Hands-on experience with observability and monitoring tools like Prometheus, Grafana, and OpenTelemetry
  • Expertise with Docker and Kubernetes, including Helm charts and cluster management
  • Proficiency in modern CI/CD pipeline management with tools like GitHub Actions, Jenkins, or Buildkite
  • Experience with infrastructure-as-code tools like Terraform or Ansible
  • Knowledge of cloud infrastructure, compute, and storage optimization on AWS or GCP
Job Responsibility
  • Develop and maintain automation tools to streamline development, testing, and deployment workflows
  • Optimize and manage Slurm-based job scheduling for AI workloads, simulation, and chip design workflows
  • Build observability solutions using Grafana, Prometheus, and OpenTelemetry for monitoring pipelines, infrastructure, and compute clusters
  • Manage and optimize containerized environments using Docker and Kubernetes to enhance scalability and reproducibility
  • Enhance build, test, and deployment pipelines with CI/CD tools like GitHub Actions, Jenkins, Buildkite, or Bazel
  • Develop caching and artifact management systems to reduce build times and improve dependency resolution
  • Integrate and manage cloud resources (AWS, GCP) for scaling compute, storage, and hybrid workloads
  • Support security and compliance efforts including secrets management and access control
  • Document and share best practices for efficient developer tooling and workflows
What we offer
  • Full medical, dental, and vision packages, with generous premium coverage
  • Housing subsidy of $2,000/month for those living within walking distance of the office
  • Daily lunch and dinner in our office
  • Relocation support for those moving to West San Jose
  • Unlimited compute budget subject to ROI justification
Job type: Full-time

Hardware Development Infrastructure Engineer

We’re looking for a Hardware Development Infrastructure Engineer to build and ru...
Location: United States, San Francisco
Salary: 260000.00 - 335000.00 USD / Year
Company: OpenAI
Expiration Date: Until further notice
Requirements
  • Familiarity with chip development workflows and at least one deep EDA domain (e.g., DV, PD, emulation, or formal verification)
  • Strong infrastructure fundamentals, including cloud platforms, networking, security, performance, and automation
  • Experience operating cloud environments (Azure preferred; AWS, GCP, or OCI acceptable) with strong infrastructure-as-code practices (e.g., Terraform, Bicep; configuration management tools a plus)
  • Strong programming skills (Python preferred) and solid software engineering and scripting practices
  • Experience building and operating CI/CD systems (e.g., Jenkins, Buildkite, GitHub Actions), including testing and release workflows
  • Database experience (e.g., Postgres or MySQL), including schema design, migrations, indexing, and operational safety
  • Clear communicator with strong judgment—able to explain tradeoffs, propose pragmatic solutions, and articulate a realistic vision for scalable infrastructure
Job Responsibility
  • Partner with hardware teams on workflows and tooling: Embed with teams across DV, PD, emulation, formal, and software to understand development flows, identify failure modes, and deliver tooling (CLIs, services, APIs) that reduces manual work and accelerates iteration
  • Build and operate regression systems at scale: Own regressions end-to-end—from definition and scheduling to execution, results ingestion, triage, and reporting—while improving throughput, reproducibility, and flake reduction
  • Own CI/CD for infrastructure and tooling: Design and operate pipelines for infrastructure-as-code, services, images, and cluster configuration changes, including testing, gated deploys, staged rollouts, and safe rollback
  • Run cloud and HPC platforms: Design, provision, and operate cloud infrastructure (Azure preferred) and HPC/HTC clusters (e.g., Slurm), tuning scheduling policies, autoscaling, node lifecycles, and cost-performance tradeoffs
  • Build data foundations and visibility: Develop ETL pipelines to ingest metrics, logs, and results; operate databases for workflow metadata and outcomes; and build dashboards that surface efficiency, utilization, and reliability trends
  • Drive operational excellence: Establish monitoring and alerting, lead incident response and postmortems, maintain runbooks, and produce clear, durable documentation
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
Job type: Full-time

Senior Manager, AI Infrastructure and Operations

The Sr. Manager/Staff Engineer, AI Infrastructure & MLOps Engineering is a senio...
Location: Japan, Tokyo
Salary: Not provided
Company: Pfizer
Expiration Date: Until further notice
Requirements
  • 8+ years of hands-on software engineering experience in cloud infrastructure, DevOps, and MLOps
  • Deep expertise in Python, Kubernetes, Terraform, Helm, and CI/CD pipeline development
  • Proven experience architecting and operating containerized solutions on AWS, GCP, and Azure
  • Strong knowledge of Infrastructure-as-Code, distributed systems, and production system reliability
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field
Job Responsibility
  • Design, implement, and own large-scale cloud-based HPC and MLOps platforms supporting AI model training, genomic sequencing, and precision medicine
  • Architect multi-environment clusters (AWS, GCP, Azure), enabling GPU/FPGA workloads and advanced observability
  • Lead the development of developer and cloud platforms, including internal engineering accelerators and reusable toolsets
  • Design, implement, and manage unified platform catalogs using Backstage, enhancing developer experience and application metadata management
  • Develop custom plugins and APIs for Backstage to support internal engineering workflows and documentation
  • Build and maintain Python-based automation frameworks, CI/CD pipelines, and Infrastructure-as-Code (Terraform, Helm, Pulumi, AWS CDK)
  • Operationalize containerized solutions using Docker and Kubernetes, integrating MLflow, Kubeflow, and other orchestration platforms
  • Implement robust automation for provisioning, configuring, and managing cloud resources across multiple environments
  • Lead the implementation of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and advanced observability (Prometheus, Grafana, PagerDuty)
  • Develop and maintain APIs and services for model management, feature stores, and inference pipelines
Job type: Full-time

IT Training Lead

The IT Training Lead will drive technology learning and user adoption across the...
Location: United States, Delray Beach
Salary: Not provided
Company: Robert Half
Expiration Date: Until further notice
Requirements
  • Experience in IT training, instructional design, technical enablement, or learning and development
  • Strong knowledge of Microsoft 365
  • Excellent communication, facilitation, and content development skills
  • Ability to translate technical concepts into practical, user-friendly training.
Job Responsibility
  • Design, develop, and deliver IT training programs in instructor-led, virtual, and self-paced formats
  • Take the lead on the Microsoft Copilot and AI training strategy, including onboarding, advanced use cases, responsible AI usage, and ongoing enablement
  • Partner with IT leadership to support new technology rollouts, system upgrades, and digital transformation initiatives
  • Create and maintain training content, including videos, guides, tutorials, and job aids
  • Identify skill gaps and develop targeted learning solutions to improve adoption and productivity
  • Gather feedback and measure training effectiveness to continuously improve programs.

K Kitchen Representative

The position includes, but is not limited to, the following essential job duties...
Location: United States, New Albany
Salary: Not provided
Company: Circle K
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity; keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan

K Kitchen Representative

Location: United States, Decatur
Salary: Not provided
Company: Circle K
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity; keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan

Restaurant Assistant Manager

This position assists the Restaurant Manager (RM) with daily operations of the r...
Location: United States, Holly Springs
Salary: Not provided
Company: Circle K
Expiration Date: Until further notice
Requirements
  • Full time required; availability during all hours of operation and at least one hour pre-opening and post-closing required
  • Valid state Driver's License required
  • Excellent communication skills
  • Motivates, coaches, and leads team members
  • Acts with integrity; keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Ability to gain control during stressful situations
Job Responsibility
  • Assists the Restaurant Manager with daily operations of the restaurant and supervises the team in their absence
  • Leads and coaches Restaurant Team Members and partners with the management team to maintain the Company and Brand operational standards
  • Provides excellent guest service in a fast and friendly manner; coaches and corrects team
  • Conducts second interviews for team members and shift leads
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Assigns shift duties to team members and follows up to ensure completion
  • Directs team and ensures all food items are prepared and served in accordance with all Brand, Company, and health department regulations
  • Coaches team members to follow guidelines for food preparation and production management
  • Cascades relevant information to team members and assists with new product training
What we offer
  • Unlimited tip pooling
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts
  • Short-Term Disability
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
Job type: Full-time

Plant Operator - Crushing and Screen

Are you an experienced and ticketed Machine Operator looking for stable, high-ho...
Location: Australia, Petrie
Salary: 42.00 - 52.00 AUD / Hour
Company: Randstad
Expiration Date: July 09, 2026
Requirements
  • Proven experience working in a quarry, concrete recycling, or heavy industrial yard
  • Current tickets for Front-End Loader (LL) and Excavator (LE)
  • Truck License: Heavy Rigid (HR) or higher is highly regarded
  • Reliability with strong work ethic and punctuality
  • Own reliable vehicle and current driver's license
Job Responsibility
  • Safe and efficient operation of heavy machinery in a fast-paced recycling and quarry environment
  • Operating Front-End Loaders
  • Operating Excavators utilized as material handlers
  • Operating Moxy (Articulated Dump Trucks) and other yard machinery as required
  • Assisting with daily machinery pre-starts, basic maintenance, and ensuring the yard runs smoothly
  • Adhering strictly to site health and safety protocols
What we offer
  • Top Rates: $42.00 to $52.00 per hour + overtime penalties
  • Big Hours: Consistent 40 to 55-hour work weeks
  • Career Progression: Pathway from casual to permanent full-time employment within 3-6 months
  • Local Work: Convenient Brisbane Northside location (Petrie)
  • Immediate Start
Job type: Full-time