CrawlJobs Logo

Production Engineer, Storage

crusoe.ai Logo

Crusoe

Location Icon

Location:
United States , San Francisco, Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

166000.00 - 201000.00 USD / Year

Job Description:

At Crusoe Energy Systems, our Site Reliability Engineering (SRE) team plays a mission-critical role in maintaining the performance and reliability of our AI-optimized cloud infrastructure. The Storage-focused SRE role is responsible for ensuring the availability, performance, and scalability of Crusoe’s cloud storage products and services, which power compute-intensive, latency-sensitive workloads for AI and HPC use cases. This role directly supports our vertically integrated, sustainable cloud platform by building and optimizing distributed, fault-tolerant storage systems at scale.

Job Responsibility:

  • Build automation and self-healing tools to monitor and maintain Crusoe’s distributed cloud storage infrastructure
  • Drive reliability initiatives focused on data replication, encryption, backup and restore strategies, and robust failover mechanisms
  • Help implement and maintain high-performance NVMe- and SSD-backed volumes that support large-scale AI compute clusters
  • Support user-facing storage services with a focus on availability, performance tuning, and adherence to error budgets
  • Investigate and resolve storage-related incidents using deep telemetry, logs, and performance profiling
  • Partner with hardware and kernel teams to diagnose low-level I/O issues and optimize I/O paths, cache policies, and file systems
  • Contribute to the architecture of fault-tolerant, scalable storage backends tailored for AI-first cloud environments

Requirements:

  • 5+ years of professional experience in SRE, systems, or storage engineering
  • Hands-on experience with distributed storage systems (e.g., Ceph, GlusterFS, OpenEBS) and deep understanding of object, block, and file storage paradigms
  • Proficiency in a programming language such as Python, Go, Java, or C
  • Experience with Infrastructure as Code and deployment tooling such as Terraform, Ansible, or Puppet
  • Deep knowledge of Linux internals with a focus on I/O subsystems, memory management, and storage scheduling
  • Familiarity with storage protocols like NFS, SMB, iSCSI, or NVMe-oF
  • Strong experience working with containerized workloads and orchestration platforms (e.g., Kubernetes, Docker)
  • Excellent incident response, troubleshooting, and documentation practices
  • Experience with building and operating managed services at scale such as object, file and block storage (AWS, GCP, Azure)
  • Excellent communication skills
  • Must be able to pass a background check
  • Embody the Company values

Nice to have:

  • Contributions to open-source storage projects or the Linux storage stack
  • Experience with hybrid storage models across on-prem and cloud environments
  • Familiarity with high-throughput network topologies for storage backplanes (e.g., RoCE, RDMA, InfiniBand)
What we offer:
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Subscription to the Calm app
  • MetLife Legal
  • Company paid commuter benefit
  • $300 per month

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Production Engineer, Storage

Platform Engineer – Storage Product Platform Development

Senior level network and system expert to define and lead Enterprise storage pro...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of total experience
  • Prior experience of bringing up a Hardware platform
  • Prior experience of performance tuning disk drives, device drivers & memory management for scale
  • Designing software systems running on multiple platform types and protocols like SNMP & iSCSI
  • Must have very strong system programming background with C/C++/Golang for large enterprise class software
  • Must have proficiency with data structures, algorithms and multi-threaded programming
  • Must have in-depth knowledge of OS internals, networking, and storage concepts
  • Strong analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Design and develop products that require in-depth knowledge of Device-driver development and Linux internals
  • Design, specify, and lead the implementation of the platform features of the storage array
  • Work with cross organizational interactions: Hardware, Firmware, System management, Network teams, Architects
  • Design enhancements, updates, and programming changes for portions and subsystems of systems software, including IO path, storage management, databases and cloud-related application
  • Write and execute complete testing plans, protocols, and documentation
  • Identify, debug and create solutions for issues with code and integration into system architecture
  • Collaborate and communicate with management, internal, and external partners regarding software systems design status, project progress, and issue resolution
  • Provide guidance and mentoring to less-experienced staff members
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Platform Engineer – Storage Product Platform Development

Senior level network and system expert to define and lead Enterprise storage pro...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Information Systems, or equivalent
  • Typically 8+ years of total experience
  • Prior experience of bringing up a Hardware platform
  • Prior experience of performance tuning disk drives, device drivers & memory management for scale
  • Designing software systems running on multiple platform types and protocols like SNMP & iSCSI
  • Must have very strong system programming background with C/C++/Golang for large enterprise class software
  • Must have proficiency with data structures, algorithms and multi-threaded programming
  • Must have in-depth knowledge of OS internals, networking, and storage concepts
  • Strong analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Define and lead Enterprise storage product efforts
  • Design and develop products that require in-depth knowledge of Device-driver development and Linux internals
  • Design, specify, and lead the implementation of the platform features of the storage array
  • Work with cross organizational interactions: Hardware, Firmware, System management, Network teams, Architects
  • Design enhancements, updates, and programming changes for portions and subsystems of systems software, including IO path, storage management, databases and cloud-related application
  • Write and execute complete testing plans, protocols, and documentation
  • Provide guidance and mentoring to less-experienced staff members
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime
Read More
Arrow Right

Principal Software Engineer, Cloud Storage Engineering

We are working on a greenfield storage platform built on top of Kubernetes and P...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors, Masters, or PhD in Computer science in a related technical field or similar experience
  • 10+ years of experience in software development and architecture
  • Expert-level experience with one or more prominent languages such as Java, Kotlin, or Go is crucial.
  • An expert in Kubernetes stateful sets and/or databases such as PostgreSQL.
  • Passion for collaborating with and mentoring junior members of the team
  • A real appetite for helping others learn and grow
  • Considers the customer impact when making technical decisions
Job Responsibility
Job Responsibility
  • Regularly tackle the largest and most complex problems on the team, from technical design to launch
  • Deliver solutions that are used by other teams and products
  • Determine plans-of-attack on large projects
  • Routinely tackle complex architecture challenges and apply architectural standards and start using them on new projects
  • Lead code reviews & documentation as well as take on complex bug fixes, especially on high-risk problems
  • Set the standard for thorough, meaningful code reviews
  • Partner across engineering teams to take on company-wide initiatives spanning multiple projects
  • Transfer your depth of knowledge from your current language to excel as a Software Engineer
  • Mentor more junior members
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Senior Platform Engineer, Storage

We’re looking for a Senior Platform Engineer specializing in storage services to...
Location
Location
Ireland , Dublin
Salary
Salary:
102000.00 - 124000.00 EUR / Year
getdbt.com Logo
dbt Labs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant experience designing and operating relational data and object storage platforms in production
  • Hands-on experience with one or more cloud providers (AWS, Azure, GCP) and declarative Infrastructure as Code (Terraform preferred)
  • Programming/scripting ability in Python, Go, Rust or Bash
  • Excellent communication skills and experience working asynchronously on a fully remote, distributed team
Job Responsibility
Job Responsibility
  • Design, operate, and scale storage based infrastructure systems across multiple tenancy models (single vs. multi-tenant) and public clouds (AWS, Azure, and GCP)
  • Deepen our team’s expertise in one more areas including: relational databases, search, caching, queuing, and streaming - helping strengthen platform scalability, security, and developer experience
  • Partner with Architecture, Release Engineering, Network, Compute, and Security teams to provide a seamless platform for application teams
  • Leverage tools and languages such as Terraform, Kubernetes, Helm, Argo CD, Python, SQL, Go, Bash, and Datadog
  • Participate in a balanced on-call rotation in an environment that values continuous improvement, helping to improve reliability and reduce operational toil
What we offer
What we offer
  • Equity Stake
  • Unlimited PTO
  • Excellent healthcare coverage
  • Paid parental leave
  • Wellness and home office stipends
  • Fulltime
Read More
Arrow Right

Assistant Product Compliance Engineer Intern

Join the Amazon Development Center in Taiwan to gain insight into the technical ...
Location
Location
Taiwan , Taipei
Salary
Salary:
Not provided
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Are enrolled in or have completed a Bachelor's degree in engineering or equivalent
  • Speak, write, and read fluently in Mandarin
  • Can work as full time intern during 1 year
Job Responsibility
Job Responsibility
  • Technical Analysis: Conduct research on regulatory and compliance issues and explain their impact on product design
  • Collaboration: Work with engineering teams to evaluate and ensure product designs are compliant from the start
  • Test Oversight: Oversee the preparation, delivery, and storage of compliance samples, ensuring that samples are set up under the correct conditions for accurate results
  • Documentation: Organize, file and review key product certification documents to ensure that every detail meets the technical requirements
  • Status Management: Track product certification status, ensure timely updates and flag any discrepancies
  • Certification Process: Process all product certification applications, ensuring thorough technical review prior to documentation and certificate retrieval
  • Billing & Purchasing: Process complex billing invoices and oversee the technical aspects of purchase order requests
  • System management : Maintain certification management system in pristine condition, including technical reviews, storage specifications and timely submissions
  • Ad Hoc Assignments: Under the direction of the Supervisor, be ready to respond to technical challenges as they arise
  • Fulltime
Read More
Arrow Right

Storage engineer

HPE Operations is our innovative IT services organization. It provides the exper...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Engineering (or Equivalent)
  • Minimum 6-7 years of experience in Storage & Backup administration support
  • Strong knowledge & relevant certification on the latest track like Zerto/Vault, Commvault
  • Knowledge on server/operating system technology and good understanding of other domains such as storage/SAN/networking/database
  • Flexible to work in 24X7 support environment
  • ITIL certification is an added advantage
Job Responsibility
Job Responsibility
  • Resolve customer’s issues via the telephone, email or remote sessions
  • Reproduce issues in-house and responding back in a timely manner
  • Regular follow ups with customers with recommendations, updates and action plans
  • Identify and escalate issues in a timely manner to vendor according to Standard Operating Procedures
  • Leverage internal technical expertise, including peers, mentors, knowledge base, community forums and other internal tools, to provide the most effective solutions to customer issues
  • Collaborate with other CoE/HW teams in diagnosing and isolating the cause of complex issues
  • Provide consulting support in his/her area of expertise
  • Maintain quality on case documentation, SLA timeframes and operational metrics
  • Performs within the Productivity Measure of the team (scorecard)
  • Incident Management: Resolve single and cross technology incidents independently
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Software Product Engineering Manager

Applies advanced subject matter knowledge to manage staff activities in solving ...
Location
Location
United States , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master's degree in computer science, Information Systems, or equivalent
  • Typically, 5 or more years of related work experience, including minimum 2 years of people management experience
  • Experience leading or managing technical teams, including software development and security
  • Strong Understanding of multiple software systems design tools and languages, including testing methodology and test plans
  • Experience and technical background related to IT Security and engineering environments, including servers, networks, storage, and cloud systems
  • Understanding of Secure application and Secure software Development Lifecycle (SDLC)
  • Understanding of Agile methodologies
  • Advanced English Level
  • Experience working in a hardware and software environment
  • Python experience is desired
Job Responsibility
Job Responsibility
  • Provides direct and ongoing leadership for a team of individual contributors designing and developing security tests, enhancements and updates
  • Manages headcount, deliverables, schedules, and costs for multiple ongoing projects
  • Communicates project status and escalates issues to direct managers, program managers, and internal and external development partners
  • Manages relationships with outsourced partners and suppliers, global security teams and R&D team
  • Proactively identifies opportunities for process improvement and cost reductions opportunities
  • Provides people-care management for assigned team members, including hiring, setting and monitoring of annual performance plans, coaching, and career development
  • Manage laboratory resources, systems and infrastructure to support lab activities
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Scality Storage Engineer

This role provides operate, admin and consulting support on storage infrastructu...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Technical knowledge on Object, File & Block storage with cloud data management solutions – Installing, Configuring & Troubleshooting of at least 2 of the storage skills Scality, DDN Storage, GPFS, Luster
  • Good Knowledge on Parallel/Distributed file system
  • Firmware and management experience on above Storage
  • Basic Operating Systems Knowledge – Install, configure, administration and troubleshoot RHEL/SUSE (as Bare-Metal OS & as VMs on Hypervisors)
  • Knowledge on SAN, NAS technologies (Ethernet / iSCSI, FC, FCOE)
  • Performed routine Performance Analysis, Capacity analysis, security audit analysis reports to customer for necessary planned changes
  • Troubleshooting performance related issues on HW and Operating system
  • Working knowledge on AIX , Redhat , CentOS , SUSE Linux and HP UX
  • Should be ready to work in 24x7 rotational shifts and on weekends
  • Good written and verbal communication skills (Mandatory)
Job Responsibility
Job Responsibility
  • Resolve customer’s issues via the telephone, email or remote sessions
  • Reproduce issues in-house and responding back in a timely manner
  • Regular follow ups with customers with recommendations, updates and action plans
  • Identify and escalate issues in a timely manner to vendor according to Standard Operating Procedures
  • Leverage internal technical expertise, including peers, mentors, knowledge base, community forums and other internal tools, to provide the most effective solutions to customer issues
  • Collaborate with other CoE/HW teams in diagnosing and isolating the cause of complex issues
  • Provide consulting support in his/her area of expertise
  • Maintain quality on case documentation, SLA timeframes and operational metrics
  • Performs within the Productivity Measure of the team (scorecard)
  • Incident Management: Resolve single and cross technology incidents independently. Lead the team members to resolve complex or cross technology incidents
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right