CrawlJobs Logo

Senior Software Engineer - Storage

United States, Aliso Viejo 119800.00 - 234700.00 USD / Year · Job Posted March 19, 2026
Apply Position
Job Link Share

Job Description

Microsoft Silicon and Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. CHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox Live, Skype, OneDrive and the Microsoft Azure platform globally with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide and we are looking for passionate, high-energy engineers to help achieve that mission. As Microsoft's cloud business continues to grow the ability to deploy new offerings and HW infrastructure on time, in high volume with high quality and lowest cost is of paramount importance. To achieve this goal, the Silicon Cloud Hardware Infrastructure Engineering (SCHIE) team is instrumental in defining and delivering measures of success for hardware design, qualification, fleet support, scale, and sustainability related to Microsoft cloud hardware. Azure Memory and Storage Center of Excellence (AMS CoE) is part of the SCHIE organization focusing on Memory and Storage devices going into the Cloud hardware servers. AMS provide memory and storage solutions to Azure, drive memory and storage suppliers to deliver high quality products, meeting our requirements. We are looking for a Senior Software Engineer-Storage to scale Azure’s Fault Self-Healing and Failure Prediction systems including the storage subsystem. You will develop the end-to-end technical design and execution of the fault prevention ecosystem, spanning telemetry, ML models, automation, isolation logic, firmware deployment, and repair workflows, operating at hyperscale across millions of nodes. The role directly impacts customer uptime and fleet availability.

Job Responsibility

  • Design and build best-in-class fleet resiliency systems for storage devices at scale
  • Develop scalable live monitoring capabilities, fault detection and repair solutions
  • Deploy SSD and Storage Accelerator firmware to hyperscale cloud
  • Lead collaboration projects with hardware, firmware, and software teams that fault reduction projects
  • Build automation to drive repair efficiency for storage operations in the production fleet
  • Collaborate with suppliers to design reliable, high performance and quality storage devices
  • Analyze data to identify, prototype, and drive the implementation of technical and process improvements to increase the predictability, agility, and quality of Azure systems
  • Actively support Azure service stakeholders

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Nice to have

  • M.S. Computer or Electrical Engineering
  • 8+ years of SSD firmware engineering development experience
  • 4+ years of NVMe and PCIe experience
  • Expertise in SSD virtualization, reliability, fault analysis, and live‑site operations
  • Ability to analyze storage system solutions and drive towards recommendation based on data and objective reasoning
  • Ability to lead collaborative technical projects from conception to successful implementation
  • Demonstrable organizational, problem solving and prioritization skills
  • Ability to deal with ambiguity, resolve conflicts, prioritize multiple strategic and tactical options and drive issues to closure without compromising on quality

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Software Engineer - Storage

8 matching positions

Senior Software Engineer - Storage

The Windows Servicing & Delivery (WSD) team investigates and remediates security...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years of software engineering with deep expertise in C and C++ for Windows kernel-mode development
  • OR equivalent experience
  • Hands-on experience with Windows storage driver stack: StorPort miniport drivers, storage filter drivers, or file system minifilter drivers — understanding of IRP flow, completion routines, and cancel-safe queue management
  • Solid grounding in Windows kernel fundamentals
  • Demonstrated ability to perform crash dump analysis and live kernel debugging using WinDbg
  • Working knowledge of NTFS on-disk structures: MFT record layout, attribute types, USN journal, and the NTFS log file for crash recovery
  • Familiarity with ReFS (Resilient File System): B+ tree metadata structure, integrity streams, block cloning, and the differences in crash recovery model versus NTFS
  • Experience debugging file system corruption scenarios: cross-linked clusters, orphaned MFT records, directory entry inconsistencies, and reparse point cycles
  • Understanding of Windows file system minifilter architecture: altitude registration, pre/post operation callbacks
  • Hands-on experience with Windows Server Failover Clustering (WSFC): quorum models (Node Majority, Disk Witness, Cloud Witness), cluster network configuration, and the cluster API
Job Responsibility
Job Responsibility
  • Own end-to-end resolution of critical ICMs escalated from top enterprise customers — analyze memory dumps, ETW traces, Storage Spaces logs, and cluster event logs to root-cause failures in S2D, WSFC, CSV, NTFS, and ReFS that cannot be resolved by field support
  • Investigate and fix security vulnerabilities in the Windows storage stack: privilege escalation through NTFS reparse points and junctions, information disclosure via uninitialized kernel pool in file system drivers, and denial-of-service through crafted on-disk structures in ReFS or NTFS
  • Design and implement reliability and correctness fixes in kernel-mode storage miniport drivers (StorPort, NVMe, iSCSI, SMB Direct/RDMA) and file system filter drivers — owning the full fix lifecycle from root cause through regression test to servicing release
  • Work directly with Storage Spaces Direct (S2D): diagnose and fix rebuild, rebalance, and fault-domain logic errors
  • investigate cache tier promotion/demotion bugs
  • resolve pool fragmentation and storage bus layer (SBL) issues in hyper-converged deployments
  • Maintain and harden Windows Server Failover Clustering (WSFC) and Cluster Shared Volumes (CSV): resolve quorum edge cases, CSV ownership transfer failures, cluster validation regressions, and inter-node storage arbitration deadlocks
  • Contribute to the Volume Shadow Copy Service (VSS) and Windows Backup infrastructure: fix provider/requester interaction bugs, VSS writer timeouts in large-scale environments, and shadow copy metadata consistency failures
  • Develop diagnostic tooling and automated regression suites for the storage stack — including kernel debugger extensions (!sdt, !storport analysis), ETW provider instrumentation, and Storage Spaces health model validation
  • Collaborate with MSRC for coordinated disclosure and patch delivery on storage-related CVEs
  • Fulltime
Read More
Arrow Right

Software Engineer / Senior Software Engineer - CoreAI

Azure DevOps is a suite of modern development services that enables software dev...
Location
Location
Czech Republic , Prague
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Solid software development experience
  • Demonstrable experience with C#, C++, Java or any other OOP language
  • Strong analytical skills as well as communication skills both verbal and written
  • Ability to understand unfamiliar code bases, debug client and service side applications (including database stored procedures)
  • Knowledge and experience with Microsoft Azure, AWS or similar cloud computing platforms is preferred
  • Experience with SQL performance tuning (preferably Microsoft SQL Server)
  • Solid understanding of testing principles
  • Ability to prioritize and handle multiple tasks completely and independently and generate clarity in ambiguous situations
  • Troubleshooting skills across network, application, caching, queuing, load-balancing storage and distributed services layers
Job Responsibility
Job Responsibility
  • Design, develop, test and support features, experiences
  • Collaborate on the design and development of features and solutions, contributing to technical direction across business scenarios
  • Support highly available services used by top companies and millions of developers on a daily basis
  • Troubleshooting of complex issues through the entire tech stack including frontend and database layers
  • Participate in on-call rotations with your team. Triage and respond to issues and advocate for opportunities to improve service health
  • Collaborate through pairing and code reviews and contribute to a culture of learning and growth
  • Fulltime
Read More
Arrow Right

Software engineer 2 / Senior Software engineer - Azure Data

Microsoft's Azure Data engineering team is leading the transformation of analyti...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Experience with the Azure stack including Storage, Compute, Networking, Fabric, Purview, Synapse, AKS, DevOps, Data Factory, or Power BI
  • Experience with big data technologies such as Spark, Kafka, Hadoop, or HBase
  • Experience building data lake or data engineering products, tools, or pipelines
  • Familiarity with container-based architectures (Docker, Kubernetes)
  • Ability to debug complex distributed systems on Linux and/or Windows platforms
Job Responsibility
Job Responsibility
  • Write extensible, maintainable code in C#, Java, Scala, or Python for Fabric Materialized Lake View services and HDInsight components
  • Use AI tools and coding best practices across the development lifecycle
  • Design data refresh, scheduling, and query optimisation features with minimal supervision
  • Review code from teammates for correctness, test coverage, security risks, and adherence to team standards
  • Coach junior engineers through code reviews
  • Debug complex issues in distributed systems running on Azure, Linux, and Windows
  • Run live site operations on a rotational, on-call basis
  • Integrate logging and instrumentation to gather telemetry on system health, performance, reliability, and security
  • Work with product managers, technical leads, and partners across geographies to define customer requirements for Materialized Lake View features
  • Fulltime
Read More
Arrow Right

Software Engineer II and Senior Software Engineer

The FIO (Office Files and Identity) Team drives collaboration, identity, cloud f...
Location
Location
United States , Redmond
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, or Rust
  • OR equivalent experience
  • 4+ years industry engineering experience coding in languages including, but not limited to, C, C++, C#, Java or Rust
  • 4+ years industry experience building and shipping production quality, performant and cross-platform applications
  • Experience collaborating cross-team and cross-function to deliver software features or projects
  • Platform-specific experience building Windows, MacOS, iOS or Android applications
  • Experience designing and implementing efficient network communications, including network protocols, performance and reliability tooling, modeling and implementing complex customer scenarios across network services
  • Experience designing and implementing client-side storage stacks, with a focus on correctness, resiliency, performance, and adaptability
  • Understanding of client file system design and APIs, including advanced performance optimizations
  • Experience directly implementing large-scale data pipelines for product telemetry, with ability to evolve system as business and technical needs change
Job Responsibility
Job Responsibility
  • Design and write code
  • Work across organizations and directly with partners both in Office and across Microsoft, including other engineers and product leaders
  • Use data as the basis for decision making
  • Be a steward of products that ship to hundreds of millions of customers around the world, staying connected to customers through data and feedback and being agile and responsive to issues
  • Grow as an engineer in a modern and highly impactful team
  • Fulltime
Read More
Arrow Right

Senior Software Engineer Storage

We are seeking skilled engineers to join our Storage Platform team. We specializ...
Location
Location
United States , Sunnyvale
Salary
Salary:
202000.00 - 224000.00 USD / Year
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or higher in an engineering field (e.g. Computer Science, Computer Engineering, etc.)
  • 6+ years software development experience
  • Experience implementing complex projects with multiple dependencies
  • Experience coding with Go, Java, C++ or Python
  • Experience in at least one scalable storage or database platform
Job Responsibility
Job Responsibility
  • Design, build, and deploy high-performance storage features and frameworks that prioritize reliability and operability
  • Drive efficiency and scale through intelligent design and automation, focusing on observability and proactive monitoring
  • Work cross-functionally with stakeholders to ensure our storage ecosystem integrates seamlessly with diverse backends and provides a superior experience for application engineers
  • Actively participate in design and code reviews, maintaining a high bar for documentation and technical excellence
  • Take full accountability for your work, ensuring every project is backed by rigorous testing and comprehensive monitoring before it’s considered done
What we offer
What we offer
  • Eligible to participate in Uber's bonus program
  • May be offered an equity award & other types of comp
  • All full-time employees are eligible to participate in a 401(k) plan
  • Eligible for various benefits
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Storage

Patreon is looking for a Senior Storage Platform Software Engineer to design and...
Location
Location
United States , San Francisco
Salary
Salary:
200000.00 - 300000.00 USD / Year
patreon.com Logo
Patreon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in backend software engineering
  • Strong experience with Python or a similar language
  • Strong understanding of software design patterns and principles
  • Experience with relational databases (such as PostgreSQL) and NoSQL databases (such as Redis)
  • Experience with container orchestration (such as Kubernetes)
  • Strong communication and collaboration skills
  • Familiarity with continuous integration and deployment (CI/CD) practices
  • Strong understanding of RESTful API design and development
  • Ability to work in a fast-paced environment and deliver high-quality work on time
  • Bachelor’s degree in Computer Science, Computer Engineering, or a related field, or the equivalent
Job Responsibility
Job Responsibility
  • Designing and implementing backend systems that handle millions of requests per day
  • Writing efficient, scalable, and maintainable code
  • Identifying and addressing performance bottlenecks
  • Building our robust APIs that support our web and mobile applications
  • Collaborating with other engineers, product managers, and designers to deliver high-quality features on time
What we offer
What we offer
  • Equity plans
  • Healthcare
  • Flexible time off
  • Company holidays and recharge days
  • Commuter benefits
  • Lifestyle stipends
  • Learning and development stipends
  • Patronage
  • Parental leave
  • 401k plan with matching
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Storage

As a Senior Software Engineer on our storage team, you'll be joining our core en...
Location
Location
United States , San Francisco, Sunnyvale
Salary
Salary:
166000.00 - 201000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on proficiency in modern software development best practices, and practical experience in languages like Go, Java, C/C++, or Rust
  • Extensive experience developing multi-tenant, cloud scale distributed storage infrastructure software and systems
  • Experience contributing to at least one or more of the following storage products: File (e.g., NFS, SMB, Lustre), Object, or Block Storage (e.g., NVMe, iSCSI)
  • A strong background in high performance filesystem based products, VFS and linux filesystems (e.g., ext4, XFS, ZFS)
  • Proficiency working with Linux and its storage subsystems.
  • Knowledge of monitoring tools (Prometheus, Grafana), log analysis, distributed tracing and debugging
Job Responsibility
Job Responsibility
  • Building Our Multi-Petabyte Cloud Storage Platform
  • Building core components of our foundational storage products, purpose built for high performance AI and ML workloads
  • Contributing to distributed file, block and object storage products, with a focus on filesystem based solutions
  • System Design & Architecture
  • Design and implement high-performance, scalable, and resilient storage architectures that are highly extensible
  • Proposing and prototyping novel strategies to scale performance and system throughput for our most demanding customer workloads
  • Building observability, metrics and tooling for our services and fleet
  • High Velocity Problem Solving
  • Troubleshooting and resolving unique and complex distributed systems problems only seen at the scale we operate at
  • Provide ongoing support for production systems, and customer workloads including troubleshooting, performance tuning, and incident response
What we offer
What we offer
  • Industry competitive pay
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Senior+ Software Engineer, Storage

The Cloud Storage team at Crusoe seeks a Staff Software Engineer to lead the dev...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
155000.00 - 250000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience building and operating large scale, complex distributed cloud computing infrastructure products
  • Preferably, experience building redundant and fault tolerant storage solutions with backups, replication, encryption, and data protection mechanisms
  • Knowledge of professional software engineering practices and best practices for the full software development life cycle
  • Strong experience with at least one application programming language like Java or Go
  • Exposure to Infrastructure as Code tooling with any of Ansible, Chef, Puppet, and/or Terraform
  • Knowledge of Linux Systems Internals and computer architecture
  • Strong communication and collaboration skills
  • Must be able to pass a background check
Job Responsibility
Job Responsibility
  • Lead engineering efforts on cloud storage features by collaborating with product and engineering to define and execute features on the roadmap
  • Write and review code, generate and review design documentation
  • Participate in qualifications and rollouts of software across the stack journeying from bare metal to user-facing APIs
  • Guide the engineering team through architecture decisions, design processes, design reviews, code reviews, and implementation tasks
  • Mentor and grow engineers on your team
  • Champion and lead initiatives across the engineering organization such as tech talks, open source development, and book clubs
  • Benchmark, analyze, and improve scale, performance, and resiliency issues
What we offer
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right