CrawlJobs Logo

Principal Software Engineer - Azure Storage

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Multiple Locations

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

139900.00 - 274800.00 USD / Year

Job Description:

Want to impact the foundation for future AI storage development in Azure, the world's computer? The Azure Managed Lustre File System (AMLFS) team leads development, deployment, and monitoring of the most popular High-Performance Computing (HPC) parallel file system in the world: Lustre, the Azure storage solution of choice for AI training and fine-tuning. The Pittsburgh-based AMLFS Platform Team is responsible for end-to-end delivery of AMLFS images, cluster deployment, logs and metrics, and configuration compliance. An ideal candidate will also have opportunities to impact cluster architecture and design of Lustre in the Azure ecosystem, performance analysis and optimization of AMLFS, and customer support for the most challenging parallel filesystem bugs or performance anomalies that arise within our product.

Job Responsibility:

  • Partners with appropriate stakeholders to determine user requirements for a set of scenarios
  • Leads identification of dependencies and the development of design documents for a product, application, service, or platform
  • Leads by example and mentors others to produce extensible and maintainable code used across products
  • Leverages subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple group's project plans, release plans, and work items
  • Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers

Requirements:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check

Nice to have:

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python OR equivalent experience
  • 3+ years of experience: working, developing, and debugging within a Linux operating system environment and at least broad understanding of Linux kernel fundamentals, AND working with filesystem design, development, and debugging, AND with high-performance computing OR distributed systems in an industry or academic setting
  • 6+ years of experience: with high-performance computing OR distributed systems in an industry or academic setting, AND with the Lustre parallel file system OR an equivalent parallel or distributed file system
  • Experience performing performance analysis and root cause of a distributed or complex system

Additional Information:

Job Posted:
February 03, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal Software Engineer - Azure Storage

Principal Software Engineer

Atlassian is a global leader in cloud collaboration, and one of the world’s larg...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field
  • Deep expertise in PostgreSQL, including internals, extension development, performance tuning, and scaling
  • 10+ years of experience in backend software development, with a focus on distributed systems and storage solutions
  • 5+ years of hands-on experience with AWS RDS/Aurora or equivalent cloud database platforms (GCP, Azure)
  • Demonstrated leadership in technical design, mentoring, and open-source contribution
  • Ability to drive technical roadmaps, influence architectural decisions, and champion best practices across teams
  • Experience mentoring engineers and building high-performing, collaborative teams
Job Responsibility
Job Responsibility
  • Contribute to open-source projects and represent Atlassian in the broader PostgreSQL community
  • Lead initiatives to improve scalability, performance, reliability, and security of the self managed Postgres
  • Collaborate with cross-functional teams to define technical strategy and deliver robust solutions for complex storage challenges
  • Establish and promote best practices in distributed systems, cloud infrastructure, and cost optimization
  • Mentor and develop engineers, fostering a culture of technical excellence and continuous learning
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
Read More
Arrow Right

Principal Software QA Engineer

Principal Software QA Engineer to lead test architecture and automation strategy...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of hands-on QA experience
  • Designing and building test automation frameworks from scratch
  • Non-functional testing (scale, reliability, performance, security)
  • Strong coding skills in Python, Java, or Go
  • Experience with Pytest, TestNG, JUnit, Playwright or similar tools
  • Deep understanding of Cloud platforms (AWS, Azure, GCP)
  • Microservices, Containers (Docker, Kubernetes)
  • Infrastructure & Data Center management
  • Linux/VM environments, Storage, Compute, Networking
  • REST APIs, JSON, SQL/NoSQL
Job Responsibility
Job Responsibility
  • Design, automate, and execute system-level test cases focused on scale, reliability, security, and performance
  • Lead the test automation strategy
  • evaluate and integrate new tools to improve efficiency and coverage
  • Collaborate closely with product, development, support, and platform engineering teams to ensure full lifecycle quality coverage
  • Provide technical leadership and mentorship to QA engineers and partners across teams
  • Contribute to design reviews with a QA lens to ensure testability and risk mitigation
  • Maintain and manage multiple product test configurations aligned with diverse deployment environments
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive suite of benefits supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Software Engineer

As a Software Engineer you will be part of a team of talented developers, busine...
Location
Location
India , Kochi
Salary
Salary:
Not provided
valoremreply.com Logo
Valorem Reply
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or higher in computer science, engineering, or related field
  • 4 years experience with Microsoft & open source Technologies
  • Proficient developing web-based interfaces, including: ASP.Net MVC, Angular/React.js and HTML5
  • JavaScript, jQuery, Typescript
  • Modern, multi-device and form factor Single Page Applications
  • Experience working with Microsoft Azure is preferred, including: Compute & Containers - App Service (Web Apps and API Apps) and Containers
  • Database & Storage - Azure SQL, Storage Accounts and Table Storage
  • Integration - Service Bus, Event Grid, API Management and Logic Apps
  • Security - AAD & Key Vault
  • Azure Management, including deployments, configuration and the SDK
Job Responsibility
Job Responsibility
  • Be an active individual contributor on a team: creating working, readable code that fits given business requirements, and participating in code reviews
  • Understand project scopes, architectural diagrams and requirements to further breakdown more granular requirements and implementing them as part of a project team
  • Work in an Agile team (Scrum, Kanban, etc.) and be a vocal participant in team meetings
  • Promote the organization's present technology within a team
  • Fulltime
Read More
Arrow Right

Principal Software Developer in Test

The PC/FLex team is seeking a Principal Cloud developer in Test who thrives in a...
Location
Location
Puerto Rico , Aguadilla
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Knowledge of complete SDLC and experience in software testing methodology, tools, developing test strategy, test planning and test execution
  • Strong automation background to be able to own automation strategy and tools required to build a test framework
  • Strong troubleshooting and analytical skills
  • Experience in software testing methodology, tools, developing test strategy, test planning and test execution
  • Knowledge of tools and processes for solution, regression, scale, and performance testing
  • Experience working in continuous integration and continuous delivery
  • Self-motivated with excellent verbal and written communication skills
  • Ideal candidates will have at least 13+ years of relevant experience in developing and supporting automation test frameworks from ground up
  • Proficiency in programming in Python/Java/Go, Playwright
  • Experience in the following areas/technologies: Private Cloud experience, Infrastructure & Data Center management experience, SQL/NoSQL DBs, XML, REST APIs, JSON, Web Applications, AWS/GCP/Azure, Docker, Microservices, Containerisation, Kubernetes, CI/CD, Git/Gitlab, Jenkins, LINUX/VM environments, Pytest, TestNG, Junit, Scale/performance testing tools, Storage / Compute / Networking background, KVM / Vmware
Job Responsibility
Job Responsibility
  • Design, execute, automate and deploy long running, scale, reliability, and security testcases and report the results
  • Provide guidance to functional QA teams, manage and expand relationships with other QA partners on software systems test and automation
  • Review and evaluate designs and provide tangible feedback to improve product quality and mitigate failure risk
  • Own the automation strategy, guide the team to build automation libraries, and utilize existing tools wherever possible as a part of new feature testing or as a part of the sustaining process for released/deployed software
  • Collaborate with other cross-functional teams including but not limited to Product Management, Customer Support and Platform engineering
  • Maintain multiple product configurations as applicable to various projects
  • Provide guidance and mentoring to new hires joining the team
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Principal Data Engineer

We are on the lookout for a Principal Data Engineer to help define and lead the ...
Location
Location
United Kingdom
Salary
Salary:
Not provided
dotdigital.com Logo
Dotdigital
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive experience delivering python-based projects in the data engineering space
  • Extensive experience working with SQL and NoSQL database technologies (e.g. SQL Server, MongoDB & Cassandra)
  • Proven experience with modern data warehousing and large-scale data processing tools (e.g. Snowflake, DBT, BiqQuery, Clickhouse)
  • Hands on experience with data orchestration tools like Airflow, Dagster or Prefect
  • Experience using cloud environments (e.g. Azure, AWS, GCP) to process, store and surface large scale data
  • Experience using Kafka or similar event-based architectures e.g. (Pub/Sub via AWS SQS, Azure EventHubs, AWS Kinesis)
  • Strong grasp of data architecture and data modelling principles for both OLAP and OLTP workloads
  • Capable in the wider software development lifecycle in terms of agile ways of working and continuous integration/deployment of data solutions
  • Experience as a lead or Principal Engineer on large-scale data initiative or product builds
  • Demonstrated ability to architect data systems and data structures for high volume, high throughput systems
Job Responsibility
Job Responsibility
  • Lead the design and implementation of scalable, secure and resilient data systems across streaming, batch and real-time use cases
  • Architect data pipelines, model and storage solutions that power analytical and product use cases
  • using primarily Python and SQL via orchestration tooling that run workloads in the cloud
  • Leverage AI to automate both data processing and engineering processes
  • Assure and drive best practices relating to data infrastructure, governance, security and observability
  • Work with technologists across multiple teams to deliver coherent features and data outcomes
  • Support the data team to help adopt data engineering principles
  • Identify, validate and promote new tools and technologies that improve the performance and stability of data services
What we offer
What we offer
  • Parental leave
  • Medical benefits
  • Paid sick leave
  • Dotdigital day
  • Share reward
  • Wellbeing reward
  • Wellbeing Days
  • Loyalty reward
  • Fulltime
Read More
Arrow Right

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location
Location
United States , Pleasanton, California
Salary
Salary:
251000.00 - 314500.00 USD / Year
blackline.com Logo
BlackLine
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on with observability stacks (Prometheus, Grafana, Newrelic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility
Job Responsibility
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer
What we offer
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
  • Fulltime
Read More
Arrow Right
New

Principal Software Engineer- Azure Storage

Are you ready to shape the future of Artificial Intelligence AI-scale storage so...
Location
Location
United States , Multiple Locations
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C# or Java OR equivalent experience
  • 5+ years of hands-on experience with Storage products and cloud-native architectures
  • 5+ years of experience in software development with a focus on large distributed systems
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Partner with stakeholders to determine requirements for scalable, reliable, and high-performance storage solutions
  • Lead the design and implementation of enhancements to optimize the storage control plane for AI-scale workloads
  • Identify and address bottlenecks in system performance and scalability while driving improvements across the platform
  • Act as a Designated Responsible Individual (DRI) to ensure system availability, performance, and fault tolerance, including participation in on-call rotations
  • Leverage subject matter expertise in Azure Storage and distributed systems to influence technical decisions and product roadmaps
  • Fulltime
Read More
Arrow Right
New

Principal Software Engineer - Azure Storage team

Are you passionate about distributed systems, massive scalability, and durabilit...
Location
Location
Australia , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • 1+ year(s) experience with proficiency in C, C++, and object-oriented programming
Job Responsibility
Job Responsibility
  • Partners with appropriate stakeholders to determine user requirements for a set of scenarios
  • Leads identification of dependencies and the development of design documents for a product, application, service, or platform
  • Leads by example and mentors others to produce extensible and maintainable code used across products
  • Leverages subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple group's project plans, release plans, and work items
  • Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers
  • Fulltime
Read More
Arrow Right