CrawlJobs Logo

Principal Software Engineer

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Multiple Locations

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

139900.00 - 274800.00 USD / Year

Job Description:

Microsoft Azure High Performance Computing & AI Engineering (HPC & AI Eng) team is responsible for managing the core platform & fleet of AI High Performance Computing products that customers use to run their most performant and demanding workloads. The AI Customer Experience (AICE) engineering team within the HPC & AI Eng. team is on the frontlines managing the flagship supercomputers used by top tier AI customers that enable breakthroughs such as ChatGPT and are highlighted in Top500, MLPerf and Graph500 rankings. As a Principal Supercomputing Software Engineer, you will design and develop high volume low latency telemetry pipelines, connect to existing telemetry pipelines, and stitch together data to deliver first to know insights on customer facing issues across the infrastructure stack – from datacenter events to various hardware and networking subsystem events affecting job reliability and causing job interrupts. In this role, you will bring exceptional design and development expertise, with a solid background in large-scale High-Performance Computing & GPU systems, cloud computing platforms, high-performance data processing infrastructure. This opportunity will give you hands-on experience managing the largest scale of supercomputers delivered to our customers. As a key technical leader, you will engage deeply with strategic customers, directly influencing their business outcomes as well as drive engineering improvements in the Azure ecosystem benefiting the broader fleet. Your work will enable the next wave of growth and innovation in AI and high-performance computing (HPC) in the cloud.

Job Responsibility:

  • Architect, design and develop high volume low latency end to end event pipelines that can provide first-to-know-insights on events causing job interrupts and job reliability
  • Conduct analysis of existing event pipelines to evaluate fidelity, granularity and latency of critical events
  • Contribute to improving key metrics such as Job Mean Time to Interrupt, Nodes in Service, Mean Time to Resolve on flagship supercomputers by enabling data scientists and domain experts to use the telemetry to identify events & issues at the intersection of datacenter and hardware, develop hypothesis, conduct A/B tests and synthesize results
  • Partner with cross organizational teams to evaluate available telemetry and latency drive architecture, design, development and deployment of end-to-end solutions to manage core infrastructure including current & next generation datacenter, IT hardware, power & cooling technologies
  • Drive engineering and operational excellence based on issues and learnings from strategic customers on their usage scenarios to improve product features and capabilities
  • Partner with teams on continuous learning and continuous improvement programs by leading the resolution of complex incidents, driving root cause analyses and championing initiatives to minimize future customer impact

Requirements:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR equivalent experience
  • 5+ years hands on experience designing and developing high volume low latency pipelines using products such as AzPubSub, Event Hubs, Azure Stream Analytics, Kafka, Grafana, Event Hubs, Prometheus or equivalent products
  • 3+ years of experience with one of AI/HPC system management OR High-Speed Networks OR HPC Storage OR managing Cloud Infrastructure
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Nice to have:

  • Bachelor's Degree in Computer Science - OR related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, OR Python - OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python - OR equivalent experience
  • 5+ years of experience in operating AI/HPC systems, developing and running AI/HPC applications on clusters, or operating Cloud Infrastructure
  • 3+ years of experience in multiple DataCenter technologies: power, cooling, IT hardware, telemetry

Additional Information:

Job Posted:
February 13, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal Software Engineer

Principal Software Engineer, Cloud Storage Engineering

We are working on a greenfield storage platform built on top of Kubernetes and P...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors, Masters, or PhD in Computer science in a related technical field or similar experience
  • 10+ years of experience in software development and architecture
  • Expert-level experience with one or more prominent languages such as Java, Kotlin, or Go is crucial.
  • An expert in Kubernetes stateful sets and/or databases such as PostgreSQL.
  • Passion for collaborating with and mentoring junior members of the team
  • A real appetite for helping others learn and grow
  • Considers the customer impact when making technical decisions
Job Responsibility
Job Responsibility
  • Regularly tackle the largest and most complex problems on the team, from technical design to launch
  • Deliver solutions that are used by other teams and products
  • Determine plans-of-attack on large projects
  • Routinely tackle complex architecture challenges and apply architectural standards and start using them on new projects
  • Lead code reviews & documentation as well as take on complex bug fixes, especially on high-risk problems
  • Set the standard for thorough, meaningful code reviews
  • Partner across engineering teams to take on company-wide initiatives spanning multiple projects
  • Transfer your depth of knowledge from your current language to excel as a Software Engineer
  • Mentor more junior members
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

The Principal Software Engineer role involves tackling complex problems, deliver...
Location
Location
United States , Mountain View; San Francisco
Salary
Salary:
170800.00 - 274300.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expertise in multiple backend technologies
  • Ability to mentor junior members
  • Experience in tackling complex architecture challenges
  • Ability to lead code reviews and documentation
  • Experience in delivering solutions used by other teams and products
Job Responsibility
Job Responsibility
  • Tackle the largest and most complex problems on the team
  • Deliver solutions that are used by other teams and products
  • Determine plans-of-attack on large projects
  • Tackle complex architecture challenges and apply architectural standards
  • Lead code reviews & documentation
  • Take on complex bug fixes
  • Set the standard for thorough, meaningful code reviews
  • Partner across engineering teams to take on company-wide initiatives
  • Transfer depth of knowledge from current language to excel as a Software Engineer
  • Mentor more junior members
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer

The Senior Principal Software Engineer role involves tackling complex problems, ...
Location
Location
United States , San Francisco
Salary
Salary:
198300.00 - 318600.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or equivalent
  • 10+ years of experience working as senior Software Engineer
  • 5+ years of experience acting as Technical Lead for a team of engineers
  • Experience in supporting compliance environments related to FedRAMP, DoD, or mature compliance offerings at large scale tech companies
Job Responsibility
Job Responsibility
  • Regularly tackle the largest and most complex problems on the team, from technical design to launch
  • Partner across engineering teams to take on company-wide initiatives spanning multiple projects
  • Routinely tackle complex architecture challenges and apply architectural standards and start using them on new projects
  • Work across senior engineering and product leaders to build strategy and design solutions to earn customers trust and business
  • Lead a large team of Principals to tackle some of the broadest impacting and technically complex problems with a platform mindset
What we offer
What we offer
  • Health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer

To become a 100 year company, we need a world-class engineering organisation mad...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of experience building and developing backend applications - Bachelor's or Master's degree with a preference for Computer Science degree
  • Experience crafting and implementing highly scalable and performant micro-services
  • Proficiency in any modern object-oriented programming language (e.g., Java, Kotlin, etc.)
  • Real passion for collaboration and strong interpersonal and communication skills
  • Familiarity with cloud architecture patterns and an engineering discipline to produce software with quality
  • Experience in Elasticsearch, being able to deep dive into performance at scale problems, is a bonus
Job Responsibility
Job Responsibility
  • Build and ship JQL Search features and capabilities daily in highly scalable, cross-geo distributed environment
  • Be responsible technical evolution for products and platforms in JQL Search and partially in Jira Platform
  • Be part of an amazing open and collaborative work environment with other experienced engineers, architects, product managers, and designers, working closely with JQL Search and collaborating with other Jira Platform or Jira product teams
  • Mentor and level up the skills of your teammates by sharing your expertise in formal and informal knowledge sharing sessions
  • Ensure full visibility, error reporting, and monitoring of high performing backend services
  • Foster a culture of efficient delivery
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Visit go.atlassian.com/perksandbenefits to learn more
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

Atlassian is seeking a Principal Software Engineer for Cloud Storage Engineering...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors, Masters, or PhD in Computer science in a related technical field or similar experience
  • 10+ years of experience in software development and architecture
  • Expert-level experience with one or more prominent languages such as Java, Kotlin, or Go is crucial
  • An expert in Kubernetes stateful sets and/or databases such as PostgreSQL
  • Passion for collaborating with and mentoring junior members of the team
  • A real appetite for helping others learn and grow
  • Considers the customer impact when making technical decisions
Job Responsibility
Job Responsibility
  • Regularly tackle the largest and most complex problems on the team, from technical design to launch
  • Deliver solutions that are used by other teams and products
  • Determine plans-of-attack on large projects
  • Routinely tackle complex architecture challenges and apply architectural standards and start using them on new projects
  • Lead code reviews & documentation as well as take on complex bug fixes, especially on high-risk problems
  • Set the standard for thorough, meaningful code reviews
  • Partner across engineering teams to take on company-wide initiatives spanning multiple projects
  • Transfer your depth of knowledge from your current language to excel as a Software Engineer
  • Mentor more junior members
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Principal Software Engineer RR Level 3

We're responsible for prototyping and developing new technologies and novel solu...
Location
Location
United States , Annapolis Junction
Salary
Salary:
Not provided
anthemengineering.com Logo
Anthem Engineering
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Polygraph Required
  • Onsite work at customer location required
  • Experience with TypeScript and JavaScript
  • Experience with React, Angular, Material UI, Bootstrap, Storybook
  • Experience with Java, Spring Boot
  • Experience with Elasticsearch, MongoDB, MySQL
  • Experience with npm, Webpack, Maven
  • Experience with Jest, JUnit
  • Experience with Git
  • Ability to learn and grow
Job Responsibility
Job Responsibility
  • Prototype and develop new technologies and novel solutions to issues within the division
  • Work closely with analysts to understand their workflows, identify gaps, and create solutions that make things easier and more effective
  • Work on a variety of constantly changing applications and services that support a wide array of missions
  • Build tools that ingest, process, and analyze large volumes of information
  • Provide direct support to forces protecting Americans
  • Organize, process, and visualize a source of information for critical intelligence
  • Identify and track critical assets using geospatial tools and analytics
  • Apply AI/ML algorithms to real-world problems
  • Provide advice and recommendations to Government customers
  • Choose the right tools to address each challenge
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

About LogRocket: Founded in 2016, LogRocket's goal is to make every experience o...
Location
Location
United States , Boston or NYC
Salary
Salary:
225000.00 - 260000.00 USD / Year
logrocket.com Logo
LogRocket
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in a full stack or backend development role, preferably with a SaaS company
  • Experience with state of the art cloud technologies, including architectural principles, specific tools of the trade, and their strengths and weaknesses
  • Hands-on experience building and redesigning highly available distributed systems
  • Familiarity with modern Javascript-based applications and frameworks
  • Experience in development environments with demanding scalability or availability requirements
  • A strong collaborator who is transparent about progress on tasks, seeks feedback early and often, works effectively with the team to deliver on engineering estimates
  • Acts as a mentor to junior engineers and provides coaching to upskill other team members
Job Responsibility
Job Responsibility
  • Design a system to automatically detect the most common user paths across millions of events
  • Implement a search backend that allows users to search in real time across billions of log entries
  • Build a machine learning pipeline that automatically detects bugs in our users' apps
  • Automate database scaling to improve operating cost while maintaining the ability to respond to traffic spikes
  • Build a system that automatically recommends integrations for customers based on their toolset
What we offer
What we offer
  • Catered lunch and an impressive array of your favorite snacks (healthy AND non-healthy!)
  • Unlimited vacation policy
  • Health, Dental, Vision benefits, 401k, commuter benefits
  • Generous stock options
  • Regular team outings and activities (from boat rides to paintball, we’ll try anything!)
  • Fulltime
Read More
Arrow Right

Principal Software Engineer

Viant’s customers use the Demand Side Platform (DSP) to set up, run and monitor ...
Location
Location
United States , Irvine; Los Angeles
Salary
Salary:
200000.00 - 260000.00 USD / Year
viantinc.com Logo
Viant
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Must have experience in the AdTech domain OR possess strong proficiency in the Go programming language
  • 12+ years of professional experience in the software industry
  • Experience with web services, microservices, or other SOA
  • Relational database experience, such as MySQL, Postgres, or Oracle, and experience with complex SQL
Job Responsibility
Job Responsibility
  • Determine a long-term technical vision and scope of future projects, while continuously developing and reviewing existing projects
  • Lead end-to-end product development efforts for Viant DSP’s suite of products
  • Own & contribute to major features & areas of the products
  • Work closely with cross-functional teams internally as well as customers externally to understand pain points, underserved needs, and identify opportunities
  • Mentor engineers who thrive working in an innovative, user-focused, fast paced environment
  • Drive initiatives to reduce operational load, and build operationally sustainable products
What we offer
What we offer
  • fully paid health insurance
  • paid parental leave
  • unlimited PTO
Read More
Arrow Right