Senior Software Engineer - Performance

United States, Mountain View · 119800.00 - 234700.00 USD / Year · Job Posted March 24, 2026

Job Description

The Artificial Intelligence Performance team at Microsoft develops AI software that enables running AI models everywhere, from the world’s fastest AI supercomputers to servers, desktops, mobile phones, IoT devices, and internet browsers. We collaborate with our hardware teams and partners, both internal and external, and operate at the intersection of AI algorithmic innovation, purpose-built AI hardware, systems, and software. We are a team of highly capable and motivated people who pride themselves on a collaborative and inclusive culture. We own the inference performance of OpenAI and other state-of-the-art LLMs and work directly with OpenAI on the models hosted on the Azure OpenAI service, serving some of the largest workloads on the planet, with trillions of inferences per day in major Microsoft products, including Office, Windows, Bing, SQL Server, and Dynamics.

As a Senior Software Engineer - Performance on the team, you will have the opportunity to work on multiple levels of the AI software stack, including the fundamental abstractions, programming models, compilers, runtimes, libraries, and APIs to enable large-scale training and inferencing of models. You will benchmark OpenAI and other LLMs for performance on GPUs and Microsoft HW, debug and optimize performance, monitor performance, and enable these models to be deployed in the shortest amount of time and on the least amount of HW possible, helping achieve Microsoft Azure's capex goals.

This is a hands-on technical role requiring software design and development skills. We’re looking for someone who has a demonstrated history of solving technical problems and is motivated to tackle the hardest problems in building a full end-to-end AI stack. An entrepreneurial approach and the ability to take initiative and move fast are essential.

Job Responsibility

  • Identify and drive improvements to end-to-end inference performance of OpenAI and other state-of-the-art LLMs
  • Measure and benchmark performance on Nvidia/AMD GPUs and first-party Microsoft silicon
  • Optimize and monitor performance of LLMs and build SW tooling to enable insights into performance opportunities ranging from the model level to the systems and silicon level to improve customer experience and reduce the footprint of the computing fleet
  • Enable fast time to market of LLMs/models and their deployments at scale by building SW tools that afford velocity in porting models to new Nvidia and AMD GPUs
  • Design, implement, and test functions or components for our AI/DNN/LLM frameworks and tools
  • Speed up and reduce the complexity of key components/pipelines to improve the performance and/or efficiency of our systems
  • Communicate and collaborate with our partners both internal and external
  • Embody Microsoft's Culture and Values

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice to have

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Technical background and solid foundation in software engineering principles, computer architecture, GPU architecture, HW neural net acceleration
  • Experience in end-to-end performance analysis and optimization of state-of-the-art LLMs, including proficiency using GPU profiling tools
  • Experience in DNN/LLM inference and in one or more DL frameworks such as PyTorch, TensorFlow, or ONNX Runtime, and familiarity with CUDA, ROCm, Triton
  • Cross-team collaboration skills and the desire to collaborate in a team of researchers and developers

Similar Jobs for Senior Software Engineer - Performance (8 matching positions)

Software Engineer II and Senior Software Engineer - Performance

The Artificial Intelligence Performance team at Microsoft develops AI software t...
Location: United States, Mountain View
Salary: 100600.00 - 199000.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Identify and drive improvements to end-to-end inference performance of OpenAI and other state-of-the-art LLMs
  • Measure and benchmark performance on Nvidia/AMD GPUs and first-party Microsoft silicon
  • Optimize and monitor performance of LLMs and build SW tooling to enable insights into performance opportunities ranging from the model level to the systems and silicon level to improve customer experience and reduce the footprint of the computing fleet
  • Enable fast time to market of LLMs/models and their deployments at scale by building SW tools that afford velocity in porting models to new Nvidia and AMD GPUs
  • Design, implement, and test functions or components for our AI/DNN/LLM frameworks and tools
  • Speed up and reduce the complexity of key components/pipelines to improve the performance and/or efficiency of our systems
  • Communicate and collaborate with our partners both internal and external
  • Embody Microsoft's Culture and Values
Employment type: Full-time

Senior Staff Software Engineer (Impala Performance Engineer)

At Cloudera, we empower people to transform complex data into clear and actionab...
Salary: Not provided
Cloudera
Expiration Date: Until further notice
Requirements
  • 8+ years of industry experience in performance related work ideally on large scale distributed systems
  • Understanding of DBMS algorithms and data structure fundamentals
  • Understanding of hardware trends and full stack systems performance: CPU, RAM, storage, network, Linux kernel, JVM, distributed systems performance
  • Deep understanding of performance measurement methodologies and performance analysis tools and techniques
  • Strong design and coding skills (Java/C++/Golang/Python preferred)
  • Ability to work in a distributed setting with team members spread across multiple geographies
  • Demonstrated ability to work on large cross-functional projects, including strong communication skills and a collaborative mindset
  • Hands-on experience with containerization and Kubernetes
  • B.S. or M.S. in Computer Science or equivalent experience
Job Responsibility
  • Work with internal development teams and the open source community to proactively drive performance improvements/optimizations across our data warehouse stack
  • Work with product managers, developers and the field team to understand performance and scale requirements and customer workload characteristics, and develop benchmarks and related performance analysis tooling based on these requirements
  • Analyze performance and scalability characteristics to identify bottlenecks in large-scale distributed systems
  • Perform root cause analysis of performance issues identified by internal testing and from customers and suggest corrective actions
  • Evaluate performance of competitor systems
What we offer
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
Employment type: Full-time

Senior Staff Software Engineer (Impala Performance Engineer)

At Cloudera, we empower people to transform complex data into clear and actionab...
Location: Hungary, Budapest; Szeged; Remote
Salary: Not provided
Cloudera
Expiration Date: Until further notice
Requirements
  • 8+ years of industry experience in performance related work ideally on large scale distributed systems
  • Understanding of DBMS algorithms and data structure fundamentals
  • Understanding of hardware trends and full stack systems performance: CPU, RAM, storage, network, Linux kernel, JVM, distributed systems performance
  • Deep understanding of performance measurement methodologies and performance analysis tools and techniques
  • Strong design and coding skills (Java/C++/Golang/Python preferred)
  • Ability to work in a distributed setting with team members spread across multiple geographies
  • Demonstrated ability to work on large cross-functional projects, including strong communication skills and a collaborative mindset
  • Hands-on experience with containerization and Kubernetes
  • B.S. or M.S. in Computer Science or equivalent experience
Job Responsibility
  • Work with internal development teams and the open source community to proactively drive performance improvements/optimizations across our data warehouse stack
  • Work with product managers, developers and the field team to understand performance and scale requirements and customer workload characteristics, and develop benchmarks and related performance analysis tooling based on these requirements
  • Analyze performance and scalability characteristics to identify bottlenecks in large-scale distributed systems
  • Perform root cause analysis of performance issues identified by internal testing and from customers and suggest corrective actions
  • Evaluate performance of competitor systems
What we offer
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
Employment type: Full-time

Software Engineer / Senior Software Engineer - CoreAI

Azure DevOps is a suite of modern development services that enables software dev...
Location: Czech Republic, Prague
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Solid software development experience
  • Demonstrable experience with C#, C++, Java or any other OOP language
  • Strong analytical skills as well as communication skills both verbal and written
  • Ability to understand unfamiliar code bases, debug client and service side applications (including database stored procedures)
  • Knowledge and experience with Microsoft Azure, AWS or similar cloud computing platforms is preferred
  • Experience with SQL performance tuning (preferably Microsoft SQL Server)
  • Solid understanding of testing principles
  • Ability to prioritize and handle multiple tasks completely and independently and generate clarity in ambiguous situations
  • Troubleshooting skills across network, application, caching, queuing, load-balancing, storage and distributed services layers
Job Responsibility
  • Design, develop, test and support features and experiences
  • Collaborate on the design and development of features and solutions, contributing to technical direction across business scenarios
  • Support highly available services used by top companies and millions of developers on a daily basis
  • Troubleshoot complex issues through the entire tech stack, including frontend and database layers
  • Participate in on-call rotations with your team. Triage and respond to issues and advocate for opportunities to improve service health
  • Collaborate through pairing and code reviews and contribute to a culture of learning and growth
Employment type: Full-time

Software engineer 2 / Senior Software engineer - Azure Data

Microsoft's Azure Data engineering team is leading the transformation of analyti...
Location: India, Bangalore
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience with the Azure stack including Storage, Compute, Networking, Fabric, Purview, Synapse, AKS, DevOps, Data Factory, or Power BI
  • Experience with big data technologies such as Spark, Kafka, Hadoop, or HBase
  • Experience building data lake or data engineering products, tools, or pipelines
  • Familiarity with container-based architectures (Docker, Kubernetes)
  • Ability to debug complex distributed systems on Linux and/or Windows platforms
Job Responsibility
  • Write extensible, maintainable code in C#, Java, Scala, or Python for Fabric Materialized Lake View services and HDInsight components
  • Use AI tools and coding best practices across the development lifecycle
  • Design data refresh, scheduling, and query optimisation features with minimal supervision
  • Review code from teammates for correctness, test coverage, security risks, and adherence to team standards
  • Coach junior engineers through code reviews
  • Debug complex issues in distributed systems running on Azure, Linux, and Windows
  • Run live site operations on a rotational, on-call basis
  • Integrate logging and instrumentation to gather telemetry on system health, performance, reliability, and security
  • Work with product managers, technical leads, and partners across geographies to define customer requirements for Materialized Lake View features
Employment type: Full-time

Software Engineer 2 / Senior Software Engineer

We are looking for experienced Software Engineers for our Bangalore location ...
Location: India, Bengaluru
Salary: Not provided
Komprise, Inc.
Expiration Date: Until further notice
Requirements
  • Solid grasp of computer science fundamentals, especially data structures, algorithms, and multi-threading
  • Ability to solve difficult problems with simple, elegant solutions
  • Should have solid object-oriented programming background with impeccable design skills
  • Experience in developing management applications and performance management applications is ideal
  • Experience with object-based file systems and REST interfaces is a plus (e.g. Amazon S3, Azure, Google Cloud Service)
  • Should have a BE or higher in CS, EE, Math or related engineering or science field
  • At least 5 years of experience in software deployment
  • Tech Stack: Java, Maven, Virtualisation, SaaS, GitHub, Jira, Slack, Cloud Solutions and Hypervisors
Job Responsibility
  • Responsible for designing and developing features that power the Komprise data management platform to manage billions of files and petabytes of data
  • Responsible for designing major components and systems of our product architecture, ensuring that the Komprise data management platform is highly available and scalable
  • Responsible for writing performant code, evaluating feasibility, developing for quality, and optimizing for maintainability
  • Work in an agile, customer-focused, and fast-paced team with direct interaction with customers
  • Responsible for analysing customer-escalated issues and providing resolutions in a timely manner
  • Should be able to design and implement highly performant, scalable distributed systems

Senior Software Engineer / Principal Software Engineer - Copilot CLI

Within GitHub and Microsoft CoreAI, the Copilot CLI team builds GitHub's coding ...
Location: United States, Redmond
Salary: 119800.00 USD / Year
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years
Job Responsibility
  • Take ownership of critical product and platform areas of the Copilot CLI and shared agent runtime
  • Set a high technical and quality bar for agentic systems and developer-facing tooling
  • Design and ship performant, reliable terminal experiences that developers depend on for daily work
  • Use data, benchmarks, and direct user feedback to guide iteration and investment
  • Collaborate across org boundaries to enable other teams to build agentic products on top of a shared foundation
  • Influence architecture, technical direction, and engineering standards beyond your immediate team
What we offer
  • Certain roles may be eligible for benefits and other compensation
Employment type: Full-time

Senior Software Engineer / Principal Software Engineer

We are hiring multiple Senior Software Engineers and Principal Software Engineer...
Location: Costa Rica, Multiple Locations
Salary: Not provided
Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science or Engineering OR equivalent experience
  • At least 5 years of experience in software design and backend service development
  • 5+ years of hands-on experience in an object-oriented language such as C++, C#, Java, Python, or equivalent experience with C
  • Proficiency in English is essential for effective communication
Job Responsibility
  • Drive a project through to delivery removing obstacles as needed to ensure success
  • Contribute to the design of large-scale distributed software services and solutions
  • Write clean, well-tested code
  • Champion and drive improvements to team software engineering practices
  • Mentor junior engineers
  • Collaborate with related teams to achieve goals
  • Promote knowledge sharing and contribute to onboarding documentation for new engineers
  • Act as a participant in the on-call rotation to monitor service for degradation, downtime, or interruptions and perform complex incident management and problem resolution for the service
Employment type: Full-time