CrawlJobs Logo

Principal Software Engineering - AI Frameworks

United States, Redmond 139900.00 - 274800.00 USD / Year · Job Posted February 05, 2026
Apply Position
Job Link Share

Job Description

Are you looking for opportunities to deliver innovations to hundreds of millions of users that impact their daily life? Do you want to be part of a fast-cycle, highly competitive, and hardcore AI/ML frameworks team with ambitious goals to advance AI technology? The AI Frameworks team at Microsoft develops software that pushes the cutting edge of performance and experience in machine learning in production and research. We bridge the gap between the latest state-of-the-art AI models and hardware eco-systems. We build software to enable running AI models everywhere, from the world’s fastest super cloud computers, to servers, desktops, mobile phones, IoT devices, and internet browsers. The framework we develop is powering AI experience for all major Microsoft products, such as Windows, Office, Microsoft 365, Bing, SQL Server, VSCode, Cognitive Service, Copilot+ PC, and many others. As the Principal Software Engineer on our team, you would have the opportunity to work on: ONNX: an open standard format for representing AI models, enabling AI developers to work on AI models with a variety of frameworks, tools, runtimes, and compilers. ONNX Runtime: ONNX based cross-platform, high performance ML inferencing and training accelerator. Foundry Local: an on-device AI inference solution offering performance, privacy, customization, and cost advantages. It integrates seamlessly into your existing workflows and applications through an intuitive CLI, SDK, and REST API. We value our team members’ happiness and encourage everyone to constantly learn new things and take risks. We collaborate closely and openly to accomplish big things together while having fun. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Job Responsibility

  • Partnering with appropriate stakeholders to determine user requirements for one or more complex scenarios
  • Providing technical leadership for the identification of dependencies and the development of design documents for a product, application, service, or platform
  • Leading by example and mentoring others to produce extensible and maintainable code used across the company
  • Leveraging deep subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to lead multiple product's project plans, release plans, and work items
  • Holding accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions
  • Proactively seeking new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers
  • Embodying our Culture and Values

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience developing Inference software stack
  • Experience working on systems performance optimization
  • Working with Open-Source code

Nice to have

  • Experience developing Inference software stack
  • Experience working on systems performance optimization
  • Working with Open-Source code

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Principal Software Engineering - AI Frameworks

8 matching positions

Principal Software Engineering Manager - AI Frameworks

As a Principal Software Engineering Manager - AI Frameworks on the team, you wil...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 304200.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master’s Degree in Computer Science or related technical field AND 10+ years of software engineering experience, including 6+ years in engineering management, OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years of software engineering experience, including 6+ years in engineering management, or equivalent experience
  • Strong technical foundation in software engineering principles, computer architecture, GPU architecture, and hardware acceleration for neural networks, with the ability to guide teams working in these areas
  • Experience leading teams responsible for end-to-end performance analysis and optimization of LLMs, AI systems, or HPC workloads, including use of GPU profiling and performance analysis tools
  • Demonstrated ability to lead cross-team initiatives, align stakeholders, and translate research or platform capabilities into scalable, production-ready solutions
  • Proven people leadership skills, including hiring, coaching, performance management, and career development, with a track record of building high-performing, inclusive teams
  • Exposure to AI / ML infrastructure, including DNN or LLM training and/or inference systems, and experience with at least one modern deep learning framework (e.g., PyTorch, TensorFlow, ONNX Runtime)
  • Familiarity with GPU software stacks and acceleration technologies such as CUDA, ROCm, Triton, or equivalent, sufficient to guide technical direction and evaluate tradeoffs
Job Responsibility
Job Responsibility
  • Lead and develop a team of engineers working across multiple layers of the AI software stack to enable large-scale training and inference
  • Set technical vision and execution strategy for model performance benchmarking, optimization, and deployment across GPUs and Microsoft hardware
  • Drive performance outcomes by prioritizing and overseeing efforts to benchmark, profile, debug, and optimize training and inference workloads
  • Own performance health by establishing mechanisms to monitor regressions, measure impact, and continuously improve time-to-deploy and hardware efficiency
  • Partner cross-functionally with research, product, infrastructure, and hardware teams to deliver scalable, production-ready AI performance improvements
  • Balance short-term delivery and long-term investments, ensuring the team’s work aligns with organizational goals, platform roadmaps, and Azure capex objectives
  • Build a strong engineering culture through coaching, feedback, hiring, and career development, enabling the team to operate with increasing autonomy and impact
  • Fulltime
Read More
Arrow Right

Principal Software Engineering Manager - Data Science & Engineering

The MSRC Data Science team is responsible in building data pipelines, data minin...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Leads team on the disciplined use of, and improving artificial intelligence (AI) tools and practices across the software development lifecycle (SDLC)
  • Guides team on proactively taking responsibility for the content of their AI-generated requirements, design documents, code, and other assets, and assisting other members of the team to do the same
  • Leads team on incorporating Responsible AI practices into the SDLC to ensure appropriate controls over AI-generated assets
  • Coaches team on applying SDLC and engineering health measures (e.g., Accelerate, SPACE framework, Engineering System Success Playbook [ESSP]) to guide improvements to processes and practices, especially those involving AI
  • Leads team on experimenting with AI tools and practices to improve their own capabilities, and providing recommendations on how to adopt them to others
  • Reviews debugging tools, tests, logs, telemetry, and other methods, and acts as an expert for others to proactively verify assumptions while developing code before issues occur across products in production
  • Guides team to perform machine learning/data extraction, transformation, and loading (ETL) pipelines (e.g., data collection, cleaning) based on data prepared
  • Guides the architecture of scalable pipelines and datasets
  • Influences the direction of the team
  • Begins to anticipate potential data pipeline issues and provides solutions
  • Fulltime
Read More
Arrow Right

Ai-first Core It Software Engineering: Software, Ml & Data

This is a Unified Application for our AI-First IT Transformation portfolio. We r...
Location
Location
United States , Santa Clara
Salary
Salary:
Not provided
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3-5+ years of experience in Software Engineering, Data Science, or Machine Learning (Staff level)
  • 6-8+ years (Senior Staff)
  • 8-12+ years (Principal level)
  • Expert-level server-side development (Python, Java, Go) OR deep expertise in statistical modeling, ML algorithms, and LLM fine-tuning
  • Direct experience with RAG architectures, LLM APIs, and Vector Databases (e.g., Pinecone, Milvus)
  • Hands-on experience with Kubernetes, CI/CD, and distributed systems for large-scale AI deployment
Job Responsibility
Job Responsibility
  • Lead the hands-on development of core Enterprise IT Business software leveraging AI components and LLM infrastructure with both traditional and Generative AI model deployment
  • Build and industrialize agentic AI systems and multi-agent frameworks, ensuring secure and effective use of GenAI technologies at the platform level
  • Design and implement robust foundational data pipelines, perform advanced statistical analysis, and develop new ML models to drive autonomous system behavior
  • Design large-scale, distributed AI/ML systems optimized for low latency, high throughput, and developer-friendliness (Inference optimization)
  • Establish evaluation frameworks to measure AI quality (accuracy, hallucination rates) and overall system reliability across the Enterprise AI Factory
  • Fulltime
Read More
Arrow Right

Principal AI Software Architect

Principal Artificial Intelligence (AI) Software Architect leads the design, deve...
Location
Location
United States , St. Louis
Salary
Salary:
Not provided
vantagelinks.com Logo
VantageLinks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in the field of computer science, information systems, or computer engineering or equivalent experience
  • 7+ years of experience in software architecture and engineering, with at least 2 years focused on AI/ML systems
  • 5+ years of hands-on experience with application development tools, including Visual Studio .NET
  • 5+ years of hands-on experience with relational database technology, preferably MS SQL Server
Job Responsibility
Job Responsibility
  • Architects and leads the development of AI-driven platforms and scalable software systems aligned with strategic business goals
  • Collaborates cross-functionally with engineering, product, and business teams to define use cases and deliver impactful solutions
  • Evaluates and selects AI and software technologies, frameworks, and tools for optimal performance, scalability, and maintainability
  • Establishes and promotes best practices in AI model development, MLOps, software engineering, DevOps, and system architecture
  • Mentors engineering teams, fostering a culture of innovation, excellence, and continuous improvement
  • Drives technical strategy and long-term planning, aligning engineering efforts with enterprise objectives
  • Ensures compliance with data privacy, security, and ethical standards in all AI and software initiatives
  • Champions engineering excellence through code reviews, automated testing, CI/CD, and agile methodologies
  • Other duties as needed or assigned
Read More
Arrow Right

Principal Software Engineering Manager - Substrate Efficiency

M365 Copilot inference is a high-impact engineering team advancing applied AI an...
Location
Location
United States , Redmond
Salary
Salary:
142800.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Job Responsibility
Job Responsibility
  • Build and lead a high-performing engineering team focused on inference runtime efficiency and model execution performance
  • Define and drive strategy to improve throughput per GPU through runtime optimizations
  • Increase engineering agility, enabling faster experimentation, iteration, and rollout of performance improvements
  • Partner across M365 Core, AI Core, Azure, and Microsoft Research to co-design and productionize advanced inference optimizations
  • Establish metrics, telemetry, and experimentation frameworks to measure efficiency gains and guide investment decisions
  • Own live-site performance, reliability, and operational excellence for inference engines at scale
  • Drive alignment across partner teams on engine interfaces, performance goals, and optimization priorities.
  • Fulltime
Read More
Arrow Right

Principal AI Software Engineer, Senior Vice President

Are you looking for a career move that will put you at the heart of a global fin...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exceptional Python Expertise: Demonstrated mastery of core Python, including advanced features, performance optimization, and a deep understanding of the FastAPI framework
  • Prior hands-on experience with Generative AI, Large Language Model (LLM) frameworks (e.g. LangChain, LlamaIndex), and their application in enterprise environments is a must. This must be underpinned by a profound understanding of core machine learning principles, algorithms, and data science methodologies
  • Full Lifecycle Ownership: Extensive hands-on experience and technical authority throughout the entire software development lifecycle, from conceptualization and design to implementation, deployment, and operational ownership of enterprise software solutions, involving significant cross-functional collaboration
  • Strategic System Design: Significant hands-on experience in architecting and designing (architecture, design patterns, reliability, scaling) highly complex new and current systems with broad technical impact
  • Hands-on expertise with containerized deployment technologies (e.g. Kubernetes, OpenShift, Docker) and orchestration strategies
  • Hands-on experience and in-depth understanding of C++ is a significant bonus, particularly for complex code analysis, parsing, and integration into knowledge graph structures
Job Responsibility
Job Responsibility
  • Architect and implement cutting-edge software systems, defining the technical design for our AI solutions to ensure scalability, performance, and reliability
  • Drive the hands-on design, implementation, and deployment of sophisticated systems that automate the analysis of data, code, and documentation
  • Apply deep expertise to structure extracted knowledge within a Credit Risk Domain-aware knowledge graph, including advanced strategies for effectively modelling complex codebases, particularly C++, within this graph
  • Act as a critical technical partner with data scientists, business analysts, and other engineering teams to translate challenging business requirements into robust technical solutions and ensure successful, high-quality project delivery
  • Tackle the most complex technical challenges within our AI initiatives, providing solutions that set the standard for engineering excellence
What we offer
What we offer
  • Generous holiday allowance starting at 27 days plus bank holidays
  • increasing with tenure
  • A discretional annual performance related bonus
  • Private medical insurance packages to suit your personal circumstances
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Principal Ai Software Engineer

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great prod...
Location
Location
United States , San Jose
Salary
Salary:
240000.00 - 360000.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Knowledge in GPU architectures, basic knowledge of CPU architecture
  • Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
  • Understanding of GPU programming such as ROCm, CUDA, OpenCL, etc
  • Experience in hardware/software co-design, building high-performance products across the full product lifecycle
  • Experience with operating systems (OS) and device driver development is a plus
  • Undergrad degree required. Bachelor of Science, Masters, or PhD degree with emphasis in Electrical Engineering, Computer architecture, or Computer Science with relevant experience preferred
Job Responsibility
Job Responsibility
  • Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
  • Strong Execution: Deliver innovations and roadmap for AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
  • Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
  • Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
  • Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
  • Community & Open Source: Mentor and inspire other engineers and contribute to ROCm Opensource
What we offer
What we offer
  • AMD benefits at a glance
  • Fulltime
Read More
Arrow Right

Principal AI Software Engineer

AMD AI Group is seeking a highly influential technical leader for OneROCm — driv...
Location
Location
United States , San Jose
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Knowledge in GPU architectures, basic knowledge of CPU architecture
  • Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
  • Understanding of GPU programming such as ROCm, CUDA, OpenCL, etc
  • Experience in hardware/software co-design, building high-performance products across the full product lifecycle
  • Experience with operating systems (OS) and device driver development is a plus
  • Undergrad degree required. Bachelor of Science, Masters, or PhD degree with emphasis in Electrical Engineering, Computer architecture, or Computer Science with relevant experience preferred
Job Responsibility
Job Responsibility
  • Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
  • Strong Execution: Deliver innovations and roadmap for AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
  • Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure out-of-the-box performance excellence on AMD hardware
  • Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
  • Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
  • Community & Open Source: Mentor and inspire other engineers and contribute to ROCm Opensource
What we offer
What we offer
  • Benefits offered are described: AMD benefits at a glance
  • Fulltime
Read More
Arrow Right