CrawlJobs Logo

Senior and Principal Software Engineer - Core AI

United States, Redmond 139900.00 - 274800.00 USD / Year · Job Posted April 06, 2026
Apply Position
Job Link Share

Job Description

Core AI is at the forefront of Microsoft’s mission to redefine how software is built and experienced. We are responsible for building the foundational platforms, services, programming models, and developer experiences that power the next generation of applications using Generative AI. Our work enables developers and enterprises to harness the full potential of AI to create intelligent, adaptive, and transformative software. The Observability group is focused on developing solutions to monitor, evaluate, and optimize AI agent performance. We are seeking Senior and Principal Software Engineers to join the Observability platform team. This team is responsible for building the services that power Observability in Foundry.

Job Responsibility

  • Design, implement and deliver AI services to support product offerings for large-scale agent observability
  • Collaborate closely with product management and partner teams to align technical direction with business goals
  • Take end-to-end responsibility for the development lifecycle and production readiness of the services you build and drive the team’s DevOps culture
  • Engage with customers to gather feedback and resolve complex issues
  • Understand Microsoft businesses and collaborate with stakeholders towards cohesive, end-to-end experiences for Microsoft customers
  • Innovate on technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products
  • Embody our culture and values

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C++, C#, Go, Java, or Python
  • OR equivalent experience
  • 6+ years technical engineering experience designing and delivering highly available, large-scale cloud services and distributed systems
  • Experience building AI or ML related applications
  • 1+ years of technical engineering experience with machine learning or Artificial Intelligence (AI) systems

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior and Principal Software Engineer - Core AI

8 matching positions

Principal AI Software Engineer, Senior Vice President

Are you looking for a career move that will put you at the heart of a global fin...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exceptional Python Expertise: Demonstrated mastery of core Python, including advanced features, performance optimization, and a deep understanding of the FastAPI framework
  • Prior hands-on experience with Generative AI, Large Language Model (LLM) frameworks (e.g. LangChain, LlamaIndex), and their application in enterprise environments is a must. This must be underpinned by a profound understanding of core machine learning principles, algorithms, and data science methodologies
  • Full Lifecycle Ownership: Extensive hands-on experience and technical authority throughout the entire software development lifecycle, from conceptualization and design to implementation, deployment, and operational ownership of enterprise software solutions, involving significant cross-functional collaboration
  • Strategic System Design: Significant hands-on experience in architecting and designing (architecture, design patterns, reliability, scaling) highly complex new and current systems with broad technical impact
  • Hands-on expertise with containerized deployment technologies (e.g. Kubernetes, OpenShift, Docker) and orchestration strategies
  • Hands-on experience and in-depth understanding of C++ is a significant bonus, particularly for complex code analysis, parsing, and integration into knowledge graph structures
Job Responsibility
Job Responsibility
  • Architect and implement cutting-edge software systems, defining the technical design for our AI solutions to ensure scalability, performance, and reliability
  • Drive the hands-on design, implementation, and deployment of sophisticated systems that automate the analysis of data, code, and documentation
  • Apply deep expertise to structure extracted knowledge within a Credit Risk Domain-aware knowledge graph, including advanced strategies for effectively modelling complex codebases, particularly C++, within this graph
  • Act as a critical technical partner with data scientists, business analysts, and other engineering teams to translate challenging business requirements into robust technical solutions and ensure successful, high-quality project delivery
  • Tackle the most complex technical challenges within our AI initiatives, providing solutions that set the standard for engineering excellence
What we offer
What we offer
  • Generous holiday allowance starting at 27 days plus bank holidays
  • increasing with tenure
  • A discretional annual performance related bonus
  • Private medical insurance packages to suit your personal circumstances
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer - AI Governance

As a Senior Principal Software Engineer, you will serve as a technical leader fo...
Location
Location
United States , San Francisco
Salary
Salary:
165000.00 - 220000.00 USD / Year
onetrust.com Logo
OneTrust
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master's degree in computer science, Engineering, or related technical or business field
  • 12+ years of professional software engineering/development experience
  • Strong expertise in Java/J2EE, Spring, design patterns, microservices architecture, and cloud-native distributed systems
  • Proven experience building production-grade agentic AI systems with robust tool-use, guardrails, and observability for enterprise-scale workloads
  • Solid understanding of RAG pipelines, agent workflows, model orchestration, and evaluation practices
  • Working experience with one or more AI platforms like Amazon SageMaker, Google Vertex, AWS Bedrock etc.
  • Experience with elastic search and data streaming tools like Kafka
  • Good understanding of web services and SOA related standards like REST/OAuth/JSON
  • Moderate understanding of code and script (Python, Bash)
  • Good experience with SQL and NoSQL databases
Job Responsibility
Job Responsibility
  • Lead the design and development of Java/Python microservices and shared libraries integrating with AI platforms for OneTrust's AI Governance product
  • Design, build, and test cloud-native applications deployed on Microsoft Azure using Core Java, REST, and the Spring ecosystem
  • Build features with RAG, agent workflows, and model orchestration
  • Own technical design for critical systems, ensuring scalability, security, and reliability
  • Maintain strong automated unit/integration test coverage and engineering standards
  • Work closely with UX, Product Managers and/or Product Owners, as well as other developers to contribute to planning and grooming sessions and drive team's discussions on system architecture and component design
  • Partner with Product, UX, and Customer Success to understand customer AI use cases and governance needs
  • Lead architecture discussions and technical planning
  • Drive alignment across teams on platform design and priorities
  • Contribute to sprint planning and delivery
What we offer
What we offer
  • Comprehensive healthcare coverage
  • flexible PTO
  • equity RSUs
  • annual performance bonus opportunities
  • retirement account support
  • 14+ weeks of paid parental leave
  • career development opportunities
  • company-paid privacy certification exam fees
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer - AI Governance

As a Senior Principal Software Engineer, you will serve as a technical leader fo...
Location
Location
United States , Atlanta
Salary
Salary:
165000.00 - 220000.00 USD / Year
onetrust.com Logo
OneTrust
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master's degree in computer science, Engineering, or related technical or business field
  • 12+ years of professional software engineering/development experience
  • Strong expertise in Java/J2EE, Spring, design patterns, microservices architecture, and cloud-native distributed systems
  • Proven experience building production-grade agentic AI systems with robust tool-use, guardrails, and observability for enterprise-scale workloads
  • Solid understanding of RAG pipelines, agent workflows, model orchestration, and evaluation practices
  • Working experience with one or more AI platforms like Amazon SageMaker, Google Vertex, AWS Bedrock etc.
  • Experience with elastic search and data streaming tools like Kafka
  • Good understanding of web services and SOA related standards like REST/OAuth/JSON
  • Moderate understanding of code and script (Python, Bash)
  • Good experience with SQL and NoSQL databases
Job Responsibility
Job Responsibility
  • Lead the design and development of Java/Python microservices and shared libraries integrating with AI platforms for OneTrust’s AI Governance product
  • Design, build, and test cloud-native applications deployed on Microsoft Azure using Core Java, REST, and the Spring ecosystem
  • Build features with RAG, agent workflows, and model orchestration
  • Own technical design for critical systems, ensuring scalability, security, and reliability
  • Maintain strong automated unit/integration test coverage and engineering standards
  • Work closely with UX, Product Managers and/or Product Owners, as well as other developers to contribute to planning and grooming sessions and drive team’s discussions on system architecture and component design
  • Partner with Product, UX, and Customer Success to understand customer AI use cases and governance needs
  • Lead architecture discussions and technical planning
  • Drive alignment across teams on platform design and priorities
  • Contribute to sprint planning and delivery
What we offer
What we offer
  • Comprehensive healthcare coverage
  • Flexible PTO
  • Equity RSUs
  • Annual performance bonus opportunities
  • Retirement account support
  • 14+ weeks of paid parental leave
  • Career development opportunities
  • Company-paid privacy certification exam fees
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Managed AI - AI model LifeCycle

The Senior Software Engineer for the Model LifeCycle team will contribute to bui...
Location
Location
United States , San Francisco
Salary
Salary:
172425.00 - 209000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Experience delivering production-ready features
  • Familiarity with essential cloud-based services (e.g., compute, storage, networking)
  • Familiarity with Generative AI (Large Language Models, Multimodal)
  • Experience with AI infrastructure components (training, inference)
  • 4-5+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
Job Responsibility
Job Responsibility
  • Implement and maintain systems for fine-tuning large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling
  • Implement and maintain end-to-end training pipelines for Large Language Models
  • Implement components for distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling)
  • Develop and maintain core agent execution infrastructure
  • Implement features for dataset, model, and experiment management, focusing on versioning, lineage, evaluation, and reproducible fine-tuning
  • Work closely with Senior Engineers and Principal Engineers, as well as product and platform teams, to implement system abstractions and APIs
  • Contribute to technical discussions on training runtimes, scheduling, storage, and model lifecycle management
  • Engage with the open-source LLM ecosystem
What we offer
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer

The Senior/Principal Software Engineer provides architectural depth, delivery ex...
Location
Location
United States
Salary
Salary:
138568.00 - 221709.00 USD / Year
onecallcm.com Logo
One Call
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years building enterprise .NET applications in C# (.NET Framework + .NET Core)
  • Experience modernizing large legacy .NET codebases (service extraction, modularization, domain consolidation)
  • Familiarity with Azure cloud services, logging, monitoring, and observability patterns
  • Experience in healthcare, workers’ compensation, claims processing, or regulated industries
  • Experience with distributed queues, messaging patterns, event-driven architecture
  • Experience maintaining VB.NET codebases
  • Deep experience with ASP.NET MVC, Web API, WCF, Web Forms, WinForms
  • Strong proficiency in SQL Server (T-SQL, performance tuning, SSIS, SSRS)
  • Strong front-end fundamentals in JavaScript, TypeScript, HTML, CSS/SASS
  • Familiarity with jQuery, Knockout, and Kendo UI
Job Responsibility
Job Responsibility
  • Own and drive architecture and design discussions across multiple applications and services
  • Provide pragmatic, hands-on guidance to teams working across .NET, SQL, and mixed front-end stacks
  • Evaluate legacy architectures and propose modernization patterns (service boundaries, APIs, refactors, consolidation)
  • Facilitate technical decision-making with crisp rationales, tradeoff analysis, and clear documentation
  • Champion engineering excellence: code quality, testing, CI/CD maturity, observability, and maintainability
  • Unblock teams quickly by diagnosing issues across the entire stack (app, API, data, infrastructure)
  • Collaborate with Product, QA, Design, and other engineering teams to ensure predictable delivery
  • Translate complex requirements into executable technical plans, work breakdowns, and sequencing
  • Identify risks early and recommend options that balance delivery velocity with long-term stability
  • Contribute high-quality code in .NET (.NET Framework, .NET Core), C#, and VB.NET where necessary
What we offer
What we offer
  • Remote Work: We are a remote-first company, and almost all positions receive the flexibility of working from home
  • Generous Time Off: Besides 8 company holidays and 2 personal days every year, all colleagues receive a minimum of 18 days of paid time off
  • Comprehensive Benefits Package: Including medical, dental, vision, and pet insurance
  • 401(k) matching program
  • and company-paid life insurance and short and long-term disability coverage
  • Supportive Services: We offer a Colleague Assistance Program that provides free counseling and financial services, and our One Call Foundation, a non-profit arm of our company, provides colleagues financial assistance during times of unexpected hardships
  • Fulltime
Read More
Arrow Right

Senior Principal Software Engineer

Rapid7’s Exposure Management engineering team is seeking a Sr Principal Software...
Location
Location
United Kingdom , Belfast
Salary
Salary:
Not provided
rapid7.com Logo
Rapid7
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum of 12 years experience in software development using Java or other equivalent programming languages
  • Excited by technology, curious and eager to learn, with the ability to mentor more junior members of the team
  • Strong creative problem solving skills
  • Solid communicator with the ability to clearly convey objectives and rationale whilst fostering commitment from others both within the team and cross functionally
  • Brings a customer-first mindset to engineering, and integrates customer impact into everyday decision making
  • Technical Expertise: Proven track record of designing complex enterprise-class software at scale
  • experience with wide variety of programming languages, environments, and platforms
  • experience with high scale use of large language and machine learning models
  • Strategic Vision: Strong facility with following and synthesizing strategic trends in your industry, and dynamically setting course as required to ensure that the engineering team enables the company to respond to those trends and the resulting customer needs
  • Collaboration: Ability to engage closely with customers and to influence cross-functional product management, support, strategy and other areas to ensure alignment on the priorities of the engineering team to promote customer success
Job Responsibility
Job Responsibility
  • Collaborate to develop and drive the technical strategy for our unified Exposure Management offerings
  • Help formulate our cyber security data aggregation and correlation strategy at scale (millions of assets, 100’s of millions of findings, etc.)
  • Consult and help design secure, scalable, stable, and resilient mutli-tenant cloud applications and deployment patterns
  • Consult and partner closely with engineers across Rapid7 to ensure an aligned overarching platform strategy in support of all of our products
  • Engage directly with customers to understand their needs, incorporate their feedback, and remove points of friction from their product experience and use those experiences to reinforce a customer-first mentality throughout the organization
  • Help define, socialize, and enforce our SDLC best practices including nurturing the practical utilization of AI as a productivity tool and to enhance product features
  • Set the bar for operational excellence in engineering, supporting resilience, performance, and quality initiatives across the organization
  • Support the growth and skills development of technical team members across the organization to create a thriving culture for both individual contributors and team success
  • Bring an informed technical perspective to organizational roadmap planning and customer commitments
Read More
Arrow Right

Ai-first Core It Software Engineering: Software, Ml & Data

This is a Unified Application for our AI-First IT Transformation portfolio. We r...
Location
Location
United States , Santa Clara
Salary
Salary:
Not provided
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3-5+ years of experience in Software Engineering, Data Science, or Machine Learning (Staff level)
  • 6-8+ years (Senior Staff)
  • 8-12+ years (Principal level)
  • Expert-level server-side development (Python, Java, Go) OR deep expertise in statistical modeling, ML algorithms, and LLM fine-tuning
  • Direct experience with RAG architectures, LLM APIs, and Vector Databases (e.g., Pinecone, Milvus)
  • Hands-on experience with Kubernetes, CI/CD, and distributed systems for large-scale AI deployment
Job Responsibility
Job Responsibility
  • Lead the hands-on development of core Enterprise IT Business software leveraging AI components and LLM infrastructure with both traditional and Generative AI model deployment
  • Build and industrialize agentic AI systems and multi-agent frameworks, ensuring secure and effective use of GenAI technologies at the platform level
  • Design and implement robust foundational data pipelines, perform advanced statistical analysis, and develop new ML models to drive autonomous system behavior
  • Design large-scale, distributed AI/ML systems optimized for low latency, high throughput, and developer-friendliness (Inference optimization)
  • Establish evaluation frameworks to measure AI quality (accuracy, hallucination rates) and overall system reliability across the Enterprise AI Factory
  • Fulltime
Read More
Arrow Right

Senior Principal AI Infrastructure Architect

The Senior Principal AI Infrastructure Architect is a highly skilled and advance...
Location
Location
Italy , Milano
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant experience in a consulting, presales or architecture role within a large-scale (preferably multi-national) technology services environment, with a track record of leading AI infrastructure pursuits
  • Demonstrable experience designing and delivering production AI platforms — from single multi-GPU servers through to multi-rack training clusters and inference factories
  • Strong working knowledge of the AI hardware vendor landscape (NVIDIA, AMD, Intel, Dell, HPE, Lenovo, Supermicro, Cisco, Pure, VAST, WEKA, DDN, NetApp) and how to position partner ecosystems competitively
  • Proven ability to translate AI workload requirements (model size, parameter count, sequence length, throughput SLOs, latency targets) into accurate hardware bills of materials and sizing justifications
  • Significant client engagement and consulting experience, including client needs assessment, change management and the ability to identify whitespace for follow-on AI infrastructure and managed-services work
  • Significant business development and presales experience on infrastructure-led deals, ideally including sovereign AI, AI Factory or regulated-industry GenAI programmes
  • Strong understanding of how AI infrastructure integrates with business processes, applications, data platforms and existing enterprise architecture
  • Bachelor's degree or equivalent in Information Technology, Engineering, Computer Science or a related field
  • Deep, hands-on knowledge of AI hardware: GPU and accelerator portfolios (NVIDIA Hopper / Blackwell, AMD MI300/MI325, Intel Gaudi 3, emerging custom silicon), host CPU platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), system topologies (HGX, DGX, MGX, OAM) and how each choice maps to specific AI workloads
  • Strong understanding of AI-class storage: parallel filesystems, all-flash NVMe platforms, S3-class object stores, checkpoint and dataset pipelines and the I/O patterns of large-scale training and inference (VAST, WEKA, DDN EXAScaler, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
Job Responsibility
Job Responsibility
  • Lead the end-to-end design of large, complex AI infrastructure solutions — covering accelerated compute (NVIDIA H100/H200/B200 and GB200 NVL72, AMD Instinct MI300X/MI325X, Intel Gaudi 3), CPU host platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), high-throughput storage tiers and lossless AI fabric — for enterprise, sovereign AI and AI Factory clients
  • Architect reference designs built on NVIDIA DGX/HGX SuperPOD, Dell AI Factory with NVIDIA, Cisco Nexus HyperFabric AI, HPE / Lenovo / Supermicro accelerated compute and equivalent platforms, balancing single-node performance with cluster-scale efficiency
  • Size and validate GPU clusters against real workloads — foundation-model pre-training, distributed fine-tuning, RAG, real-time and batch inference — using the right combination of NVLink/NVSwitch domains, InfiniBand NDR/XDR or Ultra Ethernet / NVIDIA Spectrum-X fabrics and tiered NVMe and parallel storage (VAST, WEKA, DDN, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
  • Define the supporting datacenter design: high-density power (50–140 kW/rack), direct-to-chip and rear-door liquid cooling, structured cabling for AI fabrics and modular deployment models across on-prem, colo and sovereign-cloud footprints
  • Work closely with the sales team to drive the presales process for AI infrastructure pursuits — client discovery, technical workshops, proposal writing, executive presentations and bid defence
  • Translate clients' AI ambitions and business outcomes into a hardware and platform roadmap, positioning NTT DATA's end-to-end portfolio — silicon, systems, storage, fabric, MLOps stack and managed services — to land service-led AI solutions
  • Lead integration of compute, storage, networking, the AI software stack (CUDA, ROCm, Triton, NIM, NVIDIA AI Enterprise, Run:ai, Slurm, Kubernetes / Kubeflow) and managed-service operating models across multiple domains, delivery units and geographies
  • Build business cases, TCO and unit-economics models (cost per token, cost per training run, GPU-hour economics) and end-to-end transition roadmaps for cloud-to-private AI migrations and sovereign AI deployments
  • Define architectural principles for AI infrastructure — accelerator utilisation, data gravity, multi-tenancy, model lifecycle, energy efficiency — and apply them to influence architectural outcomes and governance
  • Develop As-Is, Vision, FMO and To-Be AI platform architectures, identify gaps and develop transition roadmaps
  • Fulltime
Read More
Arrow Right