CrawlJobs Logo

Senior Business Manager - AI Infrastructure

United States, Redmond 116900.00 - 203600.00 USD / Year · Job Posted June 09, 2026
Apply Position
Job Link Share

Job Description

The AI Infrastructure team builds systems that turn hardware and AI models into working software that runs at scale across Microsoft. We design and develop the platforms behind AI features in Microsoft products and for external customers who rely on those capabilities. We also manage the underlying compute capacity and use real production data to continuously improve reliability, efficiency, and cost over time. This enables product teams and partners to deliver AI solutions more quickly and effectively by turning AI infrastructure into usable capabilities they can build on, scale, and improve over time. We are seeking a Senior Business Manager - AI Infrastructure to drive financial management, workforce planning, and operational rigor for a rapidly growing organization. You’ll partner closely with the Chief of Staff, Finance, HR, and Recruiting to make sense of a complex organization. Your job is to bring clarity to our financials and headcount, help leaders make smart trade-offs, and build the systems and reporting that keep everything running smoothly as we grow. This role is critical to enabling the organization to scale effectively and goes beyond tracking numbers - you’ll help shape how we grow. Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Job Responsibility

  • Own end-to-end financial tracking including spend, forecasts, and variance analysis. Deliver accurate, timely, and actionable reporting that supports leadership decision-making
  • Establish and maintain strong financial discipline, including data accuracy, consistent processes, and adherence to reporting standards
  • Lead headcount planning, hiring pacing, and Position Control Number (PCN) management. Support trade-off decisions aligned to organizational priorities and financial targets. Plan and support Early in Profession (EIP) hiring across the team
  • Design, build, and continuously improve reporting and tooling to move from manual processes to scalable, automated, and reliable systems
  • Translate data into clear insights, identifying risks and opportunities and providing recommendations to leadership
  • Partner with Finance, HR, Recruiting, and business leaders to align financials, workforce, and business priorities

Requirements

  • Bachelor's Degree in relevant field (e.g., Liberal Arts, Business Administration, Management, Computer Science) AND 6+ years experience in financial management, business planning, operations management, strategy, project management, human resources, or business-related roles OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:  Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Nice to have

  • Experience in financial management, budgeting, and forecasting
  • Experience with workforce planning and headcount management (including PCN management)
  • Proven ability to build and maintain reporting and analytics tools (e.g., Excel, Power BI, or similar)
  • Strong cross-functional collaboration with Finance, HR, and Recruiting
  • Ability to manage complexity and operate effectively in a fast-paced environment

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Business Manager - AI Infrastructure

8 matching positions

Senior Manager, AI & ML Engineering

You will be a leader in Alter Domus' AI & ML engineering organization driving en...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
alterdomus.com Logo
Alter Domus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in software engineering, data engineering, or AI/ML engineering roles
  • 5+ years managing technical teams, including experience managing managers
  • Deep expertise with Python, modern ML frameworks (PyTorch, TensorFlow), and data science ecosystems
  • Production experience with LLM APIs (OpenAI, Anthropic, AWS Bedrock) and agentic frameworks (LangChain, LangGraph etc)
  • Strong understanding of vector databases, RAG and GraphRAG architectures, and semantic search
  • Experience with cloud platforms (AWS, Azure or GCP), microservices, APIs, and containerization (Docker/Kubernetes)
  • Proven track record delivering enterprise AI/ML solutions at scale
  • Experience with portfolio management, budget ownership, and ROI tracking and OKRs
Job Responsibility
Job Responsibility
  • Help define and drive the multi-year technical roadmap for Alter Domus' AI platforms
  • Make critical build-buy-partner decisions for AI/ML capabilities
  • Sponsor platform modernization programs and guide the evolution from proof-of-concept to production-scale AI solutions
  • Drive measurable and accurate business outcomes through AI adoption
  • Lead the technical architecture for RAG and GraphRAG pipelines, multi-agent orchestration, and LLM based documents processing technologies
  • Manage portfolio resource mix and capital allocation
  • Collaborate with business leaders to define product strategy
  • Monitor AI/ML market trends, competitor capabilities, and emerging technologies
  • Build business cases with clear ROI tracking
  • Design sustainable cost structure for the AI/ML portfolio
What we offer
What we offer
  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, plus an additional day off for your birthday
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • 24/7 support available from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location
Read More
Arrow Right

Senior Manager, AI & ML Engineering

You will be a leader in Alter Domus' AI & ML engineering organization driving en...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
alterdomus.com Logo
Alter Domus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of experience in software engineering, data engineering, or AI/ML engineering roles
  • 5+ years managing technical teams, including experience managing managers
  • Deep expertise with Python, modern ML frameworks (PyTorch, TensorFlow), and data science ecosystems
  • Production experience with LLM APIs (OpenAI, Anthropic, AWS Bedrock) and agentic frameworks (LangChain, LangGraph etc)
  • Strong understanding of vector databases, RAG and GraphRAG architectures, and semantic search
  • Experience with cloud platforms (AWS, Azure or GCP), microservices, APIs, and containerization (Docker/Kubernetes)
  • Proven track record delivering enterprise AI/ML solutions at scale
  • Experience with portfolio management, budget ownership, and ROI tracking and OKRs
Job Responsibility
Job Responsibility
  • Help define and drive the multi-year technical roadmap for Alter Domus' AI platforms
  • Make critical build-buy-partner decisions for AI/ML capabilities
  • Sponsor platform modernization programs and guide the evolution from proof-of-concept to production-scale AI solutions
  • Drive measurable and accurate business outcomes through AI adoption
  • Lead the technical architecture for RAG and GraphRAG pipelines, multi-agent orchestration, and LLM based documents processing technologies
  • Manage portfolio resource mix and capital allocation
  • Collaborate with business leaders to define product strategy
  • Monitor AI/ML market trends, competitor capabilities, and emerging technologies
  • Build business cases with clear ROI tracking
  • Design sustainable cost structure for the AI/ML portfolio
What we offer
What we offer
  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, plus an additional day off for your birthday
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • 24/7 support available from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location
  • Fulltime
Read More
Arrow Right

Senior Manager, AI Engineering

By leading the strategic adoption and scaling of AI across the organisation this...
Location
Location
United Kingdom , London OR Newbury
Salary
Salary:
Not provided
vodafone.com Logo
Vodafone
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in AI strategy, delivery, and enablement
  • Strong understanding of GenAI, ML Ops, and AI governance
  • Familiarity with infrastructure provisioning and model lifecycle
  • Ability to influence cross-functional teams and stakeholders
  • Experience in training, consulting, and change management
  • Knowledge of privacy, security, and ethical AI practices
Job Responsibility
Job Responsibility
  • Define and deliver the AI strategy and roadmap
  • Build and maintain self-service AI environments and infrastructure
  • Implement use cases to demonstrate business value
  • Operate and monitor AI models for accuracy and performance
  • Collaborate with architecture, governance, and security teams
  • Establish best practice and enable reuse across solutions
  • Drive AI enablement through training and consulting
  • Evangelise AI adoption across internal and customer-facing teams
  • Monitor industry trends and pilot emerging opportunities
  • Measure and report on efficiency gains and impact
What we offer
What we offer
  • Great pay, bonuses, up to 28 days off plus bank holidays, and paid time for charity work
  • Personalise benefits for you and your family, like discounts, vouchers, a pension plan and loads more
  • Amazing learning tools and top-notch parental leave policies
  • Fulltime
Read More
Arrow Right

Senior Manager, AI Platform Engineering

Socure is building the identity trust infrastructure for the digital economy — v...
Location
Location
United States
Salary
Salary:
190000.00 - 210000.00 USD / Year
socure.com Logo
Socure
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional software engineering experience, including time spent building or operating large-scale ML, data, or distributed systems platforms
  • 3+ years of engineering leadership experience managing multiple teams or engineering managers
  • Strong technical background in ML infrastructure, data engineering, and/or cloud-native distributed systems
  • Demonstrated experience delivering complex, cross-functional platform initiatives
  • Excellent communication and stakeholder management skills, with the ability to translate between technical detail and business priorities
  • Experience working in fast-paced, iterative environments using modern development practices
Job Responsibility
Job Responsibility
  • Develop and own the roadmap for Socure’s AI/ML platform, including data and feature engineering workflows, training infrastructure, experimentation tooling, model deployment/serving, monitoring, and governance
  • Define architecture and standards that create clear, scalable, and secure paths for building and operating AI systems
  • Assess technology options and drive consolidation across the company to reduce fragmentation and improve consistency across the ML toolchain
  • Partner with Data Science, Engineering, Product, and Sales-Enablement teams to develop AI infrastructure that delights Customers
  • Lead the design and operation of the end-to-end ML lifecycle: data ingestion, feature engineering, experimentation, training, model registry, deployment, and continuous monitoring
  • Partner closely with Data Science to enable fast, reproducible experimentation and reduce operational friction
  • Ensure the platform delivers reliability, traceability, observability, and performance for both batch and real-time model workloads
  • Guide the team to deliver high-quality platform capabilities with predictable timelines and strong technical rigor
  • Remove cross-team bottlenecks, align dependencies, and ensure seamless execution across Data, Infrastructure, and Product
  • Establish SLAs, operational standards, and production-readiness guidelines for ML pipelines and serving systems
What we offer
What we offer
  • Offers Equity
  • Offers Bonus
  • benefits
  • Fulltime
Read More
Arrow Right

Senior Manager, AI Strategy & Operations

We’re building the next chapter of Talkiatry – one where AI reimagines how patie...
Location
Location
United States , New York
Salary
Salary:
130000.00 - 160000.00 USD / Year
talkiatry.com Logo
Talkiatry
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in product management, process automation, or technology operations
  • Hands-on experience applying AI or large language model (LLM) solutions to real-world workflows
  • Proven success delivering large-scale AI or automation projects from concept through deployment
  • Strong understanding of conversational AI platforms and voice/text agent design principles
  • Systems thinking to architect how AI connects across data, telephony, CRM, and workflow tools
  • Highly analytical with experience in product and operational analytics
  • Skilled at translating business and user requirements into technical specification
  • Experience in product management, technical program management, or systems enablement preferred
  • Exceptional project management and vendor management capabilities
  • Thrives in cross-functional environments
Job Responsibility
Job Responsibility
  • Lead the design and deployment of AI solutions
  • Partner with Product and Engineering leadership to define Talkiatry’s long-term AI infrastructure strategy
  • Gather and translate patient needs and agent workflows into AI product requirements
  • Evaluate emerging technologies to guide the roadmap for AI tooling, model management, and automation pipelines
  • Champion responsible AI principles in vendor selection, governance, and deployment practices
  • Own day-to-day AI performance through platform dashboards
  • Conduct root-cause analyses on performance deviations
  • Surface performance insights to the Quality & Training team
  • Maintain governance documentation for model versions, updates, and validation
  • Oversee the AI-facing knowledge infrastructure
What we offer
What we offer
  • Medical, dental, vision, effective day 1 of employment
  • 401K with match
  • Generous PTO plus paid holidays
  • Paid parental leave
  • Fulltime
Read More
Arrow Right

Senior Principal AI Infrastructure Architect

The Senior Principal AI Infrastructure Architect is a highly skilled and advance...
Location
Location
Italy , Milano
Salary
Salary:
Not provided
nttdata.com Logo
NTT DATA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Significant experience in a consulting, presales or architecture role within a large-scale (preferably multi-national) technology services environment, with a track record of leading AI infrastructure pursuits
  • Demonstrable experience designing and delivering production AI platforms — from single multi-GPU servers through to multi-rack training clusters and inference factories
  • Strong working knowledge of the AI hardware vendor landscape (NVIDIA, AMD, Intel, Dell, HPE, Lenovo, Supermicro, Cisco, Pure, VAST, WEKA, DDN, NetApp) and how to position partner ecosystems competitively
  • Proven ability to translate AI workload requirements (model size, parameter count, sequence length, throughput SLOs, latency targets) into accurate hardware bills of materials and sizing justifications
  • Significant client engagement and consulting experience, including client needs assessment, change management and the ability to identify whitespace for follow-on AI infrastructure and managed-services work
  • Significant business development and presales experience on infrastructure-led deals, ideally including sovereign AI, AI Factory or regulated-industry GenAI programmes
  • Strong understanding of how AI infrastructure integrates with business processes, applications, data platforms and existing enterprise architecture
  • Bachelor's degree or equivalent in Information Technology, Engineering, Computer Science or a related field
  • Deep, hands-on knowledge of AI hardware: GPU and accelerator portfolios (NVIDIA Hopper / Blackwell, AMD MI300/MI325, Intel Gaudi 3, emerging custom silicon), host CPU platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), system topologies (HGX, DGX, MGX, OAM) and how each choice maps to specific AI workloads
  • Strong understanding of AI-class storage: parallel filesystems, all-flash NVMe platforms, S3-class object stores, checkpoint and dataset pipelines and the I/O patterns of large-scale training and inference (VAST, WEKA, DDN EXAScaler, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
Job Responsibility
Job Responsibility
  • Lead the end-to-end design of large, complex AI infrastructure solutions — covering accelerated compute (NVIDIA H100/H200/B200 and GB200 NVL72, AMD Instinct MI300X/MI325X, Intel Gaudi 3), CPU host platforms (Intel Xeon, AMD EPYC, NVIDIA Grace), high-throughput storage tiers and lossless AI fabric — for enterprise, sovereign AI and AI Factory clients
  • Architect reference designs built on NVIDIA DGX/HGX SuperPOD, Dell AI Factory with NVIDIA, Cisco Nexus HyperFabric AI, HPE / Lenovo / Supermicro accelerated compute and equivalent platforms, balancing single-node performance with cluster-scale efficiency
  • Size and validate GPU clusters against real workloads — foundation-model pre-training, distributed fine-tuning, RAG, real-time and batch inference — using the right combination of NVLink/NVSwitch domains, InfiniBand NDR/XDR or Ultra Ethernet / NVIDIA Spectrum-X fabrics and tiered NVMe and parallel storage (VAST, WEKA, DDN, Pure FlashBlade, NetApp ONTAP AI, Dell PowerScale)
  • Define the supporting datacenter design: high-density power (50–140 kW/rack), direct-to-chip and rear-door liquid cooling, structured cabling for AI fabrics and modular deployment models across on-prem, colo and sovereign-cloud footprints
  • Work closely with the sales team to drive the presales process for AI infrastructure pursuits — client discovery, technical workshops, proposal writing, executive presentations and bid defence
  • Translate clients' AI ambitions and business outcomes into a hardware and platform roadmap, positioning NTT DATA's end-to-end portfolio — silicon, systems, storage, fabric, MLOps stack and managed services — to land service-led AI solutions
  • Lead integration of compute, storage, networking, the AI software stack (CUDA, ROCm, Triton, NIM, NVIDIA AI Enterprise, Run:ai, Slurm, Kubernetes / Kubeflow) and managed-service operating models across multiple domains, delivery units and geographies
  • Build business cases, TCO and unit-economics models (cost per token, cost per training run, GPU-hour economics) and end-to-end transition roadmaps for cloud-to-private AI migrations and sovereign AI deployments
  • Define architectural principles for AI infrastructure — accelerator utilisation, data gravity, multi-tenancy, model lifecycle, energy efficiency — and apply them to influence architectural outcomes and governance
  • Develop As-Is, Vision, FMO and To-Be AI platform architectures, identify gaps and develop transition roadmaps
  • Fulltime
Read More
Arrow Right

Senior Product Manager, AI & ML Platform

Reporting to the Senior Director of Product, Customer Value, you will be instrum...
Location
Location
United States , New York; San Francisco
Salary
Salary:
150000.00 - 202400.00 USD / Year
springhealth.com Logo
Spring Health
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of Product Management experience, including ownership of highly technical platforms, infrastructure, or developer-facing products
  • Demonstrated experience partnering deeply with Engineering to ship scalable, reliable systems in production environments
  • Strong understanding of AI/ML systems and workflows (e.g., model lifecycle, evaluation, deployment, observability), with the ability to translate complex technical concepts into clear product direction
  • Track record of driving roadmap clarity in ambiguous, fast-moving spaces and delivering measurable outcomes
  • Experience running continuous discovery with internal users and translating feedback into high-impact platform improvements
  • Ability to define clear success metrics, service standards, and rollout strategies that drive adoption
  • Excellent cross-functional communication skills, with the ability to influence senior technical stakeholders without formal authority
  • Prior machine learning engineering experience is preferred but not required
Job Responsibility
Job Responsibility
  • Shape and drive the AI/ML platform strategy and roadmap
  • Lead continuous discovery with feature teams
  • Align platform investments to business impact
  • Define platform solutions in close partnership with Engineering leadership
What we offer
What we offer
  • Health, Dental, Vision benefits start on your first day
  • Access to One Medical accounts
  • HSA and FSA plans with Spring contributing up to $1K for HSAs
  • Employer sponsored 401(k) match of up to 2%
  • Yearly allotment of no cost visits to Spring Health network
  • Competitive paid time off policies including vacation, sick leave and company holidays
  • Parental leave of 18 weeks for birthing parents and 16 weeks for non-birthing parents
  • Access to Noom
  • Access to fertility care support through Carrot
  • $4,000 reimbursement for related fertility expenses
  • Fulltime
Read More
Arrow Right

Senior Finance Manager - AI Group

WHAT YOU DO AT AMD CHANGES EVERYTHING  At AMD, our mission is to build great pro...
Location
Location
United States , Austin
Salary
Salary:
161920.00 - 242880.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Self-starter with excellent interpersonal communication, teaming, and problem-solving skills
  • Progressive experience in Finance, including significant exposure to BU, platform, or P&L Management roles
  • Experience supporting infrastructure, platform, or engineering-heavy organizations (semiconductor, datacenter or hardware environments preferred)
  • Strong financial modeling, forecasting, and analytical skills
  • Exceptional communication, presentation, and interpersonal skills, with the ability to articulate concepts clearly and concisely to diverse audiences
  • Self-motivated, intellectually curious, and comfortable with ambiguity, possessing a proactive approach to problem-solving
  • Proven ability to work effectively in a fast-paced, high-growth, and challenging environment
  • Robust analytical skills required and the ability to turn complex data and concepts into valuable information to present in a simple/understandable manner
  • Ability to work in teams and with all levels of an organization as well as work independently
  • Proficiency in Excel, AI tools, MSFT Co-Pilot, and ERP systems experience (SAP, SAP HANA)
Job Responsibility
Job Responsibility
  • Serve as the Finance Lead for Datacenter business unit program financials
  • Directly support the business unit Engineering leadership team, deliver data-driven insights, financial guidance, and decision support
  • Engage/Lead core finance processes including Life of Program planning, Annual Operating Plans, and intra-quarter Outlooks
  • Collaborate with engineering, program management, procurement and operations to align financial plans with platform roadmaps and execution priorities
  • Support investment planning, including capacity, cost optimization, and ROI analysis for datacenter platforms
  • Prepare and disseminate timely and accurate financial information to allow the business to plan, forecast, and make decisions using controlled and consistent data
  • Drive budgeting, forecasting, and variance analysis
  • communicate financial performance and risks to senior leadership and corporate finance
  • Monitor operating expenses and capital-related investments to ensure efficient resource allocation and alignment with strategic priorities
  • Develop long-range financial planning and scenario analysis to guide strategic decisions
  • Fulltime
Read More
Arrow Right