CrawlJobs Logo

Datacenter Program Manager New Product Integration

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , West Des Moines

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

102600.00 - 202800.00 USD / Year

Job Description:

As a Datacenter Program Manager New Product Integration, you will lead complex infrastructure integration programs that enable the deployment of new hardware and technologies into live datacenter environments. You will operate with broad autonomy, driving end‑to‑end integration readiness across the datacenter metro, working with engineering, deployment, operations, and partner teams to ensure solutions meet technical, operational, safety, and business requirements. This role is an individual contributor position with significant scope and influence across multiple teams and campuses in the metro.

Job Responsibility:

  • Drive the integration of new hardware and complex systems into mission‑critical datacenter environments, from requirements assessment through execution and operational readiness
  • Assess existing datacenter infrastructure and component dependencies to determine integration requirements for new technology deployments
  • Define integration strategies, methods, sequencing, and readiness criteria aligned with DCO deployment and change governance expectations
  • Leverage existing infrastructure, platforms, and standard solutions to reduce cost, minimize operational risk, and improve delivery efficiency
  • Own integration planning artifacts, including dependency tracking, execution plans, and milestone alignment
  • Partner with engineering, deployment, operations, and vendor teams to troubleshoot and resolve issues encountered during new integrations
  • Validate that completed integrations meet defined technical, operational, safety, and business success criteria prior to operational handoff
  • Identify integration risks and constraints early and drive mitigation plans across stakeholders
  • Enable operational readiness by educating DCO teams and partners on system and hardware integration procedures
  • Create and maintain high‑quality technical and program documentation to support execution, auditability, and long‑term sustainment.

Requirements:

  • High School Qualification or equivalent AND 3+ years experience supporting IT equipment or related technology or delivering server and network deployment projects in large-scale environments
  • OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Nice to have:

  • Proven understanding of datacenter networking and server infrastructure, including high‑availability and mission‑critical considerations
  • Proven experience working in large‑scale enterprise or hyperscale datacenter environments leading infrastructure, hardware, or systems integration programs
  • Familiarity with datacenter operations deployment, change management, and operational readiness processes
  • Demonstrated ability to interpret architectural designs, hardware implementation blueprints, and technical documentation
  • Experience managing multiple concurrent integration efforts with competing priorities
  • Demonstrated ability to operate effectively in ambiguous environments and drive clarity through structured program management
  • Verbal communication skills with the ability to influence and align cross‑functional technical and operational stakeholders
  • Written communication skills, including producing clear technical documentation and program deliverables.

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Datacenter Program Manager New Product Integration

Senior Technical Program Manager

Microsoft’s Cloud Operations & Innovation (CO+I) organization powers the infrast...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree AND 4+ years experience in engineering, product/technical program management, data analysis, or product development OR equivalent experience
  • 2+ years of experience managing cross-functional and/or cross-team projects
Job Responsibility
Job Responsibility
  • Lead delivery of RADAR’s mission by implementing and scaling sensor‑health detection, alerting, and triage capabilities across Microsoft datacenters, ensuring high‑quality signal visibility and reliable operational outcomes
  • Design and operationalize core workflows for sensor‑health detection, alert routing, validation, and triage, partnering closely with upstream telemetry systems and downstream incident‑response teams
  • Drive cross‑team orchestration by creating and strengthening relationships across engineering, hardware, operations, and service teams to integrate and execute multi‑feature scenarios and platform capabilities
  • Build and manage onboarding processes for new telemetry types and detection scenarios, including requirements templates, validation criteria, handoff procedures, and governance frameworks
  • Champion Process Excellence by maturing workflows, training partners, and driving adoption of consistent operating models for new signals, anomaly detection patterns, and incident‑response processes
  • Lead partner alignment and influence to shape and deliver shared roadmaps across divisional boundaries, ensuring detection, alerting, and observability capabilities evolve cohesively
  • Identify gaps and opportunities through structured feedback loops
  • synthesize insights into clear problem statements, repeatable patterns, and actionable guidance for leadership and engineering stakeholders
  • Manage schedules and execution across epics, sprints, semester plans, and releases, tracking dependencies, anticipating risks, and driving cohesive delivery across partner teams
  • Produce clear technical documentation including specifications, decision records, runbooks, and operational procedures to support partner readiness and consistent implementation
  • Fulltime
Read More
Arrow Right

Director, Technical Program Management — Global Cluster Engineering

AMD’s Global Cluster Engineering (GCE) team designs, validates, and deploys larg...
Location
Location
United States , Seattle, Washington or Austin, Texas
Salary
Salary:
224640.00 - 336960.00 USD / Year
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of experience in technical program management, engineering program leadership, infrastructure delivery, or adjacent roles
  • Proven track record delivering large-scale infrastructure programs (datacenter, cloud, AI/HPC clusters, platforms, or complex hardware/software systems)
  • Demonstrated experience partnering with supply chain organizations (procurement, sourcing, planning, manufacturing, logistics) and managing long-lead constraints and supplier dependencies
  • Strong program fundamentals: scope definition, critical path, integrated schedules, RAID management, executive communications, and stakeholder alignment in a matrix environment
  • Comfort with technical depth across compute platforms, networking/storage concepts, and operational tooling—enough to drive decisions and resolve ambiguity
  • Undergraduate degree is preferred
  • Applied Science Degree, PMP, and/or MBA are desired
Job Responsibility
Job Responsibility
  • Own a multi-year program portfolio for global cluster initiatives (new cluster builds, cluster validation and operational excellence), including critical milestones, dependencies, risk management, and executive reporting
  • Establish program governance (operating rhythms, QBRs, escalation paths, decision logs) across engineering, operations, finance, procurement, and suppliers
  • Lead end-to-end supply chain planning and execution for cluster infrastructure: server/GPU platforms, networking, storage, racks, power/cooling, spares, and long-lead components
  • Drive build readiness and NPI-style execution: BOM maturity, lead-time management, contract manufacturer alignment, and deployment sequencing
  • Partner with sourcing/procurement to optimize cost, availability, and resiliency across suppliers, balancing time-to-deploy with design and qualification constraints
  • Build and scale supply chain product automation for cluster delivery: forecasting, allocation, inventory visibility, exception management, and ETA/lead-time prediction
  • Own “product-like” delivery of internal platforms and tools (dashboards, APIs, workflow automation, digital-twin planning models) that improve supply chain decisions and reduce manual overhead
  • Define KPIs and data products for planning accuracy, schedule predictability, cost-to-serve, inventory health, and deployment velocity
  • Translate business and engineering objectives into executable program plans, including infrastructure requirements, capacity models, and deployment playbooks
  • Drive technical and operational trade-offs across performance, reliability, cost, availability, and schedule
Read More
Arrow Right

Senior Engineering Manager DevSecOps

AMD India (SPSE) is looking for a strong Manager to lead an Embedded Software De...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years (or more) of overall relevant industry experience and a track record of shipping server/storage/networking products for the enterprise, cloud data center and service provider markets
  • Prior experience in customer-facing / applications engineering role will be a big plus
  • At least 5 years of experience as a first/second-line manager leading the development of embedded software
  • Deep understanding of full product life cycle, software development methods (both Agile and Waterfall), and development and build environments
  • Ability to undertake loosely defined goals or complex problems to create order, and drive closure
  • Ability to organize, delegate, and effectively deliver to large and complex programs
  • Ability to drive multi-geo projects by working effectively with remote teams
  • Ability to thrive in fast-paced, highly dynamic environment, with a bias towards action and results
  • Manage major software release deliveries as a release manager
  • Conflict resolution skills including ability to bridge style difference
Job Responsibility
Job Responsibility
  • Lead an Embedded Software DevSecOps engineering team to lead and deliver modular, quality oriented, and extensible FW infrastructure
  • Managing resources effectively to deliver commitment on schedule
  • Cultivate a high performing team and constantly raise the bar
  • Closely collaborate with peer development teams, architecture, customer support and product line management
  • Contribute to the vision and strategy of continuous integration, improved development processes, quality and productivity improvements
  • Lead end-to-end DevOps programs from planning through execution, ensuring alignment with business goals
  • Design and implement secure CI/CD pipelines across cloud, hybrid, and on-prem environments
  • Review technical designs and pipeline code for security, scalability, and reliability
  • Establish monitoring and observability frameworks to track performance, adoption, and security posture
  • Identify and mitigate technical risks and process inefficiencies across teams
Read More
Arrow Right

Staff Thermal Attainment Engineer

Technical, hands-on engineer responsible for post-silicon thermal activities rel...
Location
Location
Malaysia , Penang
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree or higher in Electrical/Computer Engineering or Electronics / Mechanical Engineering related with 2-5 years of experience in SoC thermal validation and debug
  • Strong background in thermodynamics and heat transfer
  • Solid understanding of thermal management methodologies in datacenter products
  • Experience with power and thermal controllers and management
  • Experience developing validation methodologies and infrastructure
  • Test plan and test development experience
  • Participated in silicon bring up and debug, support to internal engineering teams
  • Debug skills at both GPU and system level
  • Familiarity with programming / scripting language (C/C++, Python, Perl, ...)
  • Working knowledge of Server OSes (Linux, Windows Server)
Job Responsibility
Job Responsibility
  • Learn and execute thermal attainment test plans in post-silicon time periods in support of Data Center GPU product roadmap
  • Investigating thermal management techniques through both hardware and firmware-based solutions
  • Actively participate in analysis of post silicon thermal and power data, ensure integrity of results and provide summary and conclusions of results
  • Hands-on experience to work locally or remotely with computers, systems or data center hardware for practical knowledge with hardware applicable to servers, data centers or thermal equipment as a means to accomplish thermal attainment work
  • Calibration of thermal sensors and working with other groups to correlate sensor accuracy across platforms
  • Support prototyping experiments for new GPU features that impact thermal and power characteristics
  • Work with cross-functional teams internally and externally to improve post-silicon validation test strategy, methodology, and process
  • Leading collaborative technical discussions to drive resolution on technical issues and roll out technical initiatives
  • Be able to work in a high demand, fast paced environment with lots of real-time problem solving and critical thinking
  • Fulltime
Read More
Arrow Right

Systems Software Engineer

The Crusoe Cloud Software Development team is seeking a passionate and experienc...
Location
Location
United States , San Francisco
Salary
Salary:
137000.00 - 161000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Linux Systems Familiarity: Experience building applications on Linux kernels, specifically pertaining to virtualization, device drivers, memory management, and process scheduling
  • Hardware Integration: Solid understanding of hardware devices such as GPUs, CPUs, Infiniband and Ethernet NICs, Ephemeral Disks, and PCI Express
  • Systems Design: Strong grasp of distributed applications and highly-scalable systems design. Specific focus around communications protocols (GRPC, REST, TCP/IP, etc.), databases (Postgres, Redis), and systems design applications (Pub/Sub, Kafka)
  • Software Architecture: Strong experience building software applications, both at the higher (Golang, Java, Python) and lower (C, C++, Rust) levels. Keen eye for clean, maintainable code, and a unit-test driven mindset
  • Excellent Communication Skills: Ability to collaborate with teams across an organization, blocking out noise, and focusing on what needs to get done to get a project across the line
  • Rapid and Agile Learner: Capable of adapting quickly, eager to research new technology and not get overwhelmed by unfamiliar tech stacks
  • Virtualization Concepts: General knowledge of hypervisors, virtual machine lifecycles, and Linux KVM tooling
  • CI/CD and Validation: Understanding of how to build Gitlab or Github CI/CD pipelines that deliver bug-free code across a multitude of compute platforms
Job Responsibility
Job Responsibility
  • Compute Application Development & Scaleout: Design highly reliable and performant Linux applications used to manage our virtualization stack across thousands of AI compute servers in multiple global datacenters
  • AI Hardware Platform Integration: Integrate Crusoe applications with a wide variety of hardware and software AI chip-vendor stacks. Build solutions to optimize and monitor virtualized hardware (GPUs, Infiniband/ROCe NICs, Ephemeral Storage, etc.) in cutting-edge AI/HPC environments
  • Kernel & Hypervisor Integration - Work side by side with our Linux Kernel and Hypervisor teams to ensure our Crusoe applications are seamlessly integrated with a variety of kernels and hypervisors
  • Performance Analysis & Tuning: Analyze and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a specific focus on optimizing AI/ML workloads. This includes profiling, bottleneck identification, and implementing low-level optimizations
  • System-Level Troubleshooting: Diagnose and resolve complex system issues across our virtualization stack (drivers, kernel, hypervisor, guest OS, and crusoe applications). Work closely with kernel and hypervisor teams to debug and resolve integration challenges
  • Code Review and Quality Assurance: Conduct thorough code reviews to ensure the highest level of software quality, reliability, and security within compute applications and virtualization stack
  • Cross-Functional Collaboration: Collaborate with other engineering teams, including hardware design, OS development, and AI/ML application teams, to ensure cohesive and integrated product development
  • Technical Leadership: Provide technical guidance and mentorship to junior engineers, fostering a culture of technical excellence and collaborative problem-solving within the compute applications team
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Senior Systems Software Engineer

The Crusoe Cloud Software Development team is seeking a passionate and experienc...
Location
Location
United States , San Francisco
Salary
Salary:
172000.00 - 209000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience building applications on Linux kernels, specifically pertaining to virtualization, device drivers, memory management, and process scheduling
  • Solid understanding of hardware devices such as GPUs, CPUs, Infiniband and Ethernet NICs, Ephemeral Disks, and PCI Express
  • Strong grasp of distributed applications and highly-scalable systems design. Specific focus around communications protocols (GRPC, REST, TCP/IP, etc.), databases (Postgres, Redis), and systems design applications (Pub/Sub, Kafka)
  • Strong experience building software applications, both at the higher (Golang, Java, Python) and lower (C, C++, Rust) levels. Keen eye for clean, maintainable code, and a unit-test driven mindset
  • Ability to collaborate with teams across an organization, blocking out noise, and focusing on what needs to get done to get a project across the line
  • Capable of adapting quickly, eager to research new technology and not get overwhelmed by unfamiliar tech stacks
  • General knowledge of hypervisors, virtual machine lifecycles, and Linux KVM tooling
  • Understanding of how to build Gitlab or Github CI/CD pipelines that deliver bug-free code across a multitude of compute platforms
Job Responsibility
Job Responsibility
  • Design highly reliable and performant Linux applications used to manage our virtualization stack across thousands of AI compute servers in multiple global datacenters
  • Integrate Crusoe applications with a wide variety of hardware and software AI chip-vendor stacks. Build solutions to optimize and monitor virtualized hardware (GPUs, Infiniband/ROCe NICs, Ephemeral Storage, etc.) in cutting-edge AI/HPC environments
  • Work side by side with our Linux Kernel and Hypervisor teams to ensure our Crusoe applications are seamlessly integrated with a variety of kernels and hypervisors
  • Analyze and enhance the performance of the entire virtualization stack, from the hypervisor to the virtualized guest OS, with a specific focus on optimizing AI/ML workloads. This includes profiling, bottleneck identification, and implementing low-level optimizations
  • Diagnose and resolve complex system issues across our virtualization stack (drivers, kernel, hypervisor, guest OS, and crusoe applications). Work closely with kernel and hypervisor teams to debug and resolve integration challenges
  • Conduct thorough code reviews to ensure the highest level of software quality, reliability, and security within compute applications and virtualization stack
  • Collaborate with other engineering teams, including hardware design, OS development, and AI/ML application teams, to ensure cohesive and integrated product development
  • Provide technical guidance and mentorship to junior engineers, fostering a culture of technical excellence and collaborative problem-solving within the compute applications team
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Network Automation

Crusoe's mission is to accelerate the abundance of energy and intelligence. We’r...
Location
Location
United States , San Francisco
Salary
Salary:
209000.00 - 253000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of network automation experience within a large-scale environment, building tools and software for production Networks
  • Solid experience in developing and understanding network device configurations for at least one network vendor (e.g. Arista, Juniper, Cisco, Brocade, Ciena, Infinera, Nokia, etc.)
  • Experience in at least one programming language (e.g. Python, Go, C++, or Java), and rapidly learning new development languages
  • Demonstrated knowledge of TCP, IPv4/6, Routing Protocols (one or more of BGP, MPLS, ISIS, or similar), and related network services (e.g. DHCP and DNS)
  • Experience with software and network debugging, profiling, and instrumentation techniques
  • Additional experience in developing automation tools for network operations such as provisioning (e.g. ZTP), deployments, monitoring, remediation, and software push systems in a DevOps environment
  • Experience with developing distributed systems and operating them at scale
  • Experience designing and maintaining automated testing infrastructure
  • In-depth knowledge of network protocols including TCP/IP, QoS, BGP, OSPF/IS-IS, EVPN, VXLAN, QoSand MPLS-related technologies like RSVP-TE, LDP, etc
  • Bachelor's in Computer Science, Information Science, Engineering, Mathematics, or a related field, or experience equivalent to a Bachelor's degree based on three or more years of work experience
Job Responsibility
Job Responsibility
  • Conceptualize, build, and maintain automation and tools to support New Product Introductions, network deployment, release engineering, and operations
  • Develop and implement operational process improvements in scalable, automated workflows to enhance operational efficiency
  • Lead enhancements of automation for continuous integration, validations, testing infrastructure, release, and configuration management across our global backbone, data center, and edge networks
  • Perform deep dives on complex technical issues across networks, ranging from automated tooling to hardware and network failures
  • Help increase operational efficiency between peers and cross-functional teams by identifying roadblocks, designing and delivering automation solutions, and driving change
  • Proactively improve our network infrastructure by designing, developing, and implementing automation solutions/tools
  • Manage current tooling for network provisioning, configuration, monitoring, and troubleshooting
  • Improve access to network telemetry data across the Crusoe Datacenters, Backbone, and Edge Networks
  • Collaborate with various Network Engineering teams to ensure implementation consistency across the entire network
  • Maintain comprehensive documentation for automation processes, tools, and procedures
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Sr. Infrastructure Engineer - Distributed Hosting

Owens Corning’s Global Information Services (GIS) provides a technology platform...
Location
Location
United States , Toledo; Tampa
Salary
Salary:
Not provided
owenscorning.com Logo
Owens Corning
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field
  • 10+ years of experience in hosting architecture, with a focus on enterprise-scale environments
  • Proven track record of providing technical consultation and collaboration
  • Understanding of Windows Server platforms, enterprise OS platforms, VMware vSphere/ESXi, and enterprise compute and storage systems
  • Familiarity with engineering and deploying backup technologies along with implementing robust disaster recovery (DR) and business continuity (BC) processes and procedures
  • Experience with infrastructure monitoring, capacity planning, and performance tuning
  • Knowledge of datacenter environment solutions required for the design, deployment, health and support of the infrastructure and application ecosystem (power, cooling, fire, connectivity, security, etc.)
  • Familiarity with automation tools and scripting for infrastructure management (e.g., PowerShell, Ansible)
  • Understanding of Artificial Intelligence and its efficient usage in infrastructure environments
  • Strong analytical and problem-solving skills with a proactive mindset
Job Responsibility
Job Responsibility
  • Develop and maintain a strong understanding of Owens Corning specific business processes and operations locally and globally
  • Build relationships within the organization, cross-functionally, and with key business stakeholders
  • Understand how IT infrastructure and services are directly aligned with the company's strategic objectives
  • Monitor and assess the health, capacity, and performance of hosting platforms
  • Provide advanced technical expertise throughout the lifecycle of Wintel servers, enterprise OS servers (e.g. Linux), VMware environments, and storage infrastructure
  • Collaborate with architecture and operations teams to design and implement enhancements
  • Provide technical experience and perspective to the lifecycle management of hosting environments
  • Help build and communicate advanced platform standards
  • Participate in the design and implementation of backup solutions along with disaster recovery and business continuity strategies
  • Establish and maintain engineering standards, templates, and automation for consistent platform deployment and configuration
Read More
Arrow Right