This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Meta is seeking a Data Center Capacity Engineer to help ensure our global data center infrastructure can meet the demands of billions of users across Meta's family of apps and services. In this role, you will analyze server and infrastructure capacity trends, model future demand, and partner with hardware, network, and site operations teams to plan and deliver capacity at scale. Your work will directly influence how Meta provisions, allocates, and optimizes compute, storage, and network resources across its data center fleet, enabling reliable and efficient operation of the platforms that connect people worldwide.
Job Responsibility
Own accountability for the three capacity workflows (receiving, moves, decommissioning), facilitating collaboration among various cross-functional partners to meet capacity demands
Collaborate with key stakeholders and partners to develop a strategy and drive initiatives that lead to meaningful improvements in support of data center operations. Maintain consistent touchpoints with key XFN partners across data centers
Analyze business capacity demands and translate that data into local plans to enable rapid delivery of capacity to the data center
Plan, lead and collaborate with cross-functional data center teams to deliver complex data center infrastructure capacity projects in support of Meta’s growth, considering the interdependencies of production resiliency, power, cooling, network, server and application layers
Build cross-functional relationships and have the ability to influence policies and procedures to improve regional/global data center operations
Develop and share best practices across all global data centers for all elements of capacity while creating a culture of innovation, collaboration, accountability, continuous improvement, and safety
Drive alignment and execution of key capacity strategic, engineering, and operational initiatives across functional partners at the data center. Ensure operational consistency, to scale operations efficiently and effectively
Lead data analytics, metrics, and the interpretation of a complex environment to identify inefficiencies, opportunities, exceptions, and correlations, and proactively respond before they impact data center uptime and utilization. Perform root cause analysis of complex technical and engineering issues and drive resolution
Create/improve global standards for processes, workflows, and automation roadmaps for software automation that facilitate deployment, maintenance, and decommissioning of server hardware at scale
Work with Meta hardware and software engineering teams to help resolve complex technical issues that affect Meta's computing infrastructure
Mentor capacity team members both locally and globally. Seek out and provide guidance on challenges others are having and actively fix them in a scalable way
Apply deep knowledge of infrastructure (including, but not limited to cooling, power, networking, and automation) as it relates to the capacity role. Be a Subject Matter Expert (SME) in one or more of these areas
Requirements
7+ years of experience in a combination of capacity planning, demand and supply management, production planning, operations planning, or infrastructure management
B.S., B.A., or B.Eng. in a relevant field, or equivalent degree or certification
Experience with process ownership and development, and systems development
Knowledge of enterprise-level networking, servers, and storage installations
Demonstrated understanding of data center infrastructure systems and applications
Familiarity with data center power and cooling constraints, and their impact on server density and capacity planning decisions
Ability to communicate effectively, in a clear and concise manner, appropriately tailoring messages to multiple audiences
Demonstrated ability to solve complex problems and to deliver at scale
Nice to have
Master's degree in an engineering discipline
Proficiency in programming and scripting languages such as SQL, R, Python, Bash, or other programming languages
Background in developing scenario-based capacity models that account for hardware supply chain variability and workload elasticity
Experience with server hardware lifecycle management, including procurement planning, rack integration, and hardware refresh programs in a large-scale data center environment
Experience in driving results through AI in a hyperscale environment
Experience building automated capacity tracking systems or integrating capacity data across multiple infrastructure domains
Experience in the application of data-driven continuous improvement through Lean Six Sigma or other process analysis methodologies, visualization, and modeling
Project management and delivery experience through Agile methodology or PMP certification