Manager, Lab Data Center Operations

T-Mobile

Location:
United States, Bellevue

Contract Type:
Not provided

Salary:

101900.00 - 183800.00 USD / Year

Job Description:

Manages highly complex data center operations across multiple geographically distributed locations. Accountable for executing operational strategy, driving consistency, and improving efficiency in environments supporting critical testing, integration, and pre-production activities. This role leads a high-performing operations team, ensures standardized processes across sites, and delivers reliable execution in a fast-paced, evolving environment. Requires strong technical fluency in data center infrastructure (including power, electrical systems, cooling, and network hardware) to effectively oversee operations and partner with multiple engineering teams. Success is measured through operational performance, system readiness, efficiency improvements, and the team’s ability to consistently deliver results. The role also supports budgeting, resource planning, and cost management.

Job Responsibility:

  • Lead and Develop Talent: Manage, coach, and develop a geographically distributed team; set clear expectations, drive accountability, and support a high-performance culture
  • Execute Operational Excellence: Implement and enforce standardized processes, procedures, and performance metrics to ensure consistency and efficiency across all locations
  • Deliver Results: Monitor and manage key performance indicators (KPIs), service levels, and operational goals; take ownership of team performance and outcomes
  • Manage Multi-Site Operations: Oversee day-to-day operations across multiple data center locations, ensuring alignment to standards and consistent execution
  • Infrastructure Oversight: Provide operational oversight of data center infrastructure including power, electrical systems, cooling, and network hardware to ensure reliability and performance
  • Incident and Problem Management: Manage escalations, support root cause analysis (RCA), and drive resolution of operational issues
  • Continuous Improvement: Identify and implement process and operational improvements to increase efficiency and effectiveness
  • Cross-Functional Collaboration: Partner with engineering, network, and infrastructure teams to support system readiness and operational needs
  • Customer Focus: Deliver responsive, high-quality support to internal stakeholders with a strong sense of urgency and accountability
  • Workforce & Resource Planning: Manage staffing plans, shift coverage, and resource allocation to support 24x7 operations
  • Budget & Cost Awareness: Support budget management, track expenses, and identify and drive opportunities for cost efficiency
  • Ensure Compliance & Standards: Maintain adherence to operational, safety, and organizational standards across all locations

Requirements:

  • Bachelor's Degree plus 3 years of related work experience OR advanced degree with 1 year of related work experience OR combination of education and experience deemed equivalent
  • 2–4+ years of experience in data center operations or critical infrastructure environments
  • 2–4+ years of leadership experience managing teams in multi-site, 24x7 environments
  • Experience driving operational improvements, process standardization, and efficiency initiatives
  • Experience leading distributed teams and managing remote operations
  • At least 18 years of age
  • Legally authorized to work in the United States

Nice to have:

  • Experience leading distributed teams and managing remote operations

What we offer:
  • medical insurance
  • dental insurance
  • vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • paid holidays
  • paid parental and family leave
  • family building benefits
  • back-up care
  • enhanced family support
  • childcare subsidy
  • tuition assistance
  • college coaching
  • short- and long-term disability
  • voluntary AD&D coverage
  • voluntary accident coverage
  • voluntary life insurance
  • voluntary disability insurance
  • voluntary long-term care insurance
  • annual bonus
  • mobile service & home internet discounts
  • pet insurance
  • commuter and transit programs

Additional Information:

Job Posted:
May 04, 2026

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Manager, Lab Data Center Operations

Production Systems Engineer

Meta is seeking a forward thinking, experienced candidate to join the Hardware D...
Location:
United States, Menlo Park
Salary:
118000.00 - 170000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience communicating across multiple groups and types of work
  • Experience influencing cross functional teams
  • Experience managing technical projects with a high degree of ambiguity
  • 2+ years experience in one or more of the following core areas: Capacity Planning, Networking, Project Management, Tooling and Automation, Hardware Design, Systems Administration, Hardware Validation (NPI), or Data Center Operations
  • 2+ years of experience managing servers in a large-scale distributed environment
Job Responsibility:
  • Drive labs participation in program design, test, phase exit, and retrospective efforts
  • Complex, open-ended troubleshooting and diagnostics for new hardware platforms
  • Troubleshoot, repair, document, and provide feedback for Linux-based data center hardware platforms
  • Work closely with remote hardware design and validation teams, and vendors to deploy and manage new server, storage, and networking products in the data center infrastructure
  • Test and troubleshoot new hardware products and components with minimal documentation and direction
  • Manage full lifecycle for lab hardware assets from provisioning through decommissioning
  • Identify, characterize, and root cause hardware failures and error conditions
  • Collaborate with hardware teams by running small scale experiments, collecting data, and providing feedback on failure symptoms for lab and production servers
  • Drive cross-functional coordination & communication with other data center operations teams
  • Lead efforts to deliver operational and serviceability feedback on new hardware platforms
What we offer:
  • bonus
  • equity
  • benefits

Network Operations Engineering Manager

Meta is seeking a Network Operations Engineering Manager to lead our Edge and Ne...
Location:
United States, Fort Worth
Salary:
162000.00 - 227000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
  • 8+ years of experience in network engineering, data center infrastructure, or large-scale network deployment, including hands-on responsibility for planning, building, or operating production or lab networks
  • 3+ years of people management experience leading engineering or technical operations teams responsible for network deployment, reliability, or site infrastructure
  • Experience managing external vendors or MSPs, including establishing expectations, monitoring SLAs (service level agreements), and holding vendors accountable for quality and timelines
  • Practical experience and knowledge of network, optical, and physical layer infrastructure in hyperscale environments
  • Demonstrated communication and stakeholder management skills, with a track record of influencing cross-functional teams and external partners
  • Demonstrated ability to drive operational programs from inception to delivery, using metrics to convey impact
Job Responsibility:
  • Lead, mentor, and grow a team of network operations engineers, fostering an environment of technical expertise, collaboration, and continuous learning
  • Set clear goals, provide regular feedback, and support career development for direct reports
  • Manage the team’s response to major incidents and site events (SEVs), ensuring rapid resolution and root cause analysis for edge, caching, and network infrastructure
  • Develop and implement strategies to improve incident response processes, reduce operational risk, and enhance network reliability
  • Own and evolve change management processes, ensuring security, business continuity, and compliance across Meta’s network infrastructure
  • Proactively identify and mitigate operational risks, collaborating with partner teams to design robust processes for data and asset protection
  • Represent ENS in cross-functional forums, influencing the design and integration of new technologies and infrastructure
  • Build and maintain key partnerships with internal business teams, external vendors, and engineering leaders to align on priorities and deliver critical projects
  • Drive continuous improvement in operational policies, processes, and procedures to optimize efficiency, quality, and scalability
  • Champion automation initiatives, working with engineering teams to ensure Tier-1 network faults are remediated by software and operational toil is minimized
What we offer:
  • bonus
  • equity
  • benefits
Employment Type:
Full-time

Senior Director, Critical Environments (Lab Operations)

We are seeking an industry veteran to serve as the Senior Director, Critical Env...
Location:
Taiwan, New Taipei City
Salary:
Not provided
JLL
Expiration Date:
Until further notice
Requirements:
  • 20+ years of progressive experience in Critical Environments (Data Centers, Semiconductor, Pharma, or R&D Labs), covering operations, engineering, planning, and innovation
  • 15+ years of direct people management experience, specifically leading large technical teams (50-100+ staff) and 'managing managers' in a multi-site, matrixed environment
  • Bachelor’s degree in Engineering (Mechanical/Electrical), Facilities Management, or a related technical field is required
  • A Master’s degree or MBA is highly preferred
  • Professional Engineer (PE), Certified Facility Manager (CFM), or PMP is preferred
Job Responsibility:
  • Executive Leadership & Organizational Strategy: Manage and mentor a high-performing organization of 100+ staff members through direct supervision of five specialized Directors
  • Foster a 'No Ego' culture of accountability and collaboration across diverse teams
  • Serve as the primary strategic partner to senior client stakeholders
  • Present complex technical and data concepts as clear business strategies to the C-Suite
  • Define the competency requirements and training standards for the entire critical environments organization
  • Operational Resilience & 24/7 Command: Oversee the Director of Critical Operations and Senior Director of Engineering & Ops Center to ensure 100% uptime in critical operations
  • Serve as the ultimate escalation point for major incidents
  • Lead executive communication, mitigation strategy, and systemic Root Cause Analysis (RCA)
  • Direct the strategy of the 24/7 Operations Center
  • Technical Governance & Engineering Excellence: Oversee comprehensive design reviews for MEP (Mechanical, Electrical, Plumbing) topology
Employment Type:
Full-time

Product Manager - AI Data Center Infrastructure

Product Manager - AI Data Center Infrastructure. We are seeking a Product Line M...
Location:
India, Bangalore
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date:
Until further notice
Requirements:
  • 5–10+ years of experience in data center networking, AI infrastructure, or HPC environments
  • Strong hands-on experience with Juniper QFX platforms and JunOS
  • Deep understanding of GPU architectures: NVIDIA (H100/H200, GB200/GB300, NVLink/NVSwitch) and AMD (MI300/MI400, Pollara NICs, Infinity Fabric)
  • Proven expertise in scale-up GPU interconnects and scale-out Ethernet fabrics
  • Strong knowledge of RDMA/RoCEv2, ECN, PFC, and buffer management
  • Familiarity with distributed AI workloads, collective operations (NCCL, RCCL)
  • Hands-on troubleshooting experience with high-speed optics, AEC cables, link training, and NIC firmware
  • Proficiency in automation and scripting (Python, Ansible, Bash, Terraform)
Job Responsibility:
  • AI Data Center & Fabric Architecture: Define product requirements for AI data center network architectures supporting thousands of GPUs
  • Develop requirements for low-latency Ethernet fabrics using Juniper QFX platforms and Apstra-based automation
  • Enable high-bandwidth GPU and NIC interconnects optimized for large-scale distributed training and inference workloads
  • GPU, NIC & Interconnect Strategy: Lead requirements definition for next-generation GPUs, NICs, and interconnect technologies, staying ahead of industry roadmaps
  • Drive alignment with NVIDIA and AMD ecosystems
  • Ensure interoperability across DAC, AEC, ACC, and optical transceivers between switches and NIC endpoints
  • Define scale-up paths using PCIe, NVLink, NVSwitch, ensuring GPU-to-GPU symmetry, consistency, and bandwidth determinism
  • Switching, Routing & Telemetry: Specify and optimize L2/L3 architectures, including EVPN-VXLAN, Class-E IPv4, and AI-optimized buffer tuning
  • Leverage hardware telemetry, streaming sensors, and analytics for proactive performance assurance
  • Drive automation using Python, Ansible, Apstra, Terraform, and related tools to enforce configuration consistency and compliance
What we offer:
  • Health & Wellbeing: comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Personal & Professional Development: specific programs catered to helping you reach any career goals
  • Unconditional Inclusion: unconditionally inclusive in the way we work and celebrate individual uniqueness

Manager, Network Operations and Support (Labs)

Own and shape Meta’s next-generation Lab infrastructure. This role has direct in...
Location:
United States, Menlo Park
Salary:
162000.00 - 227000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • 8+ years of experience in network engineering, data center infrastructure, or large-scale network deployment, including hands-on responsibility for planning, building, or operating production or lab networks
  • 3+ years of people management experience leading engineering or technical operations teams responsible for network deployment, reliability, or site infrastructure
  • Experience delivering end-to-end network deployments, including reading/interpreting engineering design packages (EDPs), managing BOMs, overseeing rack/stack/cabling, and validating fiber and structured cabling work
  • Demonstrated experience with network troubleshooting and operations, including Layer 1–3 fundamentals (fiber troubleshooting, optics/transceivers, routing/switching, and common protocol stacks)
  • Experience managing external vendors or MSPs, including establishing expectations, monitoring SLAs/SLIs, and holding vendors accountable for quality and timelines
  • Experience leading cross-functional programs, such as migrations, network upgrades, capacity augments, deployments, or infrastructure redesign projects
  • Bachelor’s degree in a technical field or equivalent practical experience
  • Relevant industry certification, such as CCNP or JNCIP, or equivalent demonstrated proficiency through hands-on experience
Job Responsibility:
  • Lead Lab Network Deployment Lifecycle (Expert-Level Proficiency, required for this role): Oversee planning, design, and execution of Lab network deployments, including EDP creation, topology validation, BOM oversight, scheduling, and cross-functional coordination. Drive consistency and quality across MDF/IDF builds, patching, cross-connects, and tie-cable augments
  • Own Operational Reliability for Lab Infrastructure (Advanced Proficiency): Ensure stable operations of ProdLabs and EngLabs, including monitoring KPIs/SLIs (MTTR, availability), triaging incidents, and driving root-cause analysis. Partner with NIS, SiteOps, and hardware validation teams to proactively mitigate risks and harden the Lab environment
  • Drive Automation, Tooling, and Process Improvements (Advanced Proficiency): Identify operational friction, manual workflows, and scaling bottlenecks; architect and implement automation opportunities across deployments, migrations, orchestration, inventory, patching, and ticketing flows. Work closely with XFNs to influence roadmap and adoption
  • Manage Vendor and Managed Service Provider (MSP) Execution (Advanced Proficiency): Lead a healthy, scalable ecosystem of vendors and MSPs focused on network availability, performance, and OE. Set clear expectations, oversee day-to-day execution, and ensure quality delivery across rack/stack, cabling, fiber work, and operational support. Establish or measure SLAs/SLIs, drive continuous improvement, and hold vendors accountable for reliability, responsiveness, and adherence to Meta’s standards
  • Oversee vendor performance across deployments, fiber work, rack-and-stack, and structured cabling. Set clear expectations, measure SLAs/SLIs, enforce accountability, and guide vendors through complex, multi-phase migration windows
  • Act as the operational leader for initiatives such as topology standardization and infrastructure consolidations. Align on scope, dependencies, and timelines
What we offer:
  • bonus
  • equity
  • benefits
Employment Type:
Full-time

Sr. Engineer – Critical Facilities

This role supports a highly dynamic lab environment responsible for the assuranc...
Location:
United States, Bellevue
Salary:
90300.00 - 162800.00 USD / Year
T-Mobile
Expiration Date:
Until further notice
Requirements:
  • Bachelor's Degree plus 3 years of related work experience OR advanced degree with 1 year of related work experience OR combination of education and experience deemed equivalent (Required)
  • Acceptable areas of study include Engineering, facilities, or related field (Preferred)
  • 4-7 years' experience in mechanical/HVAC, electrical, or a combination of the two (Preferred)
  • 4-7 years' experience in data center facilities operations, working with equipment mentioned above, preferably in the telecommunications industry (Preferred)
  • 4-7 years' experience managing vendor work and quality control (Preferred)
  • 4–7 years of direct experience operating, designing, constructing, maintaining, and managing large‑scale production data centers or critical facilities at the enterprise level
  • Demonstrated engineering leadership across disciplines required to sustain continuous availability (power, cooling, controls, fire/life safety, etc.)
  • Proven experience in critical facility operations and construction support, including vendor management, maintenance programs, fault analysis, and emergency response
  • Experience in lab, R&D, or development environments supporting evolving technology stacks is a strong plus
  • At least 18 years of age
Job Responsibility:
  • Conduct daily site walks to review and QA ongoing tickets (maintenance, break/fix, customer projects), confirming vendor and customer needs are fully met
  • Direct and manage vendors for upgrades and Operations‑initiated projects, ensuring quality, safety, and adherence to scope, schedule, and budget
  • Maintain vigilant safety awareness in a complex, high‑risk environment with critical infrastructure and diverse occupants
  • Identify, troubleshoot, and drive resolution for tactical issues discovered during walks or reported by customers, seeing them through to final solution
  • Participate in construction and project meetings, collaborating as both service provider and customer to ensure Operational and critical facility requirements are represented
  • Share in an on‑call rotation with a teammate to provide 24x7 emergency response coverage for facility‑impacting events
  • Oversee day‑to‑day operation of facility infrastructure in close collaboration with vendors, maintaining resilience and performance targets
  • Produce routine, periodic, and ad hoc reports as requested (operations, incidents, capacity, risk, etc.)
  • Apply strong budget awareness to maintenance, projects, and recommendations, dynamically balancing cost, risk, and customer need
  • Dedicate at least 20% of your time to “moving the bar forward” through continuous improvement, innovation, and initiatives that increase resiliency, efficiency, and customer experience
What we offer:
  • competitive base salary and compensation package
  • annual stock grant
  • employee stock purchase plan
  • 401(k)
  • free, year-round money coaches
  • annual bonus or periodic sales incentive or bonus
  • medical, dental and vision insurance
  • flexible spending account
  • paid time off
  • up to 12 paid holidays
Employment Type:
Full-time

Senior Facilities Manager

The Senior Facilities Manager is a key leader on the account team who is respons...
Location:
United States, Milpitas
Salary:
155000.00 - 185000.00 USD / Year
JLL
Expiration Date:
Until further notice
Requirements:
  • 10+ years Facilities Management experience, with cleanroom manufacturing and lab space experience preferred
  • Experience working in union environments and with union work rules
  • Strong executive presence, effective communication and leadership skills, and ability to build lasting client relationships among a wide variety of stakeholders
  • Strong organizational, management, interpersonal, and supervisory skills, with the ability to manage ambiguity and change and to delegate effectively
  • Excellent communications skills, both written and verbal, and an ability to effectively present to large groups
  • Technology proficiency (MS Office, a wide variety of web-based applications including mobile technologies, and CMMS (Corrigo) supervisory responsibilities)
  • Strong financial acumen with background in successfully managing P&L outcomes
  • Working knowledge of skilled trades and building-related systems with demonstrated safety leadership
  • Familiar with project delivery process and capital planning functions
Job Responsibility:
  • Ensure quality delivery of Facility Management Operations while supporting the Engineering, Project Management, Relocation (MAC), and Occupancy Planning functions
  • Responsible for the Milpitas HQ campus and outlying facilities, encompassing approximately 1 million SF
  • Ensure all contract deliverables are met or exceeded, including key performance indicators, operational uptime, cost savings initiatives, energy consumption reduction initiatives, service improvements using standardized operating procedures, and introduction of best practices and innovations
  • Develop operating and capital budgets, control costs, and coordinate service provider and staff activities
  • Resolve escalated issues, drive continuous process improvement and team development
  • Provide regular performance feedback, development and coaching to direct reports
  • Develop and maintain meaningful relationships with all key client stakeholders
  • Ensure high client satisfaction by continually monitoring performance and delivery
  • Ensure uptime and proper maintenance of electrical, mechanical, life safety, and critical back-up systems
  • Understand the engineering design and operational aspects of all critical systems and equipment with strong emphasis on UPS, SEP, MEP systems and cleanroom, lab, and data center environments
What we offer:
  • 401(k) plan with matching company contributions
  • Comprehensive Medical, Dental & Vision Care
  • Paid parental leave at 100% of salary
  • Paid Time Off and Company Holidays
  • Early access to earned wages through Daily Pay
Employment Type:
Full-time

Reliability Engineer

JLL is seeking a Reliability Engineer to join our team! This position is with a ...
Location:
Belgium, Brussels
Salary:
Not provided
JLL
Expiration Date:
Until further notice
Requirements:
  • University Degree in an Engineering discipline, mechanical or electrical preferred
  • 5-10 years’ experience implementing RCM, CbM/PdM methods, operating building automation and energy management systems, and thorough understanding of asset management, data analytics, and capital planning approaches
  • Experience with building automation systems and automated fault detection & diagnostics with integration of connected systems
  • Experience in critical/regulated environments preferred (data centers, laboratories, manufacturing environments, automotive, petro-chemicals, pharmaceuticals, etc.)
  • Extensive knowledge of mechanical, electrical, plumbing and fire suppression systems
  • Extensive knowledge of commercial, critical, manufacturing, labs, or distribution facility types required
  • Ability to use a variety of Computerized Maintenance Management Systems and IT tools
  • Fluent in English and Flemish/Dutch/French
  • Proven ability to read, comprehend and apply information from technical manuals and other reference materials
  • Ability to make informed recommendations in situations where data sets may be incomplete
Job Responsibility:
  • Implementing JLL’s enhanced Reliability & Asset Services program that delivers a whole life approach to asset management
  • The acquisition and analysis of data from connected systems to continually improve the maintenance program and meet outcomes-based performance measures
  • Facilitation of root cause failure analysis (RCFA) of equipment failures to determine the required corrective action for the situation
  • Analysis to determine the reliability of components, equipment, and processes
  • Determination of the cost advantages of alternative maintenance approaches and development of action plans that comply with internal/external customer demands for reliability processes/equipment to avoid failures
  • Completing ongoing maintenance maturity assessments to assure progress to plan
  • Works under general direction of Reliability & Asset Services platform for instruction in deploying client’s strategic asset management plan
  • Integrates data from BAS, automated fault detection diagnostic engines, maintenance programs, and capital planning processes to develop life-cycle analyses and recommendations for repair vs. replacement decisions
  • Support all efforts in the execution of site level Asset Management & Reliability programs and processes to effectively increase machine/facility system reliability and compliance
  • Conducts program and system/equipment audits on a periodic and as needed basis
Employment Type:
Full-time