Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU. If you'd like to build the world's best AI cloud, join us.
Job Responsibilities:
Manage and lead a team of data center technicians
Maintain high availability, reliability, and security in the data center environment
Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured
Troubleshoot hardware and software issues
Document data center layout and network topology in DCIM software
Work with supply chain & manufacturing teams to ensure timely deployment of systems and project plans for large-scale deployments
Assess current and future state data center requirements based on growth plans and technology trends
Manage a parts depot inventory and track equipment through the delivery-store-stage-deploy-handoff process
Create installation standards and documentation for placement, labeling, and cabling
Oversee deployments and day-to-day operations of the data center
Maintain uptime for assets and infrastructure, and ensure customer SLAs are met
Participate in technical discussions and provide expertise on data center integration and deployment strategies
Understand power/cooling requirements as well as cabling needs
Work closely with cross-functional teams
Ensure the data center complies with Lambda’s standards and policies
Requirements:
5+ years of experience with critical infrastructure systems supporting data centers, such as power distribution, airflow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management
Basic understanding of Linux administration
Experience in setting up networking appliances (Ethernet and InfiniBand) across multiple data center locations
Attention to detail and ability to follow instructions
Action-oriented, with a strong willingness to learn
Desire to mentor other team members
English fluency is required
Nice to have:
Troubleshooting experience and theoretical knowledge of network layers, technologies, and system protocols: TCP/IP, OSPF, SNMP, SSL, HTTP, FTP, SSH, Syslog, DHCP, DNS, RDP, NetBIOS, IP routing, Ethernet, switched Ethernet, 802.11x, NFS, and VLANs
Experience working in large-scale distributed data center environments
Experience working with auditors to meet all compliance requirements (ISO/SOC)
Experience with Supermicro and NVIDIA hardware
Previous data center team management experience
What we offer:
Generous cash & equity compensation
Health, dental, and vision coverage for you and your dependents