CrawlJobs Logo

Data Center Operations Engineer

lambda.ai Logo

Lambda

Location Icon

Location:
United States , Vernon

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

109000.00 - 145000.00 USD / Year

Job Description:

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mission is to make compute as ubiquitous as electricity and give everyone the power of superintelligence. One person, one GPU. If you'd like to build the world's best AI cloud, join us.

Job Responsibility:

  • Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured
  • Troubleshoot hardware and software issues in some of the world’s most advanced GPU and Networking systems
  • Document and update data center layout and network topology in DCIM software
  • Work with supply chain & manufacturing teams to ensure timely deployment of systems and project plans for large-scale deployments
  • Manage a parts depot inventory and track equipment through the delivery-store-stage-deploy-handoff process in each of our data centers
  • Partner with HW Support teams to ensure data center hardware incidents with higher level troubleshooting challenges are resolved, reported on and solutions are disseminated to the large operations organization
  • Work with the RMA team to ensure faulty parts are returned and replacements are ordered
  • Follow installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers
  • Improve installation standards, MOPs, and runbooks
  • Act as a technical escalation point for DC infrastructure issues
  • Participate in an on-call rotation, serving as an escalation point for data center incidents

Requirements:

  • Strong experience with critical infrastructure systems supporting data centers (power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, cable management)
  • Familiar with carrier DIA circuit test and turn ups, understanding LOA’s, and fiber testing and troubleshooting
  • Solid understanding of cable, fiber, and optics and their different use cases
  • Solid understanding of single and three phase power theories including PDU balancing
  • Base level network fundamentals (CCNA preferred but not required)
  • Knowledge of cold aisle and hot aisle containment
  • Solid understanding of server hardware and boot process (PXE, DHCP, & TFTP)
  • Work with product management, support, and other teams to align operational capabilities with company goals
  • Translating business priorities into technical and operational requirements
  • Supporting cross-functional projects where infrastructure plays a critical role
  • Action-oriented and willing to train junior staff on best practices
  • Willing to travel to bring up new data center locations as needed

Nice to have:

  • 3+ years experience with critical infrastructure systems supporting data centers
  • Experience with/or knowledge of network topology and configurations and 400gb Infiniband architectures
  • Experience with project management
  • 3+ years working with and reporting from a ticketing systems like Service Now, JIRA, and Zendesk
  • Experience with Linux administration
  • Experience with High Performance Compute GPU systems (air or water cooled) - especially Nvidia NVL72
  • Experience with troubleshooting the following network layers, technologies, and system protocols: TCP/IP, DP/IP, BGP, OSPF, SNMP, SSL, HTTP, FTP, SSH, Syslog, DHCP, DNS, RDP, NETBIOS, IP routing, Ethernet, switched Ethernet, 802.11x, NFS, and VLANs
What we offer:
  • Generous cash & equity compensation
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible paid time off plan

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Data Center Operations Engineer

Data Center QA Engineer

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS/BS degree in Computer Science or equivalent experience
  • Expert knowledge of Layer 2 and Layer 3 technologies through validation or deployment of networking products
  • Hands-on experience with at least one popular Networking OS: JunOS, NXOS, IOS, EOS, or SONiC
  • Solid understanding of clos-based Data Center network architectures (3-stage and 5-stage)
  • Familiarity with Data Center protocols such as VXLAN and MP-BGP
  • Proficiency in Python programming
  • Strong grasp of Linux-based systems and network troubleshooting tools
  • A quality-focused mindset with a keen eye for identifying product and interaction limitations
  • Minimum 5+ years of relevant experience
Job Responsibility
Job Responsibility
  • Test IP networking-related software products to ensure they operate as defined by requirements
  • Build network configurations to model well-optimized network reference designs
  • Plan, develop, and execute automated and manual test plans
  • Provide constructive feedback, report issues, and interact with developers to deliver superior product quality
  • Review requirements from the Product Management team
  • Utilize network troubleshooting tools (packet captures, monitoring devices, log files, customer input) to resolve issues effectively
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime
Read More
Arrow Right

Data Center QA Engineer

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS/BS degree in Computer Science or equivalent experience
  • Expert knowledge of Layer 2 and Layer 3 technologies through validation or deployment of networking products
  • Hands-on experience with at least one popular Networking OS: JunOS, NXOS, IOS, EOS, or SONiC
  • Solid understanding of clos-based Data Center network architectures (3-stage and 5-stage)
  • Familiarity with Data Center protocols such as VXLAN and MP-BGP
  • Proficiency in Python programming
  • Strong grasp of Linux-based systems and network troubleshooting tools
  • A quality-focused mindset with a keen eye for identifying product and interaction limitations
  • Minimum 5+ years of relevant experience
Job Responsibility
Job Responsibility
  • Test IP networking-related software products to ensure they operate as defined by requirements
  • Build network configurations to model well-optimized network reference designs
  • Plan, develop, and execute automated and manual test plans
  • Provide constructive feedback, report issues, and interact with developers to deliver superior product quality
  • Review requirements from the Product Management team
  • Utilize network troubleshooting tools (packet captures, monitoring devices, log files, customer input) to resolve issues effectively
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime
Read More
Arrow Right

Data Center Engineer

Akuna is seeking a Data Center Engineer, based on the East Coast, to join our IT...
Location
Location
United States , East Coast (Secaucus, Carteret, Mahwah, New Jersey metro area)
Salary
Salary:
100000.00 USD / Year
akunacapital.com Logo
AKUNA CAPITAL
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of data center experience
  • Bachelor's degree in computer science, Information Systems, or a related field preferred
  • Knowledge of data center procedures (hardware racking, network cabling, etc.)
  • Experience working in a modern, large-scale data center
  • Understanding network hardware (switches, SFP’s), servers (memory, hard drives), and cabling (fiber optic cabling standards / types)
  • Proven ability to diagnose problems quickly
  • Highly motivated and organized self-starter
  • Ability to clearly communicate detailed instructions both verbally and written
  • Strong appreciation for teamwork
  • Willingness to travel out-of-state and occasionally work outside of core business hours and on weekends
Job Responsibility
Job Responsibility
  • Oversee building, organizing, and maintaining data center infrastructure at colocation facilities
  • Lead scheduling of daily East Coast Data Center Operations activities
  • Install, replace, upgrade, and move cables and equipment
  • Plan and execute data center moves and re-organization
  • Partner with Asset Manager on tracking and reporting
  • Maintain a local inventory of cables and equipment
  • Use monitoring tools to detect and respond to critical issues
  • Troubleshoot technical issues and respond to hardware failures
  • Coordinate with third-party vendors for support and remote hands
  • Develop and maintain documentation of data center procedures and deployments
What we offer
What we offer
  • Discretionary performance bonus
  • Comprehensive benefits package (employer-paid medical, dental, vision, retirement contributions, paid time off, and other benefits)
  • Fulltime
Read More
Arrow Right

Sr. Network Data Center Engineer

If you live and breathe networking, virtualization, and high-availability system...
Location
Location
United States
Salary
Salary:
150000.00 USD / Year
corporatetools.com Logo
Corporate Tools
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with Proxmox or other hypervisors (VMware, KVM, Xen, Hyper-V)
  • 5+ years of network engineering, data center operations, or cloud infrastructure
  • Experience with Ceph or SAN-based storage solutions (iSCSI, NFS, ZFS)
  • Experience with containers and networking
  • Excellent problem-solving skills and a keen eye for detail
  • Ability to work on projects solo or with a team
  • Love for learning and improving code
  • Strong communication and collaboration skills
  • Understanding of Ceph storage architecture (OSDs, MONs, MDS, RADOS, etc.)
  • Experience in iSCSI/NFS/ZFS SAN setups and performance tuning
Job Responsibility
Job Responsibility
  • Develop and design robust and scalable software solutions
  • Take ownership of projects from conception to deployment, ensuring timely delivery and meeting the specified requirements
  • Work closely with cross-functional teams, including IT, product management, and other software teams, to ensure seamless integration and alignment with business objectives
  • Stay updated with the latest industry trends, technologies, and best practices to bring innovative solutions to the table
  • Design, implement, and maintain a robust network architecture that supports Proxmox virtualization, Ceph/SAN storage, and container networking
  • Manage firewalls (iptables, pfSense, UFW, etc.) to secure access to virtualized environments and hosting services
  • Configure and optimize VLANs, subnets, and routing to ensure isolated and secure network segments for virtual machines, storage, and frontend applications
  • Configure and maintain VPNs, BGP, OSPF, or other routing protocols to ensure proper network redundancy and failover
  • Set up and maintain bridged, NAT, and VXLAN networking in Proxmox for efficient VM communication
  • Implement high-availability (HA) networking for Hypervisor networks and Ceph/SAN clusters
What we offer
What we offer
  • 100% employer-paid medical, dental and vision for employees
  • Annual review with raise option
  • 22 days Paid Time Off accrued annually, and 4 holidays
  • After 3 years, PTO increases to 29 days. Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
  • The 4 holidays are: New Year’s Day, Fourth of July, Thanksgiving, and Christmas Day
  • Paid Parental Leave
  • Up to 6% company matching 401(k) with no vesting period
  • Quarterly allowance
  • Use to make your remote work set up more comfortable, for continuing education classes, a plant for your desk, coffee for your coworker, a massage for yourself... really, whatever
  • Open concept office with friendly coworkers
  • Fulltime
Read More
Arrow Right

Data Center Operations Manager

As the Manager of our datacenter operations team you’ll contribute in the strate...
Location
Location
United States , Santa Clara
Salary
Salary:
122500.00 - 179630.00 USD / Year
rackspace.com Logo
Rackspace
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, computer engineering or a related field. Additional experience may substitute for the degree
  • 7+ years of experience as a data center operations technician
  • Previous people management within a Data Center experience is required
  • Demonstrated successful experience meeting data center production/operation schedules
Job Responsibility
Job Responsibility
  • Manage a team of datacenter operation engineers and maintain a better than 99.999% uptime through impeccable housekeeping and robust operational discipline
  • Report on operational performance to the leadership team
  • Recommend changes in procedures or equipment that would increase productivity, reduce cost, and better serve Data Center requirements and customers
  • Train employees on policies and procedures and engage them in change
  • Recommend employees for hiring, firing, promotions and demotions
  • Provide input on pay reviews
  • Prepare and perform performance appraisals
  • Monitor and prioritize an internal ticketing system
  • Provide operating system storage troubleshooting, along with storage upgrades, hardware troubleshooting and Raid configuration changes
  • Provide hardware support and upgrades for servers running Microsoft Windows Server, Red Hat Enterprise Server, Ubuntu Linux or VMWare ESX Server
What we offer
What we offer
  • Incentive compensation opportunities in the form of an annual bonus or incentives, equity awards and an Employee Stock Purchase Plan (ESPP)
  • Fulltime
Read More
Arrow Right

Is Data Center Operations Engineer

Bridging Information Technology (IT) and the Mechanical, Electrical, and Plumbin...
Location
Location
United States , New Albany
Salary
Salary:
91731.00 - 114948.00 USD / Year
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s degree
  • Bachelor’s degree and 2 years of data center operations experience
  • Associate’s degree and 6 years of data center operations experience
  • High school diploma / GED and 8 years of data center operations experience
  • Hands-on experience with rack/stack, structured cabling, and IT hardware installation
  • Familiarity with Dell PowerEdge, Nutanix, NetApp, and Cisco platforms
  • Ability to interpret electrical and mechanical drawings (awareness-level competency)
  • Experience using monitoring, alerting, or automation systems (AI-enabled platforms preferred)
  • Solid understanding of IT operations concepts including hardware lifecycle management and disaster recovery
  • Ability to read and update documentation, diagrams, and cable records
Job Responsibility
Job Responsibility
  • Serve as the liaison between IT teams and facilities staff, ensuring flawless communication
  • Interpret electrical one-line diagrams, distribution drawings, and cooling schematics to support incident response and planning
  • Install, rack, cable, and support enterprise IT systems including Dell PowerEdge, Nutanix, NetApp, and Cisco technologies
  • Support day-to-day moves, adds, and changes (MACs) in building IDF and VDER environments
  • Perform fiber and copper patch cabling in data centers, IDFs, and VDER closets
  • Trace and troubleshoot cabling issues to restore connectivity
  • Monitor infrastructure, proactively detect issues, and bring up with urgency to appropriate teams
  • Apply AI-enabled monitoring and automation platforms to enhance data center operations
  • Maintain documentation of infrastructure layouts, procedures, and operational standards
  • Participate in capacity planning, disaster recovery drills, and continuous improvement initiatives
What we offer
What we offer
  • A comprehensive employee benefits package, including a Retirement and Savings Plan with generous company contributions, group medical, dental and vision coverage, life and disability insurance, and flexible spending accounts
  • A discretionary annual bonus program, or for field sales representatives, a sales-based incentive plan
  • Stock-based long-term incentives
  • Award-winning time-off plans
  • Flexible work models, including remote and hybrid work arrangements, where possible
  • Fulltime
Read More
Arrow Right

Data Operations Engineer

BA Markets wants to professionalise and streamline its activities with regards t...
Location
Location
Poland , Katowice
Salary
Salary:
Not provided
vattenfall.com Logo
Vattenfall
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Interest in understanding what the user needs with several years of hands-on experience as software developer with an interest in the responsibilities of a data engineer or vice versa
  • A proactive, communicative team player
  • Fluent in English
  • Deep understanding of Kafka architecture (brokers, topics, partitions, replication), and experience with Kafka Streams, Kafka Connect, and schema registry (e.g., Confluent)
  • Proficiency in designing and managing Kafka clusters (including monitoring and scaling)
  • Hands-on Experience building and maintaining real-time ETL pipelines
  • Familiarity with stream processing frameworks like: Apache Flink or Apache Spark Streaming
  • Strong skills in: Python and Java and at least basic Scala and at least solid SQL experience
  • Has build several CI/CD pipelines (e.g., Jenkins, GitLab CI, GitHub Actions)
  • And used infrastructure as Code (IaC) tools like Terraform or Ansible
Job Responsibility
Job Responsibility
  • Stream deployment and stream architecture, developments and deployments
  • Automate workflows and orchestrate data pipelines
  • Implement CI/CD routines
  • Implement and monitor “system health” with observability tools and data quality checks
  • Support the development of Client Libraries so other applications can integrate streams in own application and services
  • Perform Python development
  • Perform “glue code” development that 95% of use cases can apply
What we offer
What we offer
  • Good remuneration
  • Challenging and international work environment
  • Possibility to work with some of the best in the field
  • Working in interdisciplinary teams
  • Support from committed colleagues
  • Attractive employment conditions
  • Opportunities for personal and professional development
  • Fulltime
Read More
Arrow Right

Data Engineer

We are looking for a skilled and enthusiastic Data Engineer to help design and o...
Location
Location
United States , East Windsor
Salary
Salary:
Not provided
beaconfireinc.com Logo
Beaconfire
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Advanced working SQL knowledge and experience working with relational databases, query authoring (SQL) as well as working familiarity with a variety of databases
  • Experience building and optimizing ‘big data’ data pipelines, architectures and data sets
  • Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement
  • Strong analytic skills related to working with unstructured datasets
  • Build processes supporting data transformation, data structures, metadata, dependency and workload management
  • A successful history of manipulating, processing and extracting value from large disconnected datasets
  • Working knowledge of message queuing, stream processing, and highly scalable ‘big data’ data stores
  • Strong project management and organizational skills
  • Experience supporting and working with cross-functional teams in a dynamic environment
Job Responsibility
Job Responsibility
  • Create and maintain optimal data pipeline architecture
  • Assemble large, complex data sets that meet functional / non-functional business requirements
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies
  • Build analytics tools that utilize the data pipeline to provide actionable insights into customer acquisition, operational efficiency and other key business performance metrics
  • Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs
  • Keep our data separated and secure across national boundaries through multiple data centers and AWS regions
  • Create data tools for analytics and data scientist team members that assist them in building and optimizing our product into an innovative industry leader
  • Work with data and analytics experts to strive for greater functionality in our data systems
  • Fulltime
Read More
Arrow Right