This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Meta is seeking a Production Systems Engineer to join our Hardware Design and Release to Production (HDRTP) team in Dublin, Ireland. Our servers and data centers are the foundation upon which our rapidly scaling infrastructure operates efficiently to deliver Meta's services globally. The HDRTP team is responsible for the end-to-end Hardware Lifecycle of all Meta servers, from exploration and development to production health. HDRTP Engineers work closely with Production Engineering teams, Enterprise Networking, Hardware Designers, Networking Teams, Manufacturers, Vendors, Datacenter Operation teams and New Product Introduction teams to ensure the smooth operation of systems across the planet. We encounter problems from the very smallest of scales (errors occurring at the microscopic scale, within single registers of a CPU) up to the very largest - deploying solutions to Meta's millions of devices globally. We look for people with proven experience of finding solutions to complex issues, embracing ambiguity and driving impact, who want to tackle the hardest problems in the domain. Typically we will hire engineers from backgrounds such as Site Reliability Engineer (SRE), Software Engineer, Systems Engineer, Systems Development Engineer, DevOps Engineer, Systems Administrator, or similar.
Job Responsibility:
Build and develop tooling solutions to automate business critical processes in service of managing the health of the Meta production hardware fleet
Troubleshoot, diagnose and root cause system failures, working with key partners to identify and deliver solutions
Proactively identify opportunities to fix or enhance tooling, hardware and processes
Build subject matter expertise in one or more of the specialist areas covered by the RTP (Release To Production) team in Dublin
Scientific approach to troubleshooting, root-cause analysis and investigation
Requirements:
An engineering degree is typical, or related technical discipline, or equivalent work experience
4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)
Experience building, maintaining and debugging production services or platforms - usually (but not necessarily) in a linux/unix environment
Knowledge of server architecture and components across Compute/Storage/AI Systems/Networking
Nice to have:
4+ years experience coding in a higher-level language (Python, PHP, Java, Go, Rust, C++)
Experience managing and debugging hardware platforms in a cloud environment
Demonstrated ability to drive projects to successful business outcomes