CrawlJobs Logo

Hardware Development Infrastructure Engineer

openai.com Logo

OpenAI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

260000.00 - 335000.00 USD / Year

Job Description:

We’re looking for a Hardware Development Infrastructure Engineer to build and run the infrastructure that powers OpenAI’s hardware development lifecycle. You’ll work closely with hardware teams to translate their workflows into scalable, observable, and automated systems, and then own the platforms that support them over time. This role sits at the intersection of hardware, cloud, HPC, DevOps, and data. You’ll design regression systems, CI/CD pipelines, cloud and cluster platforms, and the data foundations that make development efficiency visible and measurable.

Job Responsibility:

  • Partner with hardware teams on workflows and tooling: Embed with teams across DV, PD, emulation, formal, and software to understand development flows, identify failure modes, and deliver tooling (CLIs, services, APIs) that reduces manual work and accelerates iteration
  • Build and operate regression systems at scale: Own regressions end-to-end—from definition and scheduling to execution, results ingestion, triage, and reporting—while improving throughput, reproducibility, and flake reduction
  • Own CI/CD for infrastructure and tooling: Design and operate pipelines for infrastructure-as-code, services, images, and cluster configuration changes, including testing, gated deploys, staged rollouts, and safe rollback
  • Run cloud and HPC platforms: Design, provision, and operate cloud infrastructure (Azure preferred) and HPC/HTC clusters (e.g., Slurm), tuning scheduling policies, autoscaling, node lifecycles, and cost-performance tradeoffs
  • Build data foundations and visibility: Develop ETL pipelines to ingest metrics, logs, and results
  • operate databases for workflow metadata and outcomes
  • and build dashboards that surface efficiency, utilization, and reliability trends
  • Drive operational excellence: Establish monitoring and alerting, lead incident response and postmortems, maintain runbooks, and produce clear, durable documentation

Requirements:

  • Familiarity with chip development workflows and at least one deep EDA domain (e.g., DV, PD, emulation, or formal verification)
  • Strong infrastructure fundamentals, including cloud platforms, networking, security, performance, and automation
  • Experience operating cloud environments (Azure preferred
  • AWS, GCP, or OCI acceptable) with strong infrastructure-as-code practices (e.g., Terraform, Bicep
  • configuration management tools a plus)
  • Strong programming skills (Python preferred) and solid software engineering and scripting practices
  • Experience building and operating CI/CD systems (e.g., Jenkins, Buildkite, GitHub Actions), including testing and release workflows
  • Database experience (e.g., Postgres or MySQL), including schema design, migrations, indexing, and operational safety
  • Clear communicator with strong judgment—able to explain tradeoffs, propose pragmatic solutions, and articulate a realistic vision for scalable infrastructure

Nice to have:

  • Experience operating Slurm or other large-scale cluster schedulers
  • Experience with enterprise authentication and directory services (e.g., Entra ID, LDAP, FreeIPA, SSSD)
  • Experience building or operating backend and middleware systems such as message queues, caches, artifact stores, or internal service platforms
  • Familiarity with high-performance storage architectures and data movement optimization
  • Experience running and monitoring license servers for expensive or capacity-constrained toolchains
What we offer:
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Relocation support for eligible employees
  • Additional taxable fringe benefits, such as charitable donation matching and wellness stipends, may also be provided

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Hardware Development Infrastructure Engineer

MTS Hardware Engineer

T-Mobile is seeking a Member of Technical Staff (MTS) Hardware Engineer to lead ...
Location
Location
United States , Bellevue
Salary
Salary:
Not provided
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7-10 years related work experience
  • 5+ years developing cloud server and rack hardware solutions
  • 5+ years as lead technologist, architect engineer in the hardware space for a major technology, component, product, product line
  • experience with memory, CPU, signal integrity, peripheral device integration, hardware monitoring sensors, and device firmware interaction
  • experience as a lead engineer for complex electronic systems
  • experience driving the technology roadmap and quality of third-party suppliers and working with supply chain
  • experience with mentoring engineers on technical development
  • bachelor's degree of Electrical Engineering, Computer Science, Mechanical Engineering, a related subject area, or equivalent industry experience
Job Responsibility
Job Responsibility
  • Lead the technical design and implementation of large-scale, cross-platform cloud hardware solutions to optimize infrastructure performance and reliability
  • develop and drive the technology roadmap for cloud hardware, ensuring alignment with organizational goals and industry best practices
  • collaborate with senior engineers and cross-functional teams to evaluate and integrate new hardware products and architectural approaches
  • mentor and guide engineers in technical development, fostering a culture of innovation and continuous improvement
  • conduct signal integrity analysis, and manage CPU, memory, peripheral interfaces, and device firmware interactions to ensure robust system performance
  • oversee the quality and reliability of third-party hardware suppliers, working closely with the supply chain to maintain high standards
  • analyze competitive market and industry trends to identify opportunities for innovation and strategic advantage
  • serve as the technical spokesperson for rack and server engineering, representing T-Mobile's technical capabilities
What we offer
What we offer
  • medical, dental, and vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • paid parental and family leave
  • family building benefits
  • back-up care
  • enhanced family support
  • Fulltime
Read More
Arrow Right

Network Infrastructure Engineer

Location
Location
United States , Fort Meade
Salary
Salary:
77600.00 - 176000.00 USD / Year
boozallen.com Logo
Booz Allen Hamilton
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience providing implementation and engineering support for DoD enterprise networks
  • 3+ years of experience with commercial hardware, networks, and cloud environments
  • 2+ years of experience conducting network discovery, including analyzing and documenting system requirements
  • Experience developing and executing test and implementation plans based on requirements
  • Ability to communicate effectively with both technical and non-technical personnel, multitask, and prioritize
  • Ability to travel to CONUS and OCONUS locations up to 40% of the time
  • Secret clearance
  • HS diploma or GED
  • Ability to obtain a DoD 8570.01 IAT Level II Certification within 6 months of hire date
Job Responsibility
Job Responsibility
  • Maintain responsibility for completing site surveys and creating structured designs for the customer’s network in support of voice, data, security, and audio and visual systems
  • Design engineered drawings, specifications, reports, and other technical documents
  • Work closely with other IT and facilities team members, project teams, clients, architects, engineers, subcontractors, vendors, material suppliers, and other technical resources to analyze business and technical requirements to develop system designs, estimates, implementation plans, management and customer reports, and coordinate the structured cabling design with other design disciplines
  • Implement final network solutions to support on-prem and cloud environments
  • Manage the production of network devices and network architecture design and develop all supporting documentation required for implementation in this global network
What we offer
What we offer
  • Health, life, disability, financial, and retirement benefits
  • Paid leave
  • Professional development
  • Tuition assistance
  • Work-life programs
  • Dependent care
  • Recognition awards program
Read More
Arrow Right

Senior Infrastructure Engineer

We are seeking a skilled and proactive individual to play a key role in supporti...
Location
Location
United Kingdom , Manchester
Salary
Salary:
Not provided
ans.co.uk Logo
ANS Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exposure to secure architecture design and implementation
  • Experience with the deployment and management Carbon Black or other EDR solutions across cloud infrastructure
  • Significant previous experience as an infrastructure engineer working on a large scale enterprise or multi-tenant environment
  • VMware 7.0+
  • Significant experience troubleshooting and analysing complex failures
  • Operational experience of NSX 3.0+
  • Scripting abilities in Powershell and PowerCLI
  • Experience with Cisco UCS or other enterprise blade systems
  • Significant Experience with Storage Technologies (HPE 3PAR, Nimble, Dell Compellent)
  • Experience with FC storage networking
Job Responsibility
Job Responsibility
  • Work to ensure conformity to public sector infrastructure requirements are met
  • Work in conjunction with our SoC team to develop and maintain platform security baselines
  • Monitor, diagnose and resolve significant problems within the ANS infrastructure
  • Be an escalation point for team members and the support teams offering technical expertise in virtualization, compute hardware and storage
  • Collaborate and work with other technical teams to provide industry leading support to our customers
  • Responsible for creating high quality documentation
  • Proactively work to identify areas of improvement in the platform
  • Effectively deliver project milestones
  • Responsible for the generation of LLD from HLD
  • Ensure our infrastructure is up to date by planning & performing patching and firmware upgrades
What we offer
What we offer
  • 25 days’ holiday, plus you can buy up to 5 more days
  • Birthday off
  • An extra celebration day
  • 5 days’ additional holiday in the year you get married
  • 5 volunteer days
  • Private health insurance
  • Pension contribution match and 4 x life assurance
  • Flexible working and work from anywhere for up to 30 days per year
  • Maternity: 16 weeks’ full pay
  • Paternity: 3 weeks’ full pay
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer – Hosting

As a Senior Infrastructure Engineer – Hosting you will be responsible for the de...
Location
Location
United States
Salary
Salary:
150000.00 USD / Year
corporatetools.com Logo
Corporate Tools
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3-5 years of experience in Linux system administration, virtualization, and cloud infrastructure
  • Experience with Proxmox or other hypervisors (VMware, KVM, Xen, Hyper-V)
  • Experience with Ceph or SAN storage solutions for virtualization
  • Ability to manage kernel tuning, system performance, and process optimization
  • Hands-on experience with Ceph storage, ZFS, iSCSI, NFS, RAID, and SAN architectures
  • Understanding of storage performance metrics (IOPS, throughput, latency)
  • Ability to work on projects solo or with a team
  • Love for learning and improving code
  • Strong communication and collaboration skills
  • Experience with WordPress hosting, database replication, and caching techniques
Job Responsibility
Job Responsibility
  • Develop and design robust and scalable hardware solutions
  • Take ownership of projects from conception to deployment, ensuring timely delivery and meeting the specified requirements
  • Work closely with cross-functional teams, including IT, product management, and other software teams, to ensure seamless integration and alignment with business objectives
  • Deploy, configure, and maintain Proxmox VE clusters for virtualization or other hypervisors
  • Implement high-availability (HA) and failover solutions for virtual machines
  • Manage resource allocation (CPU, memory, disk, network) to optimize performance for hosted applications
  • Automate VM deployment and configuration using Ansible, Terraform, or SaltStack
  • Maintain backups and disaster recovery plans for virtualized environments
  • Design and manage Ceph clusters or SAN storage (iSCSI, NFS, ZFS, etc.) for high-performance, redundant storage
  • Monitor and optimize storage performance, including IOPS, latency, and throughput
What we offer
What we offer
  • 100% employer-paid medical, dental and vision for employees
  • Annual review with raise option
  • 22 days Paid Time Off accrued annually, and 4 holidays
  • After 3 years, PTO increases to 29 days. Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
  • The 4 holidays are: New Year’s Day, Fourth of July, Thanksgiving, and Christmas Day
  • Paid Parental Leave
  • Up to 6% company matching 401(k) with no vesting period
  • Quarterly allowance
  • Use to make your remote work set up more comfortable, for continuing education classes, a plant for your desk, coffee for your coworker, a massage for yourself... really, whatever
  • Open concept office with friendly coworkers
  • Fulltime
Read More
Arrow Right

Principal Infrastructure Engineer

The Principal Infrastructure Engineer, Electronic Trading is responsible for sys...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of experience
  • experience in delivering infrastructure technologies products and services
  • experience in financial services or large complex and/or global environment preferred
  • experience developing projects for the identification of best practices (design of metrics, analytical tools, benchmarking activities and related reporting)
  • consistently demonstrate clear and concise written and verbal communication with ability to communicate technical concepts to a non-technical audience
  • proven analytical, diagnostic, and multitasking skills with focus on execution and attention to detail
  • demonstrated ability to both work independently and partner with virtual teams in a high-pressure matrix environment
  • demonstrated ability to take ownership of various parts of a project/initiative with tight deadlines or unexpected changes in expectation/requirements
  • bachelor's degree/university degree or equivalent experience
  • master’s degree preferred
Job Responsibility
Job Responsibility
  • conduct work on a variety of high-impact, high-profile problems/projects driving technology infrastructure aligned to the business
  • identify and resolve issues, engaging in Root Cause Analysis (RCA) if escalation
  • conduct responsibilities such as quality control, work allocation, coaching/mentoring, ensuring ongoing compliance with regulatory requirements
  • evaluate controls to help mitigate negative outcomes through prevention, detection, and correction
  • design and create complex processes and reporting streams, participate in the review and approval of requirement documents
  • examine and update processes and procedures for hardware acquisition toward automation
  • understand diverse stakeholder needs and share and influence stakeholder expectations
  • appropriately assess risk when business decisions are made, demonstrating consideration for the firm’s reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency
What we offer
What we offer
  • professional development opportunities
  • equal opportunity employer
  • work-life balance programs
  • Fulltime
Read More
Arrow Right

Staff Infrastructure System Engineer

Staff Infrastructure System Engineer role at Ledger, focused on designing, confi...
Location
Location
France , Paris
Salary
Salary:
Not provided
https://www.ledger.com Logo
Ledger
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 10 years of experience in Linux system administration and management in a high availability environment
  • Experience with virtualization and containerization environments (K8s, Docker, Proxmox)
  • Hands-on experience with configuration management tools such as Ansible and Infrastructure-as-code solutions such as SaltStack or Terraform
  • Working knowledge of system monitoring tools- especially Prometheus
  • Knowledge of encryption and PKI technologies
  • Highly motivated and self-driven
  • Proven level of autonomy
  • Ability to communicate, convince, explain, and justify choices
  • Honest and realistic
  • Creative problem solving and solutions assessment skills with an ability to identify develop and implement solutions to meet the needs of the business
Job Responsibility
Job Responsibility
  • Design, configuration and administration of systems infrastructure including core services, internal tools, monitoring solutions and bare metal servers
  • Researching, piloting, integrating, and implementing new technologies and infrastructure solutions
  • Supporting and contributing to the delivery of source code version management, continuous integration tools, and package management solutions
  • Accurately sizing and forecasting systems work packages within the infrastructure domain
  • Act as point of escalation for the design or setup of any systems delivery
  • Monitoring, optimizing and troubleshooting, diagnosing and resolving hardware or software incidents and problems
  • Protecting data, software, and hardware by coordinating, planning and implementing security measures
  • Configuration, monitoring and maintenance of backup and replication routines and organizing disaster recovery readiness
  • Ensure documentation of Ledger's infrastructure systems is up to date
What we offer
What we offer
  • Flexible work options - work from home up to 3 times per week
  • Health & Wellness support - Health and Life Insurance
  • Financial growth opportunities - employees can become shareholders in Ledger
  • Commuter allowance - contribution to preferred means of transportation
  • Learning & Development - comprehensive suite of training solutions providing personalised learning experience
  • Fulltime
Read More
Arrow Right

Software Developer – DevOps

Location
Location
United States , Libertyville
Salary
Salary:
Not provided
tekassembly.com Logo
tekAssembly
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience developing engineering applications for a large corporation
  • Bachelor of Degree in computer science, IT or similar field
  • Master’s degree in Computer Science is plus
  • Current understanding of best practices regarding system security measures
Job Responsibility
Job Responsibility
  • Analyze current technology utilized within the company and develop steps and processes to improve and expand upon them
  • Establish milestones for necessary contributions from departments and develop processes to facilitate their collaboration
  • Assist other department engineers in creating practical demonstrations of proposed solutions and demonstrating them to other members of the team
  • Provide detailed specifications for proposed solutions including materials, manpower and time necessary
  • Work closely with engineering professionals within the company to maintain hardware and software needed for projects to be completed efficiently
  • Work alongside project management teams to successfully monitor progress and implementation of initiatives
Read More
Arrow Right

Software Engineer Staff - SONiC NOS Developer

This role has been designed as ‘Hybrid’ with an expectation that you will work o...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a closely related field
  • Experience Required - 9 to 14yrs
  • Minimum one year of hands-on experience working with SONiC NOS
  • Sound understanding of SONiC architecture and operational experience with the SONiC network operating system
  • Experience working with Docker and debugging within environments
  • Proficiency in C/C++
  • Python programming skills are an advantage
  • Hands-on experience with PTF and SpyTest frameworks for network validation
  • Familiarity with Linux system internals and environment
  • Strong analytical and problem-solving capabilities
Job Responsibility
Job Responsibility
  • Design, develop, and maintain new features and enhancements for the SONiC network operating system platform
  • Create and execute comprehensive test plans using PTF (Packet Test Framework) and SpyTest to validate infrastructure robustness
  • Troubleshoot, debug, and resolve issues within SONiC-based environments
  • Collaborate closely with hardware engineers, QA/test teams, and other cross-functional partners to deliver end-to-end solutions
  • Participate in code reviews, contribute to architectural discussions, and lead documentation initiatives
  • Engage with the SONiC open-source community, tracking ecosystem developments and contributing to community-driven enhancements
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right