CrawlJobs Logo

HPC Hardware Service Engineer

United States, Irvine 92700.00 - 187500.00 USD / Year · Job Posted January 24, 2026
Apply Position
Job Link Share

Job Description

Come join our team at Hewlett-Packard/HP as a HPC On-site Hardware Service Engineer, where you will have the opportunity to work with cutting-edge technology and make a significant impact on our company's success. You will play a critical role in implementing and maintaining high-performance computing (HPC) systems that are essential to our business operations. We are looking for a self-motivated problem-solver with excellent communication skills and the desire to constantly learn!

Job Responsibility

  • Report daily to, and physically work at, the Customer’s Site
  • Engage in technical problem solving across multiple technologies
  • Creates and owns service tickets, via Salesforce, updates and drives the case through closure
  • Identifies, analyzes, diagnoses, troubleshoots and repairs hardware issues with focus on responsiveness and communication
  • Gather data, perform analysis, and escalate cases to higher-level product support groups, to ensure timely resolution of system or customer issues
  • Responsible for verifying and implementing detailed technical solutions to problems
  • Maintains ongoing log documenting issues and resolutions, for tracking and monthly discussion
  • Participate as part of a team and maintain a good relationship with team members and customers
  • Owns and produces customer documentation
  • Occasional travel for training is required

Requirements

  • 5+ years of professional experience and a Bachelor of Arts/Science or equivalent degree in computer science or related area of study
  • without a degree, three additional years of relevant professional experience (8+ years in total)
  • Experience installing, troubleshooting and supporting enterprise-level servers, storage, and networking equipment
  • Experience working in large-scale data center environments and/or High Performance Computing (HPC)
  • Experience with Linux based OS, hardware troubleshooting and diagnostics
  • Must be a self-starter who is able to work independently, without supervision, and within a team environment
  • Strong problem solving and self-management skills with attention to detail
  • Possesses an understanding of architectural dependencies of technologies
  • Ability to work in a multi-technology environment, with the ability to diagnose complex technical problems to their root cause
  • Ability to communicate broad and specific concepts with team members and peers
  • Ability to prioritize tasks and effectively communicate verbally and in writing
  • Role models Knowledge sharing and re-use within practice or profession
  • Strong desire to learn and improve
  • US Citizenship required

What we offer

  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

HPC Hardware Service Engineer

8 matching positions

HPC DMF Field Service Engineer

The HPC DMF Field Service Engineer provides technology consulting to external cu...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional experience
  • Bachelor of Arts/Science or equivalent degree in computer science or related area of study
  • Without a degree, 11+ years of relevant professional experience
  • Must have Linux/Unix experience
  • Sufficient depth and breadth of technical knowledge to design and scope multiple deliverables across multiple technologies
  • Demonstrated innovation and communication of new deliverables and offerings
  • Ability to develop solutions that enhance availability, performance, maintainability and agility
  • Understanding of architectural dependencies of technologies in customer's IT environment
  • Vendor or industry certification in at least one discipline area
  • Ability to work in multi-technology environment with ability to diagnose complex technical problems to root cause
Job Responsibility
Job Responsibility
  • Hardware Maintenance support for HP Proliance Server, Apollo Server etc
  • Verify and implement detailed technical design solutions
  • Provide detailed technical design for enterprise solutions
  • Lead technical assessment and delivery of technical solutions to customer
  • Coordinate implementation of new installations, designs, and migrations
  • Provide advanced technical consulting and advice on proposal efforts and solution design
  • Collect and determine data to identify customer needs and requirements
  • Respond to requests for technical information from customers
  • Develop customer technology solutions using various industry products
  • Engage in technical problem solving across multiple technologies
What we offer
What we offer
  • Health & Wellbeing comprehensive benefits suite
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Career development programs
  • Flexible work arrangements
  • Fulltime
Read More
Arrow Right

HPC Service Delivery Consultant

HPE is currently looking for a HPC Technical Consultant with a strong experience...
Location
Location
Italy
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Linux system expertise, with good experience with distribution management (Red Hat, ...)
  • HA clusters
  • Strong knowledge of HPC systems and underlying components
  • Parallel filesystems (Lustre, GPFS, …)
  • High-speed network (Infiniband, OmniPath, Slingshot …)
  • DevOps: Ansible, Git, Puppet, Bash or Python scripting, …
  • Parallel computing and development software stacks
  • Big Data databases: Elastic/OpenSearch
  • Monitoring tools and dashboards: Prometheus, Grafana
  • Docker containers, Kubernetes
Job Responsibility
Job Responsibility
  • Qualify, analyze, troubleshoot, and resolve whenever possible the incidents with the right level of autonomy & expertise
  • Increase autonomy and expertise of the HPE hardware team involved in repairs and maintenance tasks
  • Work with HPE level 3 support and engineering teams to diagnose complex issues and implement resolution actions
  • Interact with third parties involved in the overall HPC solution: Red Hat, AMD, Intel, Nvidia, SchedMD, …
  • Collaborate with the Customer sysadmin and support teams
  • Manage the customer relationship, including verbal & written communication
What we offer
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Fulltime
Read More
Arrow Right

HPC Service Delivery Consultant

HPE is seeking a HPC Service Delivery Consultant to monitor and contribute to th...
Location
Location
Italy
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Expert level knowledge and willingness to learn
  • understanding at a detailed level all architectural dependencies of technologies in the customer’s IT environment
  • analytical mindset
  • committed to deliver the service as expected by the customer and to meet contractual SLA
  • partnering, innovating, and continual improvement
  • able to speak Italian and English, with good verbal and written communication skills
Job Responsibility
Job Responsibility
  • Qualify, analyze, troubleshoot, and resolve whenever possible the incidents with the right level of autonomy & expertise
  • increase autonomy and expertise of the HPE hardware team involved in repairs and maintenance tasks
  • work with HPE level 3 support and engineering teams to diagnose complex issues and implement resolution actions
  • interact with third parties involved in the overall HPC solution: Red Hat, AMD, Intel, Nvidia, SchedMD
  • collaborate with the customer sysadmin and support teams
  • manage the customer relationship, including verbal & written communication
What we offer
What we offer
  • Health & Wellbeing benefits
  • personal & professional development programs
  • unconditional inclusion policies
  • Fulltime
Read More
Arrow Right

High Performance Computing Hardware Engineer

High Performance Computing Hardware Engineer role requiring Top Secret clearance...
Location
Location
United States , Dayton
Salary
Salary:
78700.00 - 181200.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Top Secret security clearance
  • 4+ years of professional experience
  • Bachelor's degree in computer science or related field (or 7+ years total experience without degree)
  • Security+ Certification
  • Linux+ Certification (required before start date)
  • Extensive Linux-based hardware troubleshooting and diagnostics experience
  • Breakfix experience
  • Ability to work independently and within a team environment
  • Ability to diagnose complex technical problems to root cause
  • Professional communication skills with customers and internal teams
Job Responsibility
Job Responsibility
  • Reports daily to and works physically at customer site
  • Accountable for meeting and maintaining customer SLA
  • Engages in technical problem solving across multiple technologies
  • Owns and drives service tickets including ordering parts for repairs
  • Gathers data, performs analysis, and escalates problems to higher-level support
  • Performs daily hardware diagnostics and repairs
  • Verifies and implements detailed technical solutions
  • Maintains good relationships with team members and customers
  • Collects data to determine customer needs and requirements
  • Responds to requests for technical information
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive benefits suite supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

High Performance Computing Hardware Engineer

Provide technology consulting to external customers and internal project teams. ...
Location
Location
United States , Aberdeen
Salary
Salary:
105500.00 - 243000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Top Secret Clearance Required
  • 4+ years of professional experience
  • Bachelor of Arts/Science or equivalent degree in computer science or related area of study
  • Without a degree, 7+ years of relevant professional experience
  • Security+ Certification required
  • Linux+ Certification required
  • Extensive Linux based hardware troubleshooting and diagnostics experience
  • Ability to work in a multi-technology environment
  • Ability to diagnose complex technical problems to their root cause
  • Self-starter who can work independently without supervision
Job Responsibility
Job Responsibility
  • Break fix experience required
  • Reports daily to and works physically at the Customer Site
  • Accountable for meeting and maintaining customer's SLA (Service Level Agreement)
  • Engages in technical problem solving across multiple technologies
  • Owns and drives service tickets including ordering parts for needed repairs
  • Gather data, perform analysis, and escalate problems to higher-level product support groups
  • Preforms daily hardware diagnostics and repairs
  • Responsible for verifying and implementing detailed technical solutions to problems
  • Participates as part of a team and maintains good relationships with team members and customers
  • Collects and determines data from appropriate sources to assist in determining customer needs and requirements
What we offer
What we offer
  • 10K Sign-On Bonus
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Comprehensive benefits suite supporting physical, financial and emotional wellbeing
  • Career development programs
  • Unconditional inclusion environment
  • Flexible work management
  • Fulltime
Read More
Arrow Right

Hardware Development Infrastructure Engineer

We’re looking for a Hardware Development Infrastructure Engineer to build and ru...
Location
Location
United States , San Francisco
Salary
Salary:
260000.00 - 335000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Familiarity with chip development workflows and at least one deep EDA domain (e.g., DV, PD, emulation, or formal verification)
  • Strong infrastructure fundamentals, including cloud platforms, networking, security, performance, and automation
  • Experience operating cloud environments (Azure preferred
  • AWS, GCP, or OCI acceptable) with strong infrastructure-as-code practices (e.g., Terraform, Bicep
  • configuration management tools a plus)
  • Strong programming skills (Python preferred) and solid software engineering and scripting practices
  • Experience building and operating CI/CD systems (e.g., Jenkins, Buildkite, GitHub Actions), including testing and release workflows
  • Database experience (e.g., Postgres or MySQL), including schema design, migrations, indexing, and operational safety
  • Clear communicator with strong judgment—able to explain tradeoffs, propose pragmatic solutions, and articulate a realistic vision for scalable infrastructure
Job Responsibility
Job Responsibility
  • Partner with hardware teams on workflows and tooling: Embed with teams across DV, PD, emulation, formal, and software to understand development flows, identify failure modes, and deliver tooling (CLIs, services, APIs) that reduces manual work and accelerates iteration
  • Build and operate regression systems at scale: Own regressions end-to-end—from definition and scheduling to execution, results ingestion, triage, and reporting—while improving throughput, reproducibility, and flake reduction
  • Own CI/CD for infrastructure and tooling: Design and operate pipelines for infrastructure-as-code, services, images, and cluster configuration changes, including testing, gated deploys, staged rollouts, and safe rollback
  • Run cloud and HPC platforms: Design, provision, and operate cloud infrastructure (Azure preferred) and HPC/HTC clusters (e.g., Slurm), tuning scheduling policies, autoscaling, node lifecycles, and cost-performance tradeoffs
  • Build data foundations and visibility: Develop ETL pipelines to ingest metrics, logs, and results
  • operate databases for workflow metadata and outcomes
  • and build dashboards that surface efficiency, utilization, and reliability trends
  • Drive operational excellence: Establish monitoring and alerting, lead incident response and postmortems, maintain runbooks, and produce clear, durable documentation
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right
New

Principal AI Factory Solution Product Manager

Product Manager - AI Factory Solution
Location
Location
United States , Spring
Salary
Salary:
152000.00 - 349000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
July 27, 2026
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, Business, or a related field
  • MBA or advanced degree preferred
  • 10+ years of product management experience, with at least 5 years focused on AI/ML products or solutions
  • Demonstrated ability to build large-scale AI solutions that bring together hardware, software and services into a cohesive offering
  • Strong understanding of AI technologies, including AI/ML lifecycle (training, tuning, inferencing), large language models, computer vision, and cloud-based AI platforms (e.g., AWS SageMaker, Microsoft AzureML, Google AI)
  • Proven track record of launching successful AI products, with experience in agile methodologies and tools like Jira
  • Background in High Performance Computing (HPC) and experience blending it with AI workloads will be an advantage
  • Excellent analytical skills, with proficiency in data analysis and market testing
  • Outstanding communication and stakeholder management abilities, capable of presenting to technical and non-technical audiences up to the senior executive/SVP levels
  • Ability to thrive in a startup-like fast-paced, innovative environment with strong problem-solving skills
Job Responsibility
Job Responsibility
  • Define and drive the overall AI factory at-scale and sovereign solution vision, roadmap, and features, while closely aligning with customer needs and HPE strategic goals
  • Define and drive the key software components necessary for the solution, which may be a mix of HPE developed, commercial and community IP
  • Conduct market research, competitive analysis, and customer interviews to identify AI factory opportunities and validate solution ideas and software features in a quick turn manner
  • Collaborate with engineers, product managers and presales architects to translate requirements into technical specifications and prototypes
  • Oversee the software integration and end-to-end solution lifecycle, from feature ideation and MVP development to launch, iteration, and scaling
  • Monitor solution performance using KPIs like full-stack wins, product mix, customer satisfaction, and iterate offering based on data insights
  • Work with legal, finance, pricing and supply chain to setup and manage resale contracts for commercial SW
  • Partner with sales and marketing to develop go-to-market strategies, pricing models, support strategies and customer enablement materials
  • Ensure solution complies with ethical AI standards while ensuring highest level of data privacy and sovereignty (e.g., GDPR, CCPA)
  • Stay abreast of AI trends, such as generative models, agentic AI, and industry applications
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

AI/HPC System Performance Engineer

Meta is building some of the world's largest AI and high-performance computing i...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience profiling and optimizing distributed AI or HPC workloads, including familiarity with GPU interconnects, RDMA networking, and collective communication frameworks such as NCCL or MPI
  • Experience debugging complex, non-reproducible performance issues across multi-layer systems including network fabric, operating system, and application layers
  • Experience designing and implementing performance monitoring systems, including instrumentation, telemetry pipelines, and alerting for large-scale infrastructure
  • Experience driving cross-functional technical projects from requirements definition through production deployment, including communicating performance findings and trade-offs to diverse stakeholders
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 6+ years of experience in system performance engineering, network infrastructure engineering, or a related field within large-scale distributed computing or HPC environments
Job Responsibility
Job Responsibility
  • Profile and benchmark AI training and inference workloads across large-scale HPC clusters to identify network, compute, and memory bottlenecks
  • Develop and maintain performance analysis frameworks and dashboards to track system-level metrics including GPU utilization, network bandwidth, latency, and collective communication efficiency
  • Investigate and resolve performance regressions in distributed AI training environments, including issues related to RDMA fabrics, collective communication libraries, and job scheduling
  • Collaborate with network infrastructure, hardware, and AI research teams to define performance requirements and validate new HPC cluster configurations
  • Design and execute capacity and scalability experiments to inform network topology decisions for AI supercomputing infrastructure
  • Build tooling and automation to continuously monitor HPC system health, detect anomalies, and reduce mean time to mitigation during performance incidents
  • Establish service level objectives for AI cluster network performance and drive cross-functional alignment on reliability and efficiency targets
  • Lead technical design reviews for network and system architecture changes affecting AI workload performance, communicating trade-offs clearly to engineering and product stakeholders
  • Mentor other engineers on HPC performance methodologies, debugging techniques, and instrumentation best practices
  • Leverage AI-assisted workflows to accelerate root cause analysis, automate routine performance reporting, and expand coverage across the HPC stack
What we offer
What we offer
  • bonus + equity + benefits
  • Fulltime
Read More
Arrow Right