Compute Engineer

India, Bangalore · Job Posted June 26, 2025

Job Description

The Compute Engineer role involves providing advanced technical expertise in compute infrastructure and virtualization solutions, leading troubleshooting and root cause analysis, designing and implementing automation routines, and mentoring junior engineers. The role requires proficiency in Linux administration, VMware environments, scripting tools, and disaster recovery/HA solutions.

Job Responsibility

  • Resolve customer issues via telephone, email, or remote sessions
  • Reproduce issues in-house and respond in a timely manner
  • Follow up regularly with customers with recommendations, updates, and action plans
  • Identify and escalate issues to the vendor in a timely manner according to Standard Operating Procedures
  • Leverage internal technical expertise, including peers, mentors, knowledge base, community forums and other internal tools, to provide the most effective solutions to customer issues
  • Collaborate with other CoE/HW teams in diagnosing and isolating the cause of complex issues
  • Maintain quality on case documentation, SLA timeframes and operational metrics
  • Perform within the team's productivity measures (scorecard)
  • Incident Management: Resolve single and cross-technology incidents independently
  • Lead team members in resolving complex or cross-technology incidents
  • Escalation Management: Identify, manage, and lead technical escalations
  • Participate in formal escalations when required, especially during a crisis
  • Problem Management: Proactively and reactively look for solutions to prevent problems from occurring in the team/technology area
  • Perform trend and root cause analysis
  • Change Management/Implementation: Independently prepare, review, and implement change records, including their rollback and test plans
  • Perform risk and impact analysis for changes
  • Lead or participate in the Change Advisory Board
  • Patch and Security Management: Apply patch and security changes per policy
  • Proactively monitor the environment for patch compliance
  • Analyze patches for compatibility with each customer or internal infrastructure environment
  • Configuration Management: Ensure Configuration Management Database (CMDB) entries are complete and accurate
  • Lead resolution of critical incidents and escalations, ensuring minimal business impact
  • Perform in-depth analysis of system logs, kernel dumps, and performance metrics
  • Design and implement automation for routine tasks using Ansible, Shell, Python, etc.
  • Lead patch management, vulnerability remediation, and compliance reporting
  • Maintain and implement high availability (HA) and disaster recovery (DR) solutions
  • Conduct capacity planning, performance tuning, and infrastructure optimization
  • Own and drive problem management processes, including RCA documentation and preventive measures
  • Participate in Change Advisory Boards (CAB) and lead complex change implementations
  • Maintain and audit CMDB for accuracy and completeness
  • Provide technical leadership in customer meetings and strategic planning sessions

Requirements

  • Deep expertise in HPE Compute platforms (C7000, Synergy, Virtual Connect, ProLiant)
  • Advanced Linux administration (RHEL, SUSE) including kernel tuning, system hardening, and troubleshooting
  • Strong virtualization experience in VMware (vSphere, SRM, Horizon), KVM, and Hyper-V
  • Proficient in VMware infrastructure management: VM lifecycle operations, cluster management, performance monitoring, capacity planning, patching, backup/restore, and snapshot handling
  • Skilled in analyzing logs (VM-support, HPSreport, SOSreport) and performing root cause analysis
  • Solid understanding of storage technologies (SAN/NAS/DAS) and protocols (FC, iSCSI, FCoE)
  • Experience with Red Hat Satellite, SUSE Manager, and patch lifecycle management
  • Expertise in HA/DR solutions using Serviceguard, Pacemaker, and Linux clustering
  • Familiarity with networking fundamentals (VLANs, MTU, flow control) and troubleshooting
  • Strong scripting and automation skills using Bash, Python, and Ansible
  • Excellent communication, documentation, and customer engagement skills

Nice to have

  • Experience with cloud platforms (AWS, Azure, GCP) and hybrid cloud integration
  • Familiarity with Infrastructure as Code (IaC) tools like Terraform
  • Knowledge of REST APIs, PowerShell, and database systems (PostgreSQL, MySQL)
  • Exposure to containerization (Docker, Podman) and orchestration (Kubernetes)
  • Experience in DevOps practices and CI/CD pipelines
  • Ability to deliver internal training and mentor junior engineers
  • Proven track record of driving continuous improvement and innovation

What we offer

  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion fostering diversity and flexibility

Similar Jobs for Compute Engineer

8 matching positions

Compute Engineer

HPE Operations is our innovative IT services organization. It provides the exper...
Location: India, Bangalore
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Broad technical knowledge of HPE ISS solutions: installing, configuring, and troubleshooting C7000 enclosures, HPE Synergy, Virtual Connect, blade switches (SAS, Ethernet, and FC), ProLiant blades, and storage blades
  • Operating systems knowledge: installing, configuring, administering, and troubleshooting RHEL/SUSE (as a bare-metal OS and as VMs on hypervisors) and VMware
  • Working knowledge of Red Hat/SUSE Linux
  • Analyzing OS logs for hardware issues using VM-support, HPSreport, SOSreport, Support-Config, etc.
  • Knowledge of SAN and NAS technologies (Ethernet/iSCSI, FC, FCoE)
  • Knowledge of DAS storage and HBAs: Smart Array/RAID, SSDs, SAS, SATA, etc.
  • Disaster recovery planning and conducting DR tests
  • Delivering routine performance analysis, capacity analysis, and security audit reports to customers for necessary planned changes
  • Linux vulnerability assessment and mitigation
  • Serviceguard cluster configuration and management on Linux, and integration with database and ERP solutions
Job Responsibility
  • Provides Operate and Admin support for compute infrastructure and the operating system in accordance with contractually established terms and conditions and established technical standards
  • Provides technical input, solutions, and recommendations to deal pursuits
  • Engages in and supports transition/transformation efforts
  • Provides IT infrastructure and/or application infrastructure lifecycle technical support, including planning, project management, installation, ongoing management/monitoring/troubleshooting, and de-installation, following operational policies and processes that are compliant with industry standards (e.g., Information Technology Infrastructure Library (ITIL))
  • Manages the technical/service relationship between the company and the customer, and between the company and subcontractors/vendors
  • Works with key customers and/or internal business/end-user representatives (Infrastructure Support Managers, Client Manager, and Account Delivery Manager) to retain customers and build the business
  • Resolve customer issues via telephone, email, or remote sessions
  • Reproduce issues in-house and respond in a timely manner
  • Follow up regularly with customers with recommendations, updates, and action plans
  • Identify and escalate issues to the vendor in a timely manner according to Standard Operating Procedures
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Compute Engineer

HPE Operations is our innovative IT services organization. It provides the exper...
Location: India, Bangalore
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Deep expertise in HPE Compute platforms (C7000, Synergy, Virtual Connect, ProLiant)
  • Advanced Linux administration (RHEL, SUSE) including kernel tuning, system hardening, and troubleshooting
  • Strong virtualization experience in VMware (vSphere, SRM, Horizon), KVM, and Hyper-V
  • Proficient in VMware infrastructure management: VM lifecycle operations, cluster management, performance monitoring, capacity planning, patching, backup/restore, and snapshot handling
  • Skilled in analyzing logs (VM-support, HPSreport, SOSreport) and performing root cause analysis
  • Solid understanding of storage technologies (SAN/NAS/DAS) and protocols (FC, iSCSI, FCoE)
  • Experience with Red Hat Satellite, SUSE Manager, and patch lifecycle management
  • Expertise in HA/DR solutions using Serviceguard, Pacemaker, and Linux clustering
  • Familiarity with networking fundamentals (VLANs, MTU, flow control) and troubleshooting
  • Strong scripting and automation skills using Bash, Python, and Ansible
Job Responsibility
  • Acts as a senior technical expert in Compute infrastructure, VMware virtualization, and Linux-based operating systems, providing advanced support and strategic guidance
  • Leads complex troubleshooting, root cause analysis, and performance tuning across enterprise environments
  • Provides architectural input and contributes to the design and implementation of infrastructure solutions
  • Supports transition and transformation initiatives, including migrations, upgrades, and automation efforts
  • Ensures compliance with ITIL processes and industry best practices
  • Acts as a technical liaison between internal teams, customers, and third-party vendors
  • Mentors junior engineers and contributes to knowledge sharing and process improvement
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Compute Engineer

HPE Operations is our innovative IT services organization. It provides the exper...
Location: India, Bangalore
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Deep expertise in HPE Compute platforms (C7000, Synergy, Virtual Connect, ProLiant)
  • Advanced Linux administration (RHEL, SUSE) including kernel tuning, system hardening, and troubleshooting
  • Strong virtualization experience in VMware (vSphere, SRM, Horizon), KVM, and Hyper-V
  • Proficient in VMware infrastructure management: VM lifecycle operations, cluster management, performance monitoring, capacity planning, patching, backup/restore, and snapshot handling
  • Skilled in analyzing logs (VM-support, HPSreport, SOSreport) and performing root cause analysis
  • Solid understanding of storage technologies (SAN/NAS/DAS) and protocols (FC, iSCSI, FCoE)
  • Experience with Red Hat Satellite, SUSE Manager, and patch lifecycle management
  • Expertise in HA/DR solutions using Serviceguard, Pacemaker, and Linux clustering
  • Familiarity with networking fundamentals (VLANs, MTU, flow control) and troubleshooting
  • Strong scripting and automation skills using Bash, Python, and Ansible
Job Responsibility
  • Acts as a senior technical expert in Compute infrastructure, VMware virtualization, and Linux-based operating systems, providing advanced support and strategic guidance
  • Leads complex troubleshooting, root cause analysis, and performance tuning across enterprise environments
  • Provides architectural input and contributes to the design and implementation of infrastructure solutions
  • Supports transition and transformation initiatives, including migrations, upgrades, and automation efforts
  • Ensures compliance with ITIL processes and industry best practices
  • Acts as a technical liaison between internal teams, customers, and third-party vendors
  • Mentors junior engineers and contributes to knowledge sharing and process improvement
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Compute Engineer

The candidate provides Operate and Admin support on Compute infrastructure and t...
Location: India, Bangalore
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Broad technical knowledge of HPE ISS solutions: installing, configuring, and troubleshooting C7000 enclosures, HPE Synergy, Virtual Connect, blade switches (SAS, Ethernet, and FC), ProLiant blades, and storage blades
  • Operating systems knowledge: installing, configuring, administering, and troubleshooting RHEL/SUSE (as a bare-metal OS and as VMs on hypervisors) and VMware
  • Working knowledge of Red Hat/SUSE Linux
  • Analyzing OS logs for hardware issues using VM-support, HPSreport, SOSreport, Support-Config, etc.
  • Knowledge of SAN and NAS technologies (Ethernet/iSCSI, FC, FCoE)
  • Knowledge of DAS storage and HBAs: Smart Array/RAID, SSDs, SAS, SATA, etc.
  • Disaster recovery planning and conducting DR tests
  • Delivering routine performance analysis, capacity analysis, and security audit reports to customers for necessary planned changes
  • Linux vulnerability assessment and mitigation
  • Serviceguard cluster configuration and management on Linux, and integration with database and ERP solutions
Job Responsibility
  • Resolve customer issues via telephone, email, or remote sessions
  • Reproduce issues in-house and respond in a timely manner
  • Follow up regularly with customers with recommendations, updates, and action plans
  • Identify and escalate issues to the vendor in a timely manner according to Standard Operating Procedures
  • Leverage internal technical expertise, including peers, mentors, knowledge base, community forums and other internal tools, to provide the most effective solutions to customer issues
  • Collaborate with other CoE/HW teams in diagnosing and isolating the cause of complex issues
  • Maintain quality on case documentation, SLA timeframes and operational metrics
  • Perform within the team's productivity measures (scorecard)
  • Incident Management: Resolve single and cross-technology incidents independently
  • Lead team members in resolving complex or cross-technology incidents
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Compute Engineer

This role involves providing advanced technical expertise in Compute infrastruct...
Location: India, Bangalore
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Deep expertise in HPE Compute platforms (C7000, Synergy, Virtual Connect, ProLiant)
  • Advanced Linux administration (RHEL, SUSE) including kernel tuning, system hardening, and troubleshooting
  • Strong virtualization experience in VMware (vSphere, SRM, Horizon), KVM, and Hyper-V
  • Proficient in VMware infrastructure management: VM lifecycle operations, cluster management, performance monitoring, capacity planning, patching, backup/restore, and snapshot handling
  • Skilled in analyzing logs (VM-support, HPSreport, SOSreport) and performing root cause analysis
  • Solid understanding of storage technologies (SAN/NAS/DAS) and protocols (FC, iSCSI, FCoE)
  • Experience with Red Hat Satellite, SUSE Manager, and patch lifecycle management
  • Expertise in HA/DR solutions using Serviceguard, Pacemaker, and Linux clustering
  • Familiarity with networking fundamentals (VLANs, MTU, flow control) and troubleshooting
  • Strong scripting and automation skills using Bash, Python, and Ansible
Job Responsibility
  • Acts as a senior technical expert in Compute infrastructure, VMware virtualization, and Linux-based operating systems, providing advanced support and strategic guidance
  • Leads complex troubleshooting, root cause analysis, and performance tuning across enterprise environments
  • Provides architectural input and contributes to the design and implementation of infrastructure solutions
  • Supports transition and transformation initiatives, including migrations, upgrades, and automation efforts
  • Ensures compliance with ITIL processes and industry best practices
  • Acts as a technical liaison between internal teams, customers, and third-party vendors
  • Mentors junior engineers and contributes to knowledge sharing and process improvement
  • Lead resolution of critical incidents and escalations, ensuring minimal business impact
  • Perform in-depth analysis of system logs, kernel dumps, and performance metrics
  • Design and implement automation for routine tasks using Ansible, Shell, Python, etc.
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Employment type: Full-time

Senior Software Engineer - Compute

At the heart of orchestrating this monumental compute infrastructure is the Comp...
Location: United States, Pittsburgh
Salary: 146000.00 - 234000.00 USD / Year
Company: Aurora Innovation (aurora.tech)
Expiration Date: Until further notice
Requirements
  • 5+ years of professional software engineering experience
  • Deep expertise in Golang (for core systems) and Python (for SDK/API layering)
  • Strong understanding of distributed systems fundamentals (e.g., CAP theorem, consensus algorithms, or gossip protocols)
  • Experience with performance profiling and tuning (e.g., memory management, I/O bottlenecks, or network latency optimization)
  • Specialized knowledge of container orchestration systems like Kubernetes
  • Proven track record of driving continuous performance, scalability, and resilience improvements in production environments managing critical data
  • Familiarity with cloud provider compute and data services (e.g., AWS EKS, S3, RDS)
Job Responsibility
  • Design, implement, and maintain core components of the high-performance, large-scale distributed batch compute engine (BatchAPI). Architect and optimize the scheduler, resource allocator, and execution engine of BatchAPI to handle bursty, heterogeneous workloads with minimal overhead
  • Design low-latency APIs and resilient communication protocols that bridge our Python SDK with the Golang-based core engine
  • Develop high-level workflow abstractions, enabling engineers across the company to programmatically define, deploy, and manage complex data processing, simulation, and ML training pipelines
  • Solve complex problems in distributed locking, throttling, and fair-share scheduling to ensure multi-tenant stability
  • Drive continuous improvements in the performance, scalability, and resilience of the entire compute infrastructure, implementing robust monitoring and alerting systems to maintain operational excellence for critical workflows
  • Collaborate closely with infrastructure and product engineering teams (e.g., Autonomy, Data, Simulation, Machine Learning) to gather requirements, provide expert consultation, and integrate compute workflows with key company systems
What we offer
  • Annual bonus
  • Equity compensation
  • Benefits
Employment type: Full-time

Senior GPU Compute Engineer (Rust)

A high-performance computing company building next-generation GPU orchestration ...
Location: Poland
Salary: 100000.00 EUR / Year
Company: Signify Technology (signifytechnology.com)
Expiration Date: Until further notice
Requirements
  • Strong experience with GPU programming (Vulkan, CUDA, OpenCL, or similar)
  • Solid understanding of GPU architecture, memory, and performance optimisation
  • Systems programming background in Rust, C, or C++
  • Experience with low-level programming concepts including DMA, PCIe, and memory management
  • Familiarity with compute shaders, SPIR-V, and GPU debugging/profiling tools
  • Exposure to embedded systems, device drivers, or hardware integration is beneficial
  • Understanding of high-performance networking or RDMA is a plus
Job Responsibility
  • Integrate advanced GPU hardware into a distributed orchestration platform
  • Develop and optimise Vulkan compute pipelines and GPU kernels
  • Build low-level systems software in Rust/C++ for GPU control and monitoring
  • Improve GPU scheduling, memory management, and resource utilisation
  • Optimise high-speed GPU-to-GPU communication and RDMA networking
  • Work closely with hardware and SDK teams to troubleshoot performance and integration issues
What we offer
  • Work on cutting-edge GPU and compute infrastructure
  • Opportunity to influence architecture and SDK development
  • Collaborative engineering environment with direct hardware exposure
  • Remote working
  • Competitive compensation package
Employment type: Full-time

Advanced Platform Engineer - Compute

Feedzai is the world’s first RiskOps platform for financial risk management, and...
Location: Portugal
Salary: Not provided
Company: Feedzai (feedzai.com)
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Information Systems, or the equivalent combination of education, experience, and training
  • 6+ years of hands-on experience in platform engineering, DevOps, or cloud infrastructure
  • Strong programming skills in Go, Java, or similar, with a track record of designing and delivering maintainable systems
  • Deep experience with container technologies and orchestration (Docker, Kubernetes), including operator development or ecosystem tooling
  • Proven experience with CI/CD (e.g. Jenkins, GitLab) and GitOps (e.g. FluxCD, Argo CD)
  • Substantial experience with at least one major cloud provider (AWS, GCP, Azure) and familiarity with cloud-native patterns
  • Strong experience with monitoring and observability (e.g. Grafana, Prometheus) and using data to drive reliability and performance
  • Solid experience with Infrastructure-as-Code (e.g. Terraform, Crossplane) and platform lifecycle management
  • Track record of leading projects, driving technical decisions, and mentoring others
  • Self-driven, collaborative, and motivated to improve how we build and run the platform
Job Responsibility
  • Lead the design, implementation, and evolution of Kubernetes Operators and platform services, including deployment, monitoring, and operations
  • Drive development in Go or similar languages, setting standards and best practices for the team
  • Own and evolve automation for cloud infrastructure and incident response, and champion self-healing and reliability improvements
  • Define and improve playbooks, runbooks, and alerting strategies to streamline response and reduce toil
  • Own and advance the product deployment pipeline and GitOps practices (e.g. FluxCD, Argo CD)
  • Lead or coordinate incident response, root cause analysis, and post-incident reviews; drive preventive measures
  • Work with AI-assisted development tools (e.g. Cursor) as part of your daily workflow to ship faster and iterate effectively
  • Own and extend Infrastructure as Code (IaC) and platform lifecycle (monitoring, alerting, security, cost, configuration, backup) in production
  • Contribute to developer experience and internal platform capabilities so product teams can ship with less friction
Employment type: Full-time