CrawlJobs Logo

System Performance Engineer

India, Hyderabad · Job Posted June 15, 2026
Apply Position
Job Link Share

Job Description

At Teradata, we believe that people thrive when empowered with better information. Teradata Autonomous Knowledge Platform activates enterprise intelligence by unifying data, knowledge and business context to achieve tangible outcomes. With Teradata, organizations can provide agents with full context for impact when it matters. Our solution lets businesses connect and scale on premises, in the cloud, or through a hybrid approach. Teradata delivers real business value with AI. System Performance Engineers are key members of a specialized technical team focused on diagnosing, analyzing, and improving performance across the Teradata analytics ecosystem. In this role, you will work on complex customer environments to investigate CPU, memory, I/O, network, and query-processing behavior; identify bottlenecks; and recommend tuning or configuration changes that improve stability, scalability, and throughput. The ideal candidate combines strong analytical problem-solving skills with hands-on experience in relational databases, Linux or UNIX systems, and performance diagnostics in enterprise-scale environments.

Job Responsibility

  • Analyze system and database performance using logs, metrics, and diagnostic tools to identify performance bottlenecks and abnormal behavior
  • Work directly with customers and internal teams to investigate performance issues and deliver clear, actionable recommendations
  • Apply structured troubleshooting methods to isolate, reproduce, and resolve complex workload, query, and infrastructure performance problems
  • Perform root cause analysis for degradation incidents and document findings, corrective actions, and preventive recommendations
  • Partner with engineering teams to identify product defects, validate fixes, and improve observability and performance diagnostics
  • Track technical actions, tuning recommendations, and ownership across cases to ensure timely follow-through and high-quality customer communication
  • Deliver a strong customer experience through professional communication, technical depth, and a focus on measurable performance outcomes
  • Follow incident and escalation processes for priority issues while maintaining clear status updates and technical case documentation
  • Support a 24x7 global environment through participation in on-call rotations, weekend coverage, or after-hours escalations as required
  • Contribute to proactive performance improvement initiatives, including trend analysis, repeatable diagnostics, and continuous tuning best practices

Requirements

  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related technical discipline
  • 3+ years of experience in system performance engineering, database administration, technical support, or a related role in enterprise data environments
  • Hands-on experience supporting Linux or UNIX systems in production environments
  • Strong analytical and problem-solving skills with the ability to interpret performance metrics and troubleshoot complex technical issues
  • Experience working collaboratively across support, engineering, and customer-facing teams
  • Strong communication, prioritization, and time-management skills in a fast-paced technical environment
  • Experience using ticketing, incident management, or case tracking systems such as ServiceNow

Nice to have

  • Experience supporting performance in complex database or analytics environments at enterprise scale (Teradata experience is a plus)
  • Strong knowledge of relational databases, SQL execution plans, indexing strategies, and query optimization
  • Exposure to public cloud platforms and performance considerations in cloud-based data environments
  • Experience with scripting or automation using Python, Shell, Perl, or similar languages
  • Knowledge of Linux system administration, system resource analysis, and operating system performance tools
  • Understanding of network fundamentals and their impact on end-to-end system performance
  • Experience with workload analysis, benchmarking, trend analysis, or capacity planning is a plus
  • Familiarity with storage, I/O, and infrastructure components that influence database and platform performance
  • Teradata platform, DBA, or analytics ecosystem experience is strongly preferred

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

System Performance Engineer

8 matching positions

Senior System Performance Engineer

Role As a Senior System Performance Engineer on GM's AV System Performance Tea...
Location
Location
United States , Austin;Mountain View
Salary
Salary:
128700.00 - 261300.00 USD / Year
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 3+ years of relevant industry experience
  • Hands-on programming experience with C++ and Python
  • Strong understanding of computer architecture and system-level software fundamentals
  • Proven experience with performance profiling, analysis, tuning, and optimization
  • Experience developing or optimizing high-performance software, ideally for heterogeneous compute environments (e.g., GPUs, DSPs, or accelerators)
  • Familiarity with industry benchmarks and workloads (e.g., MLPerf)
  • Strong communication skills with the ability to influence technical decisions within a team or product area
  • Ability to lead projects through ambiguity and deliver results end to end
  • BS, MS in Computer Science or a related technical field (or equivalent practical experience)
Job Responsibility
Job Responsibility
  • Collaborate with performance leads and partner engineering teams to align on performance requirements, development practices, and improvement opportunities
  • Lead performance-focused engineering initiatives with moderate ambiguity and cross-team collaboration
  • Contribute to the roadmap for performance tooling, frameworks, and methodologies that support efficient and scalable AV software development
  • Evaluate and prototype new tools, techniques, and technologies to improve runtime performance and developer workflows
  • Design, implement, and maintain tools and automated systems that support performance analysis, debugging, and continuous monitoring
  • Apply and help improve performance engineering standards, processes, and best practices at the team level
  • Analyze software behavior, identify performance bottlenecks, and collaborate with product teams to propose and implement optimizations
  • Mentor junior engineers on performance profiling, optimization strategies, and engineering best practices
What we offer
What we offer
  • Medical
  • Dental
  • Vision
  • Health Savings Account
  • Flexible Spending Accounts
  • Retirement savings plan
  • Sickness and accident benefits
  • Life insurance
  • Paid vacation & holidays
  • Tuition assistance programs
  • Fulltime
Read More
Arrow Right

AI/HPC System Performance Engineer

Meta is building some of the world's largest AI and high-performance computing i...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience profiling and optimizing distributed AI or HPC workloads, including familiarity with GPU interconnects, RDMA networking, and collective communication frameworks such as NCCL or MPI
  • Experience debugging complex, non-reproducible performance issues across multi-layer systems including network fabric, operating system, and application layers
  • Experience designing and implementing performance monitoring systems, including instrumentation, telemetry pipelines, and alerting for large-scale infrastructure
  • Experience driving cross-functional technical projects from requirements definition through production deployment, including communicating performance findings and trade-offs to diverse stakeholders
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 6+ years of experience in system performance engineering, network infrastructure engineering, or a related field within large-scale distributed computing or HPC environments
Job Responsibility
Job Responsibility
  • Profile and benchmark AI training and inference workloads across large-scale HPC clusters to identify network, compute, and memory bottlenecks
  • Develop and maintain performance analysis frameworks and dashboards to track system-level metrics including GPU utilization, network bandwidth, latency, and collective communication efficiency
  • Investigate and resolve performance regressions in distributed AI training environments, including issues related to RDMA fabrics, collective communication libraries, and job scheduling
  • Collaborate with network infrastructure, hardware, and AI research teams to define performance requirements and validate new HPC cluster configurations
  • Design and execute capacity and scalability experiments to inform network topology decisions for AI supercomputing infrastructure
  • Build tooling and automation to continuously monitor HPC system health, detect anomalies, and reduce mean time to mitigation during performance incidents
  • Establish service level objectives for AI cluster network performance and drive cross-functional alignment on reliability and efficiency targets
  • Lead technical design reviews for network and system architecture changes affecting AI workload performance, communicating trade-offs clearly to engineering and product stakeholders
  • Mentor other engineers on HPC performance methodologies, debugging techniques, and instrumentation best practices
  • Leverage AI-assisted workflows to accelerate root cause analysis, automate routine performance reporting, and expand coverage across the HPC stack
What we offer
What we offer
  • bonus + equity + benefits
  • Fulltime
Read More
Arrow Right

AI/HPC System Performance Engineer

Meta's AI Training and Inference Infrastructure is growing exponentially to supp...
Location
Location
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience with developing, evaluating and debugging host networking protocols such as RDMA
  • 10+ years of experience in designing, deploying and operating networks
  • Experience with triaging performance issues in complex scale-out distributed applications
  • Understanding of AI training workloads and demands they exert on networks
Job Responsibility
Job Responsibility
  • Lead multi-disciplinary teams to develop solutions for large scale training systems. Assess trade-offs of various solutions and make pragmatic decisions
  • Ensure timely milestone delivery with teamwork and close collaboration
  • Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues
  • Defining technical strategy and driving a multi-year roadmap to make progress towards the related objectives
  • Work with crossfunctional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Ai/hpc System Performance Engineer, Phd

Meta's AI Training and Inference Infrastructure is growing exponentially to supp...
Location
Location
United States , Menlo Park
Salary
Salary:
122000.00 - 181000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • BS/MS/PhD in relevant fields (EE, CS), with 2+ years work experience
  • Experience with using communication libraries, such as MPI, NCCL, and UCX
  • Experience with developing, evaluating and debugging host networking protocols such as RDMA
  • Experience with triaging performance issues in complex scale-out distributed applications
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Active member of a multi-disciplinary team to develop solutions for large scale training systems
  • Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues
  • Identify potential performance issues across the stack: comms lib, RDMA transport, host networking, scheduling and network fabric. Develop and deploy innovative solutions to address the performance issues
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI/HPC System Performance Engineer

Meta's AI Training and Inference Infrastructure is growing exponentially to supp...
Location
Location
United States , Austin
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience with developing, evaluating and debugging host networking protocols such as RDMA
  • 10+ years of experience in designing, deploying and operating networks
  • Experience with triaging performance issues in complex scale-out distributed applications
Job Responsibility
Job Responsibility
  • Lead multi-disciplinary teams to develop solutions for large scale training systems. Assess trade-offs of various solutions and make pragmatic decisions
  • Ensure timely milestone delivery with teamwork and close collaboration
  • Responsible for the overall performance of the communication system, including performance benchmarking, monitoring and troubleshooting production issues
  • Defining technical vision and driving a multi-year roadmap to make progress towards the related objectives
  • Work with cross functional teams and provide guidance on the AI network architecture including topologies, transport, congestion control techniques
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Senior System Support Engineer – High Performance Computing

The HPC Senior System Support Engineer provides highly visible on-site technical...
Location
Location
Australia , Canberra
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A minimum TSPV Government Security clearance is mandatory for the role
  • Expertise in Linux/Unix operating systems, parallel file systems (e.g., Lustre, GPFS) and networking technologies is essential
  • Proficient in programming and scripting languages such as Python and C++
  • Ability to develop solutions that enhance the availability, performance, maintainability and agility of HPC solutions
  • Has contributed to the design and application of new tools
  • Possesses an understanding, at a detailed level, of architectural dependencies of technologies in use in the customer's IT environment
  • Frequently uses product and application knowledge along with internals or architectural knowledge to develop solutions
  • Able to communicate with internal and external senior management confidently and demonstrate the professionalism
  • Ability to work in a multi- technology environment with the ability to diagnose complex technical problems to their root cause
  • In addition to troubleshooting skills and consulting skills, has ability to summarise prognosis and impact at practice lead level
Job Responsibility
Job Responsibility
  • Responsible for verifying and implementing the detailed technical design solution to the problem as identified by the Project/Technical Manager
  • Provides detailed technical design, analyses and develops enterprise solutions
  • Regularly leads technical assessment and delivery solutions to the customer
  • Coordinates implementation of new installations, designs, and migrations for HPC solutions
  • Provides advanced technical consulting and advice to others on proposal efforts, solution design, system management, tuning and modification of solutions
  • Provides input to the company strategy moving forward
  • Collects and determines data from appropriate sources to assist in determining customer needs and requirements
  • Responds to requests for technical information from customers
  • Engages in technical problem solving across multiple technologies
  • often needs to develop new methods to apply to the situation
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

System Engineer / System Owner for Data Historian & Rotronic Monitoring System

We are seeking a motivated, technically skilled, and communicative System Engine...
Location
Location
Switzerland , Kaiseraugst
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
August 31, 2026
Flip Icon
Requirements
Requirements
  • Degree, Technician Certification, or completed vocational training in Computer Science / Information Technology, Automation Engineering, Electrical Engineering or a comparable technical discipline
  • Strong hands-on experience with the AVEVA PI Platform
  • Advanced knowledge of PI System configuration and interface implementation
  • Experience with PLC and Process Control Systems, preferably Siemens S7
  • Experience with Computerized System Validation (CSV) and IT Qualification
  • Good understanding of Active Directory environments
  • Experience working in regulated pharmaceutical manufacturing environments
  • Proven experience within Pharmaceutical Production and GxP-regulated environments
  • Experience in Project Management
  • Strong understanding of system lifecycle management and compliance requirements
Job Responsibility
Job Responsibility
  • Serve as the System Owner for AVEVA PI and Rotronic Monitoring Systems (RMS)
  • Ensure reliable operation and lifecycle management of both systems across the Basel and Kaiseraugst sites
  • Manage system performance, availability, maintenance, and continuous improvement activities
  • Create, review, and maintain validation and qualification documentation
  • Ensure compliance with GMP, GxP, CSV, and internal quality standards
  • Support audits and inspections and coordinate remediation activities where required
  • Lead Incident, Problem, Change, and Deviation Management processes
  • Coordinate service providers and internal stakeholders to ensure timely issue resolution
  • Drive continuous service improvements and operational excellence
  • Lead and contribute to system-related projects and upgrades
  • Fulltime
Read More
Arrow Right

Senior Staff Software Engineer (Impala Performance Engineer)

At Cloudera, we empower people to transform complex data into clear and actionab...
Location
Location
Salary
Salary:
Not provided
cloudera.com Logo
Cloudera
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of industry experience in performance related work ideally on large scale distributed systems
  • Understanding of DBMS algorithms and data structure fundamentals
  • Understanding of hardware trends and full stack systems performance: CPU, RAM, storage, network, Linux kernel, JVM, distributed systems performance
  • Deep understanding of performance measurement methodologies and performance analysis tools and techniques
  • Strong design and coding skills (Java/C++/Golang/Python preferred)
  • Ability to work in a distributed setting with team members spread across multiple geographies
  • Demonstrated ability to work on large cross-functional projects, including strong communication skills and a collaborative mindset
  • Hands-on experience with containerization and Kubernetes
  • B.S. or M.S. in Computer Science or equivalent experience
Job Responsibility
Job Responsibility
  • Work with internal development teams and the open source community to proactively drive performance improvements/optimizations across our data warehouse stack
  • Work with product managers, developers and the field team to understand performance and scale requirements and customer workload characteristics, and develop benchmarks and related performance analysis tooling based on these requirements
  • Analyze performance and scalability characteristics to identify bottlenecks in large-scale distributed systems
  • Perform root cause analysis of performance issues identified by internal testing and from customers and suggest corrective actions
  • Evaluate performance of competitor systems
What we offer
What we offer
  • Generous PTO Policy
  • Support work life balance with Unplugged Days
  • Flexible WFH Policy
  • Mental & Physical Wellness programs
  • Phone and Internet Reimbursement program
  • Access to Continued Career Development
  • Comprehensive Benefits and Competitive Packages
  • Paid Volunteer Time
  • Employee Resource Groups
  • Fulltime
Read More
Arrow Right