System Engineer

China, Shanghai · 360000.00 - 480000.00 CNY / Year · Job Posted May 16, 2026

Job Description

The company has almost 100 million customers in Japan and 1 billion globally, and provides more than 70 services across areas such as e-commerce, payment services, financial services, telecommunications, media, and sports. The position is with the GPUOD team.

Job Responsibility

  • Optimize Kubernetes (K8s) for GPU workloads, including scheduling policies, autoscaling, and multi-tenant resource isolation
  • Deploy and maintain inference serving platforms (e.g., NVIDIA Triton, vLLM, SGLang) for high-throughput and low-latency model deployment
  • Automate cluster provisioning, monitoring, and recovery to maximize uptime and GPU utilization
  • Collaborate with ML engineers to troubleshoot GPU-related issues in training jobs (e.g., NCCL errors, OOM) and inference bottlenecks
  • Implement observability tools (Prometheus, Grafana) to track GPU utilization, job performance, and cluster health
  • Develop infrastructure-as-code (IaC) solutions for reproducible GPU environments (e.g., Terraform, Ansible)

Requirements

  • 3+ years of experience in DevOps/MLOps, GPU infrastructure, or distributed computing
  • Deep expertise in Kubernetes (K8s) for GPU workload orchestration (e.g., Kubeflow, Volcano, custom schedulers)
  • Strong programming skills in Go or Python for platform development, automation and tooling
  • Proficiency in Linux system administration, performance tuning, and networking (e.g., RDMA, InfiniBand)
  • Experience with IaC tools (Terraform, Ansible) and CI/CD pipelines (GitHub Actions, Jenkins)
  • Bachelor's or higher degree in Computer Science, Engineering, or a related field
  • Strong teamwork and communication skills, with a passion for solving infrastructure challenges

Nice to have

  • Familiarity with distributed training frameworks (e.g., PyTorch DDP, FSDP, DeepSpeed)
  • Familiarity with NVIDIA Triton or a similar serving framework, and with tuning serving parameters to balance latency against throughput
  • Hands-on experience with GPU clusters, including troubleshooting NVIDIA drivers, CUDA, and NCCL issues
  • Knowledge of high-performance storage (Lustre, WekaFS) for large-scale training data
  • Experience with LLM training/inference stacks (e.g., Megatron-LM, TensorRT-LLM)

Similar Jobs for System Engineer

8 matching positions

System Engineer

I’m working with one of the world’s most respected quantitative investment firms...
Location: Singapore, Singapore
Salary: Not provided
Iceberg Cyber Security (thisisiceberg.com)
Expiration Date: Until further notice
Requirements
  • Windows enterprise administration, desktop engineering and Microsoft 365
  • Linux/UNIX systems administration
  • PC hardware, software deployment and endpoint support
  • Mobile device management
  • Trading floor technology and high-performance user support
  • Videoconferencing, AV and enterprise communications platforms
  • Infrastructure administration across servers, storage and enterprise systems
  • Troubleshooting complex technical issues across hardware, software and networking
  • Python, PowerShell or other scripting languages for automation
  • Process improvement and operational automation
Job Responsibility
  • Provide hands-on support to users
  • Identify patterns, improve processes, automate repetitive work
  • Continually raise the standard of the environment
  • Work across Windows and Linux systems, desktop engineering, communications platforms, storage, clustered computing, mobile technologies, AV systems and datacentre infrastructure
  • Partner with colleagues across the business
  • Collaborate with global engineering teams on infrastructure projects and operational improvements

System Engineer

I’m working with one of the world’s most respected quantitative investment firms...
Location: Hong Kong, Hong Kong
Salary: Not provided
Iceberg Cyber Security (thisisiceberg.com)
Expiration Date: Until further notice
Requirements
  • Windows enterprise administration, desktop engineering and Microsoft 365
  • Linux/UNIX systems administration
  • PC hardware, software deployment and endpoint support
  • Mobile device management
  • Trading floor technology and high-performance user support
  • Videoconferencing, AV and enterprise communications platforms
  • Infrastructure administration across servers, storage and enterprise systems
  • Troubleshooting complex technical issues across hardware, software and networking
  • Python, PowerShell or other scripting languages for automation
  • Process improvement and operational automation

System Engineer

Drive accountable and compliant system development to fulfill V-model guidelines...
Location: China, Suzhou
Salary: Not provided
BorgWarner (borgwarner.com)
Expiration Date: Until further notice
Requirements
  • Master’s degree with a major in power electrification and electronics or in automation
  • 5+ years of power electronics development experience in system/software/EE design
  • Proficient in power electronics, especially inverter products
  • Proficient in IGBT/SiC power device application and protection
  • Familiar with PMSM/ASM FOC motor control algorithms
  • Familiar with motor calibration process
  • Familiar with SPWM/SVPWM modulation algorithm
  • Familiar with ASPICE/Agile development process
  • Familiar with engineering tools like Vector CANalyzer/CANoe/CANape
  • Good leadership with the ability to guide and drive the team
Job Responsibility
  • Drive accountable and compliant system development to fulfill V-model guidelines and standard Engineering Development Frameworks (EDF)
  • As System Coordinator, define the system development timeline in alignment with project requirements and ensure its effective implementation
  • Clarify and define the technical specifications of all system design deliverables across multi-disciplinary engineering teams
  • Manage and integrate customer requirements specifications and additional stakeholder requirements into system requirements
  • Contribute to regular Project Status Reviews and report progress to management
  • Participate in Project Core Team meetings and CCB meetings on a regular basis
  • Conduct daily communication with specialists in the System Design Department to oversee the development and maintenance of system design deliverables
  • Ensure that the relevant standards and work rules of the System Design department are applied and respected
  • Monitor the progress of system design deliverables on a regular basis and define corrective countermeasures to address progress bottlenecks
  • Perform risk management activities specific to the system design deliverables throughout the development lifecycle
Employment Type: Full-time

System Engineer

Alter Domus is seeking a skilled and proactive Microsoft 365 Specialist (System ...
Location: Malta, Birkirkara
Salary: Not provided
Alter Domus (alterdomus.com)
Expiration Date: Until further notice
Requirements
  • Minimum of 3 years of relevant experience in the IT industry
  • Exceptional oral and written communication skills
  • Detail-oriented and organized, with a proven ability to meet deadlines in a fast-paced environment
  • Strong work ethic, responsiveness, and a commitment to customer service excellence
  • Excellent team player with strong interpersonal skills
  • Self-motivated and capable of thriving in a project-based environment
  • Advanced expertise in mail flow, hybrid configurations, transport rules, connectors, and accepted domains
  • Strong troubleshooting: message trace, header analysis
  • Strong knowledge in Conditional Access, Identity Protection, MFA, Single Sign-On
  • Intune Management: Device compliance, Windows Autopatch & Autopilot, Security baselines, configuration profiles, update rings, and app deployment
Job Responsibility
  • Manage and support the Microsoft 365 environment, including Exchange Online, Intune, SharePoint, Defender, Entra ID and Teams
  • Collaborate with cross-functional teams to integrate Microsoft 365 solutions with existing systems and workflows, enhancing overall operational efficiency
  • Oversee the procurement, allocation, and management of Microsoft 365 licenses, ensuring compliance with licensing agreements and optimizing license usage across the organization
  • Implement and maintain security protocols and best practices (CIS) to protect sensitive data within the Microsoft ecosystem, including user access controls, data loss prevention, and threat protection measures
  • Monitor system performance and security incidents, generating reports and insights to inform decision-making and improve service delivery
  • Deliver technical support and training to the Level 1 and Level 2 teams
  • Stay updated on the latest Microsoft 365 features and security trends, recommending enhancements and upgrades to improve functionality and security posture
  • Develop and maintain comprehensive documentation that supports end user operations, including guides for troubleshooting common issues, step-by-step instructions and best practices
What we offer
  • Support for professional accreditations such as ACCA and study leave
  • Flexible arrangements, generous holidays, plus an additional day off for your birthday
  • Continuous mentoring along your career progression
  • Active sports, events and social committees across our offices
  • 24/7 support available from our Employee Assistance Program
  • The opportunity to invest in our growth and success through our Employee Share Plan
  • Plus additional local benefits depending on your location

System Engineer

2HB Incorporated is seeking a Systems Engineer to support its government custome...
Location: United States, Annapolis Junction
Salary: Not provided
2HB (2hb.com)
Expiration Date: Until further notice
Requirements
  • Containers/Cloud environments
  • Storage Management
  • Virtualization
  • IP networking and firewall experience
  • ALL LINUX
  • Solid understanding of PKI/TLS/crypto, including crypto policies
  • Experience with automation tools is a bonus
  • Application support and general troubleshooting
  • SSP compliance and STE/STN support on Linux systems.
Job Responsibility
  • Application support and general troubleshooting
  • SSP compliance and STE/STN support on Linux systems.
Employment Type: Full-time

System Engineer

DCS Corp is seeking an experienced Systems Engineer to support the U.S. Army Com...
Location: United States, Sterling Heights
Salary: Not provided
DCS Corporation (dcscorp.com)
Expiration Date: Until further notice
Requirements
  • U.S. citizenship is required
  • Ability to obtain and maintain a DoD Secret clearance
  • Bachelor’s degree in Systems Engineering, Electrical Engineering, Mechanical Engineering, Computer Engineering, Software Engineering, or a related technical discipline with 5 years of experience
  • Strong technical communication skills, including the ability to interface effectively with government customers, program managers, systems engineers, software engineers, test engineers, and other technical stakeholders
  • Experience supporting requirements development, requirements decomposition, verification planning, or system integration activities
  • Ability to analyze complex technical problems, identify root causes, and coordinate practical engineering solutions
  • Familiarity with systems engineering lifecycle processes, including requirements analysis, architecture development, integration, verification, validation, and sustainment
  • Ability to work in a fast-paced, team-oriented environment supporting defense or ground vehicle programs
  • Experience developing or reviewing technical documentation, engineering reports, test plans, or system specifications
Job Responsibility
  • Develop, analyze, and manage system-level requirements in coordination with government customers, engineering teams, and program stakeholders
  • Support system architecture development, functional analysis, trade studies, and technical decision-making for ground vehicle systems
  • Assist in defining system interfaces, integration strategies, verification methods, and validation plans
  • Collaborate with software, electrical, mechanical, cybersecurity, test, and logistics engineering teams to ensure system requirements are properly implemented and verified
  • Support integration and test activities, including troubleshooting system-level issues, documenting results, and coordinating corrective actions
  • Prepare and maintain technical documentation, including requirements specifications, interface control documents, test procedures, technical reports, design descriptions, and engineering briefings
  • Participate in technical reviews, design reviews, working groups, and customer meetings
  • Support configuration management, risk management, change impact assessments, and traceability of requirements throughout the system lifecycle
  • Apply established systems engineering processes, standards, and best practices to support program execution and technical quality
Employment Type: Full-time

System Engineer

As a System Engineer (m/f/d) you are the central contact person for our customer...
Location: Switzerland, Adliswil
Salary: Not provided
Parking Network B.V. (parking.net)
Expiration Date: June 30, 2026
Requirements
  • Federal Certificate of Competence in Computer Science, similar training or equivalent professional experience
  • Excellent communication skills and enjoyment of customer contact
  • Independent, flexible work style and team spirit
  • Fluent German and good English skills
  • Experience in the technical or networking field, as well as a good understanding of IT.
Job Responsibility
  • Responsible support: You will remotely support our customers from within the support team and ensure that their systems function stably and reliably
  • Ensuring service quality: You process technical requests according to agreed service levels (SLA) and actively contribute to improving response and resolution times
  • Consulting and optimization: You identify optimization potential in existing installations and support customers in the introduction of new products and solutions
  • Embrace teamwork: You contribute your technical know-how – especially in the areas of IT, networks or system integration – in close collaboration with Global Technical Support and our partners.
What we offer
  • Thorough onboarding and technical training
  • A lot of personal responsibility and trust
  • Stable, international company with a long-term perspective
  • A friendly, down-to-earth team
Employment Type: Full-time

System Engineer

Work at the intersection of hardware, software, and real-world applications, sha...
Location: Sweden, Uppsala
Salary: Not provided
Life Science Talent (life-science-talent-solutions.dk)
Expiration Date: Until further notice
Requirements
  • A master’s degree in engineering (or equivalent experience)
  • A few years of demonstrable work experience in engineering related to mechanics, electronics, automation or similar
  • Broad technical curiosity—experience with CAD, electronics, or programming is a plus
  • Understanding of life sciences or regulated environments (GxP is a strong advantage)
  • Fluency in English – Swedish would be a plus but not required.
Job Responsibility
  • Play a central role in product development—from early ideas through to final delivery
  • Working closely with product managers, designers, and engineers, define what needs to be built and help ensure it meets real user needs
  • Build and test prototypes across software, electronics, and mechanical systems
  • Take ownership of documentation—creating manuals, specifications, and training materials
  • Act as a key link between hardware, software, and application teams, keeping communication clear and aligned
  • Support and mentor others, contributing to a team culture focused on quality, accountability, and continuous improvement.