CrawlJobs Logo

Systems Research Engineer, GPU Programming

together.ai Logo

Together AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

160000.00 - 230000.00 USD / Year

Job Description:

As a Systems Research Engineer specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation.

Job Responsibility:

  • Optimize and fine-tune GPU code to achieve better performance and scalability
  • Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems
  • Stay up-to-date with the latest advancements in GPU programming techniques and technologies

Requirements:

  • Strong background in GPU programming and parallel computing, such as CUDA and/or Triton
  • Knowledge of ML/AI applications and models
  • Knowledge of performance profiling and optimization tools for GPU programming
  • Excellent problem-solving and analytical skills
  • Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experiences
What we offer:
  • competitive compensation
  • startup equity
  • health insurance
  • flexibility in terms of remote work

Additional Information:

Job Posted:
February 18, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Systems Research Engineer, GPU Programming

Geoint Systems Engineer

Reinventing Geospatial (RGi) is a leading expert in geospatial solutions for Def...
Location
Location
United States , Aberdeen Proving Grounds; Alexandria
Salary
Salary:
Not provided
rgi-corp.com Logo
Reinventing Geospatial
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active Top Secret clearance with an ability to obtain SCI access and willingness to obtain CI Polygraph
  • US Citizenship Required
  • Experience with installation, configuration, security hardening, operation, maintenance, and troubleshooting of: Windows operating systems (Server and Desktop environments), Linux operating systems (RHEL, CentOS, Ubuntu, or similar distributions)
  • Proficiency in managing and troubleshooting enterprise software including: Web servers (Apache, Nginx, IIS), Database systems (PostgreSQL, SQL Server, MySQL, Oracle), Web applications and services Middleware and application servers
  • Strong scripting and automation capabilities with knowledge of: General programming paradigms including data types, control flow structures, and logic constructs, PowerShell, Python, Bash/Shell scripting experience
  • Experience with REST API technologies including: Understanding of HTTP methods (GET, POST, PUT, DELETE, PATCH) and the ability to automate API interactions for system integration and operations, JSON/XML data handling
  • Comprehensive understanding of networking fundamentals: Network protocols (TCP, UDP, multicast, unicast), File sharing protocols (SMB, NFS), IP addressing schemes (IPv4/IPv6) and subnet calculations, Routing concepts and implementation, OSI model and troubleshooting methodology
  • Experience with network troubleshooting tools and techniques
  • Knowledge of system hardware architecture for selection, suitability analysis, operation, and troubleshooting: RAID configurations (0, 1, 5, 6, 10), HDD vs. SSD performance characteristics, SAN architecture and management, CPU architectures and performance considerations, RAM capacity and speed requirements, GPU capabilities for geospatial processing workloads
  • Ability to perform hardware capacity planning and performance optimization
Job Responsibility
Job Responsibility
  • Support the installation, configuration, operation, and maintenance of geospatial software systems
  • Utilize technical expertise across operating systems, enterprise applications, automation technologies, and hardware infrastructure to ensure mission-critical geospatial capabilities remain operational and secure
  • Analyze system capabilities with AGE and COE compliance requirements and identify gaps
  • Maintain functional specifications that define essential technical requirements of Legacy DCGS-A, IS&A, Mission Command, and COE CPCE
  • Maintain system engineering documentation including the System Engineering Plan, Software Requirements Traceability Matrix
  • Cross reference mapping of GEOINT functional specifications to Intelligence or Mission Command Systems specifications and program-level documents, such as the Capabilities Production Document (CPD), Information Systems Interface Control Document (IS-ICD), and Requirements Definition Package (RDP)
  • Interact with systems users to translate their requirements into systems, hardware, and software requirements and design
  • Plan and perform engineering research, design development, and other assignments in conformance with design, engineering and customer specifications
  • Lead team of engineers through project completion
  • responsible for major technical/engineering projects of higher complexity
What we offer
What we offer
  • 100% paid employee healthcare & dental insurance
  • Paid parental leave
  • 401k with matching
  • Escalating vacation time
  • Referral bonuses
  • Tuition reimbursement
  • Professional development training
  • Free beverages and snacks
  • Weekly catered lunches and breakfast on Fridays
  • Fulltime
Read More
Arrow Right

Senior Research Engineer

We are seeking a highly skilled Senior Research Engineer to collaborate closely ...
Location
Location
United States
Salary
Salary:
210000.00 - 309000.00 USD / Year
assembly.ai Logo
Assembly
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong expertise in the Python ecosystem and major ML frameworks (PyTorch, JAX)
  • Experience with lower-level programming (C++ or Rust preferred)
  • Deep understanding of GPU acceleration (CUDA, profiling, kernel-level optimization)
  • TPU experience is a strong plus
  • Proven ability to accelerate deep learning workloads using compiler frameworks, graph optimizations, and parallelization strategies
  • Solid understanding of the deep learning lifecycle: model design, large-scale training, data processing pipelines, and inference deployment
  • Strong debugging, profiling, and optimization skills in large-scale distributed environments
  • Excellent communication and collaboration skills, with the ability to clearly prioritize and articulate impact-driven technical solutions
Job Responsibility
Job Responsibility
  • Investigate and mitigate performance bottlenecks in large-scale distributed training and inference systems
  • Develop and implement both low-level (operator/kernel) and high-level (system/architecture) optimization strategies
  • Translate research models and prototypes into highly optimized, production-ready inference systems
  • Explore and integrate inference compilers such as TensorRT, ONNX Runtime, AWS Neuron and Inferentia, or similar technologies
  • Design, test, and deploy scalable solutions for parallel and distributed workloads on heterogeneous hardware
  • Facilitate knowledge transfer and bidirectional support between Research and Engineering teams, ensuring alignment of priorities and solutions
What we offer
What we offer
  • competitive equity grants
  • 100% employer-paid benefits
  • flexibility of being fully remote
  • Fulltime
Read More
Arrow Right

AI Research Lab Research Associate

We are currently seeking highly qualified interns to accelerate research towards...
Location
Location
United States , Milpitas
Salary
Salary:
43.27 - 93.15 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
May 26, 2026
Flip Icon
Requirements
Requirements
  • Pursuing PhD degree (or other degree with significant research and innovation experience) in a relevant discipline (e.g. machine learning, computer science, electrical engineering, math, statistics, etc.)
  • Track record of world-class innovative contributions and ideas in machine learning
  • Experience with innovative solution development, such as developing proofs-of-concept, first-of-a-kind solutions, and/or technology transfer
  • Experience in deep learning research
  • Experience in developing deep learning software with high proficiency in data structures and algorithms
  • Strong programming skills and experience with Python, C/C++, and preferably Java
  • Software development experience in Deep Learning, GPU acceleration, and Model Optimization
  • Experience in Deep Learning and Machine Learning frameworks and models like Tensorflow, PyTorch
  • Experience in Transformer Neural Network architectures for Generative AI and natural language processing
  • Experience with Agentic AI and Generative AI workflows - desired
Job Responsibility
Job Responsibility
  • Conduct research and come up with solutions with a fast turnaround time
  • Build the software and applications for Neural Networks and Machine Learning
  • Work with system programming, Deep Learning frameworks and models, GPU acceleration, Model optimization, real-time streaming data, distributed computing, and deployment
  • Provide thought leadership and technical influence both internally and externally to HPE
  • Collaborate with HPE Labs research teams as well as external partners
  • Work in alignment with HPE's broader innovation community.
What we offer
What we offer
  • Health & Wellbeing benefits including physical, financial and emotional wellbeing support
  • Personal and professional development programs
  • Unconditional inclusion and flexibility to manage work and personal needs.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Performance Optimization

We're looking for a Software Engineer focused on Performance Optimization to hel...
Location
Location
United States , San Mateo
Salary
Salary:
175000.00 - 220000.00 USD / Year
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience
  • 5+ years of experience working on performance optimization or high-performance computing systems
  • Proficiency in CUDA or ROCm and experience with GPU profiling tools (e.g., Nsight, nvprof, CUPTI)
  • Familiarity with PyTorch and performance-critical model execution
  • Experience with distributed system debugging and optimization in multi-GPU environments
  • Deep understanding of GPU architecture, parallel programming models, and compute kernels
Job Responsibility
Job Responsibility
  • Optimize system and GPU performance for high-throughput AI workloads across training and inference
  • Analyze and improve latency, throughput, memory usage, and compute efficiency
  • Profile system performance to detect and resolve GPU- and kernel-level bottlenecks
  • Implement low-level optimizations using CUDA, Triton, and other performance tooling
  • Drive improvements in execution speed and resource utilization for large-scale model workloads (LLMs, VLMs, and video models)
  • Collaborate with ML researchers to co-design and tune model architectures for hardware efficiency
  • Improve support for mixed precision, quantization, and model graph optimization
  • Build and maintain performance benchmarking and monitoring infrastructure
  • Scale inference and training systems across multi-GPU, multi-node environments
  • Evaluate and integrate optimizations for emerging hardware accelerators and specialized runtimes
What we offer
What we offer
  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package
  • Fulltime
Read More
Arrow Right

Customer Support Engineer

As a Customer Support Engineer at a pioneering AI company, you'll be the first l...
Location
Location
United States , San Francisco
Salary
Salary:
180000.00 - 260000.00 USD / Year
together.ai Logo
Together AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in a customer-facing technical role with at least 1 year in a support function in AI
  • Strong technical background, with knowledge of AI, ML, GPU technologies and their integration into high-performance computing (HPC) environments
  • Familiarity with infrastructure services (e.g., Kubernetes, SLURM), infrastructure as code solutions (e.g., Ansible) high-performance network fabrics, NFS-based storage management, container infrastructure, and scripting and programming languages
  • Familiarity with operating storage systems in HPC environments such as Vast and Weka
  • Familiarity with inspecting and resolving network-related errors
  • Strong knowledge of Python, TypeScript, and/or JavaScript with testing/debugging experience using curl and Postman-like tools
  • Foundational understanding in the installation, configuration, administration, troubleshooting, and securing of compute clusters
  • Complex technical problem solving and troubleshooting, with a proactive approach to issue resolution
  • Ability to work cross-functionally with teams such as Sales, Engineering, Support, Product and Research to drive customer success
  • Strong sense of ownership and willingness to learn new skills to ensure both team and customer success
Job Responsibility
Job Responsibility
  • Engage directly with customers to tackle and resolve complex technical challenges involving our cutting-edge GPU clusters and our inference and fine-tuning services
  • ensure swift and effective solutions every time
  • Become a product expert in all of our Gen AI solutions, serving as the last line of technical defense before issues are escalated to Engineering and Product teams
  • Collaborate seamlessly across Engineering, Research, and Product teams to address customer concerns
  • collaborate with senior leaders both internally and externally to ensure the highest levels of customer satisfaction
  • Transform customer insights into action by identifying patterns in support cases and working with Engineering and Go-To-Market teams to drive Together’s roadmap (e.g., future models to support)
  • Maintain detailed documentation of system configurations, procedures, troubleshooting guides, and FAQs to facilitate knowledge sharing with team and customers
  • Be flexible in providing support coverage during holidays, nights and weekends as required by business needs to ensure consistent and reliable service for our customers
What we offer
What we offer
  • competitive compensation
  • startup equity
  • health insurance
  • flexibility in terms of remote work
  • Fulltime
Read More
Arrow Right

Customer Support Engineer

As a Customer Support Engineer at a pioneering AI company, you'll be the first l...
Location
Location
India
Salary
Salary:
Not provided
together.ai Logo
Together AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in a customer-facing technical role with at least 1 year in a support function in AI
  • Strong technical background, with knowledge of AI, ML, GPU technologies and their integration into high-performance computing (HPC) environments
  • Familiarity with infrastructure services (e.g., Kubernetes, SLURM), infrastructure as code solutions (e.g., Ansible) high-performance network fabrics, NFS-based storage management, container infrastructure, and scripting and programming languages
  • Familiarity with operating storage systems in HPC environments such as Vast and Weka
  • Familiarity with inspecting and resolving network-related errors
  • Strong knowledge of Python, TypeScript, and/or JavaScript with testing/debugging experience using curl and Postman-like tools
  • Foundational understanding in the installation, configuration, administration, troubleshooting, and securing of compute clusters
  • Complex technical problem solving and troubleshooting, with a proactive approach to issue resolution
  • Ability to work cross-functionally with teams such as Sales, Engineering, Support, Product and Research to drive customer success
  • Strong sense of ownership and willingness to learn new skills to ensure both team and customer success
Job Responsibility
Job Responsibility
  • Engage directly with customers to tackle and resolve complex technical challenges involving our cutting-edge GPU clusters and our inference and fine-tuning services
  • ensure swift and effective solutions every time
  • Become a product expert in all of our Gen AI solutions, serving as the last line of technical defense before issues are escalated to Engineering and Product teams
  • Collaborate seamlessly across Engineering, Research, and Product teams to address customer concerns
  • collaborate with senior leaders both internally and externally to ensure the highest levels of customer satisfaction
  • Transform customer insights into action by identifying patterns in support cases and working with Engineering and Go-To-Market teams to drive Together’s roadmap (e.g., future models to support)
  • Maintain detailed documentation of system configurations, procedures, troubleshooting guides, and FAQs to facilitate knowledge sharing with team and customers
  • Be flexible in providing support coverage during holidays, nights and weekends as required by business needs to ensure consistent and reliable service for our customers
What we offer
What we offer
  • competitive compensation
  • startup equity
  • health insurance
  • flexibility in terms of remote work for the respective hiring region
Read More
Arrow Right

Principal Research Engineer - Agent 365

Copilot usage is growing rapidly across Microsoft 365 and custom agent experienc...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Architect and deliver AI systems across model development, data, infra, evaluation, and deployment spanning multiple product lines
  • Set technical direction for large programs
  • drive alignment across Research, Engineering, and Product
  • Integrate LLMs, multimodal models, multi-agent architectures, and RAG into Microsoft’s ecosystem
  • Establish standards for MLOps, governance, and Responsible AI, compliant with Microsoft principles and industry standards
  • Drive original research and thought leadership (whitepapers, internal notes, patents)
  • convert insights into shipped capabilities
  • Research Translation: Continuously review emerging work
  • identify high-potential methods and adapt them to Microsoft problem spaces
  • Production Integration: Turn research prototypes into production-quality code optimized for scale, latency, and maintainability
  • Fulltime
Read More
Arrow Right

AI Engineer

Location
Location
Vietnam , Da Nang
Salary
Salary:
Not provided
saigontechnology.com Logo
Saigon Technology
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Have programming skills in one of these languages: Python, Java...
  • At least 1 years of experience working with AI/ML projects as a Data Engineer, Research Engineer, or Software Engineer.
  • Have experience with one of ML/DL Frameworks: Tensorflow, Pytorch, Keras, scikit-learn, Pandas, Langchain, LlamaIndex…
  • Have experience working with OpenAI, Gemini, any LLM... to build agents, workflows, or RAG systems.
  • Have experience with one of chatbot building frameworks or services like Rasa, Dialogflow, Transformer, BERT, LLM/Prompt…
  • Familiar with OCR algorithms or services: OpenCV, Tesseract, Textract (AWS), Google Cloud Vision, PaddleOCR.
  • Experience with common development tools: Linux, GPU server, Google Colab, Jupyter, Git, Docker.
  • Good English proficiency and communication skills.
Job Responsibility
Job Responsibility
  • Join in developing and applying the ML/DL techniques to resolve our client’s business problems such as building chatbot systems, LLM/Prompt, OCR systems, fraud detection systems, facial recognition systems…
  • Join in developing some internal products applying AI models
What we offer
What we offer
  • Competitive Salary and Brilliant Health Benefits
  • Attractive salary (13th-month salary, salary review twice/year) and project bonus
  • Bonus programs for candidate referral, technical article writing
  • Interest-free loan support for personal plan
  • Allowance for sickness, maternity, paternity and periodic health examination
  • PVI health care program
  • The staff of the quarter and year reward
  • Progressive and Fun Working Environment
  • A professional English-speaking working environment with Agile – Scrum model
  • Hybrid Working Model: Flexible working time and WFH support.
  • Fulltime
Read More
Arrow Right