Member of Technical Staff, Performance Optimization

Fireworks AI

Location:
United States, San Mateo

Contract Type:
Not provided

Salary:
175,000 - 220,000 USD / year

Job Description:

We're looking for a Software Engineer focused on Performance Optimization to help push the boundaries of speed and efficiency across our AI infrastructure. In this role, you'll take ownership of optimizing performance at every layer of the stack—from low-level GPU kernels to large-scale distributed systems. A key focus will be maximizing the performance of our most demanding workloads, including large language models (LLMs), vision-language models (VLMs), and next-generation video models. You’ll work closely with teams across research, infrastructure, and systems to identify performance bottlenecks, implement cutting-edge optimizations, and scale our AI systems to meet the demands of real-world production use cases. Your work will directly impact the speed, scalability, and cost-effectiveness of some of the most advanced generative AI models in the world.

Job Responsibilities:

  • Optimize system and GPU performance for high-throughput AI workloads across training and inference
  • Analyze and improve latency, throughput, memory usage, and compute efficiency
  • Profile system performance to detect and resolve GPU- and kernel-level bottlenecks
  • Implement low-level optimizations using CUDA, Triton, and other performance tooling
  • Drive improvements in execution speed and resource utilization for large-scale model workloads (LLMs, VLMs, and video models)
  • Collaborate with ML researchers to co-design and tune model architectures for hardware efficiency
  • Improve support for mixed precision, quantization, and model graph optimization
  • Build and maintain performance benchmarking and monitoring infrastructure
  • Scale inference and training systems across multi-GPU, multi-node environments
  • Evaluate and integrate optimizations for emerging hardware accelerators and specialized runtimes

Requirements:

  • Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or equivalent practical experience
  • 5+ years of experience working on performance optimization or high-performance computing systems
  • Proficiency in CUDA or ROCm and experience with GPU profiling tools (e.g., Nsight, nvprof, CUPTI)
  • Familiarity with PyTorch and performance-critical model execution
  • Experience with distributed system debugging and optimization in multi-GPU environments
  • Deep understanding of GPU architecture, parallel programming models, and compute kernels

Nice to have:

  • Master’s or PhD in Computer Science, Electrical Engineering, or a related field
  • Experience optimizing large models for training and inference (LLMs, VLMs, or video models)
  • Knowledge of compiler stacks or ML compilers (e.g., torch.compile, Triton, XLA)
  • Contributions to open-source ML or HPC infrastructure
  • Familiarity with cloud-scale AI infrastructure and orchestration tools (e.g., Kubernetes, Ray)
  • Background in ML systems engineering or hardware-aware model design

What we offer:

  • Meaningful equity in a fast-growing startup
  • Competitive salary
  • Comprehensive benefits package

Additional Information:

Job Posted:
December 08, 2025

Employment Type:
Full-time

Work Type:
On-site
