AI Research Engineer, Scaling Jobs: A Comprehensive Career Overview

An AI Research Engineer specializing in Scaling occupies a critical technical role at the intersection of cutting-edge artificial intelligence research and high-performance, production-grade engineering. Professionals in these jobs are the architects and builders of the robust infrastructure that allows AI models, particularly large-scale neural networks, to be trained, evaluated, and deployed efficiently at immense scale. Their core mission is to break down computational barriers, transforming theoretical research and experimental prototypes into reliable, optimized systems that can leverage thousands of processors simultaneously. For individuals seeking these jobs, the work is fundamentally about enabling the next leaps in AI capability by ensuring that compute is not the limiting factor.

The typical responsibilities of an AI Research Engineer, Scaling span the entire lifecycle of large-scale AI systems. A primary duty is designing, implementing, and maintaining distributed training frameworks that orchestrate massive training runs across vast clusters of GPUs or TPUs (a minimal example of such a setup is sketched below). This requires ensuring fault tolerance, efficient data loading, and synchronized operations across hundreds or thousands of nodes. Concurrently, these engineers focus on the inference side, optimizing model deployment for maximum throughput and minimal latency, whether in cloud data centers or on edge devices. This involves implementing techniques such as model quantization, pruning, distillation, and efficient kernel design (a small quantization example also follows below). They are also responsible for building and managing the underlying platforms for experiment tracking, resource scheduling, and performance monitoring, ensuring that research teams can iterate rapidly and reliably.

To excel in these highly technical jobs, a specific and deep skill set is required. Proficiency in Python and C++ is standard, alongside an expert-level understanding of deep learning frameworks like PyTorch or TensorFlow. Crucially, candidates must possess extensive experience with distributed computing paradigms and tools such as DeepSpeed, FSDP (Fully Sharded Data Parallel), or similar frameworks. A strong grasp of scaling laws (one widely used formulation is given below), hardware architecture (especially GPU memory hierarchies and tensor cores), and performance profiling is essential. Skills in low-level optimization, including writing or tuning CUDA kernels, using compilers like TensorRT or XLA, and implementing advanced quantization schemes (e.g., INT8, FP8), are highly valued. Typically, a degree in Computer Science, Electrical Engineering, or a related field provides the foundational knowledge. Beyond technical prowess, systemic thinking, problem-solving under constraints, and a passion for pushing the boundaries of what's computationally possible are key traits for success in these challenging and impactful jobs.

Ultimately, AI Research Engineers in Scaling are the unsung enablers of modern AI breakthroughs. Their work directly dictates the speed of innovation, the feasibility of training ever-larger models, and the practical deployment of AI in real-world applications. For engineers who thrive on solving complex, system-level puzzles and want their work to have a multiplier effect on AI research and application, these jobs offer a unique and critical career path at the forefront of technology.
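
To make the distributed-training responsibility above concrete, here is a minimal sketch of sharded data-parallel training using PyTorch's FSDP, one of the frameworks named in this overview. The toy two-layer model, dimensions, dummy loss, and hyperparameters are illustrative placeholders rather than anything specified in the text; a real training system would add checkpointing, mixed precision, and a proper data loader.

```python
# Launch with: torchrun --nproc_per_node=8 fsdp_sketch.py
# (torchrun sets the RANK/WORLD_SIZE/LOCAL_RANK env vars read below)
import os

import torch
import torch.distributed as dist
import torch.nn as nn
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP


def main():
    dist.init_process_group(backend="nccl")
    torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

    # Toy model standing in for a large network (illustrative only).
    model = nn.Sequential(
        nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 1024)
    ).cuda()

    # FSDP shards parameters, gradients, and optimizer state across ranks,
    # so each GPU holds only a fraction of the full model state.
    model = FSDP(model)
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for _ in range(10):  # stand-in for iterating over a real data loader
        batch = torch.randn(8, 1024, device="cuda")
        loss = model(batch).pow(2).mean()  # dummy loss for illustration
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

Wrapping the model in FSDP is the key step: it is what lets model state exceed the memory of any single GPU, which is exactly the computational barrier this role exists to remove.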
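On the inference side, the simplest entry point to the INT8 quantization mentioned above is PyTorch's dynamic quantization, shown here as a minimal sketch. The toy model is again a placeholder; production pipelines would more often use static or FP8 quantization flows through tools such as TensorRT.

```python
import torch
import torch.nn as nn

# Toy model standing in for a trained network (illustrative only).
model = nn.Sequential(
    nn.Linear(1024, 4096), nn.ReLU(), nn.Linear(4096, 1024)
).eval()

# Dynamic quantization stores Linear weights as INT8 and quantizes
# activations on the fly at inference time, shrinking the model and
# often improving CPU latency for linear-layer-heavy workloads.
quantized = torch.ao.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

with torch.no_grad():
    out = quantized(torch.randn(1, 1024))
print(out.shape)  # torch.Size([1, 1024])
```

The trade-off an engineer in this role weighs is accuracy loss versus throughput and memory savings, which is why quantization is usually paired with the performance profiling mentioned among the core skills.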
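As for the scaling laws listed among the required skills, one widely cited parametric form from the compute-optimal training literature (included here purely as illustration, not something the original text specifies) models the final training loss L as a function of parameter count N and training tokens D:

```latex
L(N, D) = E + \frac{A}{N^{\alpha}} + \frac{B}{D^{\beta}}
```

Here E is the irreducible loss and A, B, alpha, and beta are empirically fitted constants; under a fixed compute budget of roughly C = 6ND FLOPs, minimizing L over N and D yields a compute-optimal model size and token count. Reasoning with fits like this is how scaling engineers decide whether the next training run should buy more parameters or more data.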