CrawlJobs Logo

Inference Frontend

cerebras.net Logo

Cerebras Systems

Location Icon

Location:
United States , Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs.

Job Responsibility:

  • Collaborate with world-class engineers on real-world challenges across the software stack
  • Design, implement, and test software solutions that directly impact system performance and usability
  • Learn and contribute across multiple layers of a fully integrated AI-accelerated system
  • Gain hands-on experience with advanced hardware, compilers, distributed systems, and ML frameworks

Requirements:

  • Recently graduated or enrolled in a university program with a degree in Computer Science, Computer Engineering, or other related disciplines (graduating 2026)
  • Strong problem-solving skills and excellent communication skills
  • Proficient in one or more programming language – exposure and experience with C++ is an asset
What we offer:
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs

Additional Information:

Job Posted:
February 17, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Inference Frontend

Python / PyTorch Developer — Frontend Inference Compiler

Would you like to participate in creating the fastest Generative Models inferenc...
Location
Location
United Arab Emirates , Dubai
Salary
Salary:
Not provided
cerebras.net Logo
Cerebras Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree in Engineering, Computer Science, or equivalent in experience and evidence of exceptional ability
  • Strong Python programming skills and in-depth experience with PyTorch internals (e.g., TorchScript, FX, or Dynamo)
  • Solid understanding of computational graphs, tensor operations, and model tracing
  • Experience building or extending compilers, interpreters, or ML graph optimization frameworks
  • Experience working with PyTorch and HuggingFace Transformers library
  • Knowledge and experience working with Large Language Models (understanding Transformer architecture variations, generation cycle, etc.)
  • Strong C++ programming skills
  • Knowledge of MLIR based compilation stack
Job Responsibility
Job Responsibility
  • Analysis of new models from generative AI field and understanding of impacts on compilation stack
  • Develop and maintain model definition framework that consists of model building blocks to represent large language models based on PyTorch and Cerebras dialects ready to be deployed on Cerebras hardware
  • Develop and maintain the frontend compiler infrastructure that ingests PyTorch models and produces an intermediate representation (IR)
  • Extend and optimize PyTorch FX / TorchScript / TorchDynamo-based tooling for graph capture, transformation, and analysis
  • Collaboration with other teams throughout feature implementation
  • Research on new methods for model optimization to improve Cerebras inference
What we offer
What we offer
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs
Read More
Arrow Right

Senior Software Engineer

The AI & Innovation team at Microsoft Suzhou is seeking a highly motivated Senio...
Location
Location
China , Beijing
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Electrical Engineering, or related technical field AND 4+ years of technical engineering experience with coding in languages such as Python, C++, or C#
  • OR equivalent industry experience
  • 7+ years of software engineering experience with a focus on AI/ML systems
  • Proven experience with one or more of the following: Developing or applying generative AI models
  • Building and optimizing inference pipelines for large AI models on cloud infrastructure
  • Integrating AI features into consumer-facing web or mobile applications at scale
  • Working with programmatic advertising ecosystems
  • Familiarity with cloud services (Azure preferred), microservices architecture, and DevOps practices
  • Hands-on experience in at least two of the three core areas: AI/ML Prototyping: Experience with deep learning frameworks (PyTorch, TensorFlow) and implementing/tuning models from recent literature
  • Video/Graphics Processing: Experience with video codecs (FFmpeg), computer graphics, GPU programming (CUDA), or real-time media pipelines
Job Responsibility
Job Responsibility
  • Rapid AI Prototyping: Design, build, and iterate on high-potential prototypes for AI-powered video generation, editing, and content understanding
  • System Integration & Productionization: Bridge the gap between research prototypes and production-ready systems
  • Integrate AI video generation capabilities with large-scale advertising platforms and consumer products
  • Full-Stack Development: Develop end-to-end solutions encompassing backend AI service APIs, model inference optimization, and frontend interfaces
  • Cross-Functional Collaboration: Work closely with Applied Scientists, Machine Learning Engineers, Product Managers, and Ads Platform teams
  • Technical Leadership: Drive architectural decisions for scalable, reliable, and cost-effective AI service deployment
  • Mentor junior engineers and promote engineering best practices
  • Live Site Ownership: Participate in on-call rotations and act as a Designated Responsible Individual (DRI) to ensure the health, performance, and reliability of services
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (Typescript / Backend) - AI/ML

We are looking for a Senior Software Engineer to drive the development of AI/ML-...
Location
Location
United States
Salary
Salary:
131000.00 - 185000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience in production environments
  • Exposure to working directly with AI/ML technologies
  • Strong frontend skills with TypeScript/JavaScript and React
  • Backend development experience in TypeScript or Python, with a focus on API design and service architecture
  • You have a high level of ownership and can drive features from concept to production with minimal supervision
  • You thrive in collaborative environments and can effectively communicate technical concepts to diverse stakeholders
Job Responsibility
Job Responsibility
  • Feature Development: Design and implement AI-powered features across the full stack, from backend inference services to intuitive frontend interfaces within the ClickHouse Cloud platform
  • API Architecture: Create robust, scalable APIs that connect ClickHouse's database capabilities with modern AI/ML inference systems and external/internal AI services
  • UI/UX Implementation: Build responsive, intuitive user interfaces that make complex AI functionalities accessible and valuable to users of all technical backgrounds
  • Ecosystem Integrations: Implement and maintain integrations with the broader AI/ML ecosystem and standards, ensuring that ClickHouse as a technology works seamlessly with popular frameworks and tools
  • Technical Integration: Integrate models into production systems with proper monitoring, versioning, observability, and evaluation
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (Typescript / Backend) - AI/ML

We are looking for a Senior Software Engineer to drive the development of AI/ML-...
Location
Location
The Netherlands
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience in production environments
  • Exposure to working directly with AI/ML technologies
  • Strong frontend skills with TypeScript/JavaScript and React
  • Backend development experience in TypeScript or Python, with a focus on API design and service architecture
  • You have a high level of ownership and can drive features from concept to production with minimal supervision
  • You thrive in collaborative environments and can effectively communicate technical concepts to diverse stakeholders
Job Responsibility
Job Responsibility
  • Feature Development: Design and implement AI-powered features across the full stack, from backend inference services to intuitive frontend interfaces within the ClickHouse Cloud platform
  • API Architecture: Create robust, scalable APIs that connect ClickHouse's database capabilities with modern AI/ML inference systems and external/internal AI services
  • UI/UX Implementation: Build responsive, intuitive user interfaces that make complex AI functionalities accessible and valuable to users of all technical backgrounds
  • Ecosystem Integrations: Implement and maintain integrations with the broader AI/ML ecosystem and standards, ensuring that ClickHouse as a technology works seamlessly with popular frameworks and tools
  • Technical Integration: Integrate models into production systems with proper monitoring, versioning, observability, and evaluation
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior Software Engineer (TypeScript) - AI/ML

We are looking for a Senior Software Engineer to drive the development of AI/ML-...
Location
Location
United States
Salary
Salary:
131000.00 - 185000.00 USD / Year
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience in production environments
  • Exposure to working directly with AI/ML technologies
  • Strong frontend skills with TypeScript/JavaScript and React
  • Backend development experience in TypeScript or Python, with a focus on API design and service architecture
  • You have a high level of ownership and can drive features from concept to production with minimal supervision
  • You thrive in collaborative environments and can effectively communicate technical concepts to diverse stakeholders
Job Responsibility
Job Responsibility
  • Feature Development: Design and implement AI-powered features across the full stack, from backend inference services to intuitive frontend interfaces within the ClickHouse Cloud platform
  • API Architecture: Create robust, scalable APIs that connect ClickHouse's database capabilities with modern AI/ML inference systems and external/internal AI services
  • UI/UX Implementation: Build responsive, intuitive user interfaces that make complex AI functionalities accessible and valuable to users of all technical backgrounds
  • Ecosystem Integrations: Implement and maintain integrations with the broader AI/ML ecosystem and standards, ensuring that ClickHouse as a technology works seamlessly with popular frameworks and tools
  • Technical Integration: Integrate models into production systems with proper monitoring, versioning, observability, and evaluation
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (TypeScript) - AI/ML

We are looking for a Senior Software Engineer to drive the development of AI/ML-...
Location
Location
The Netherlands
Salary
Salary:
Not provided
clickhouse.com Logo
ClickHouse
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience in production environments
  • Exposure to working directly with AI/ML technologies
  • Strong frontend skills with TypeScript/JavaScript and React
  • Backend development experience in TypeScript or Python, with a focus on API design and service architecture
  • You have a high level of ownership and can drive features from concept to production with minimal supervision
  • You thrive in collaborative environments and can effectively communicate technical concepts to diverse stakeholders
Job Responsibility
Job Responsibility
  • Feature Development: Design and implement AI-powered features across the full stack, from backend inference services to intuitive frontend interfaces within the ClickHouse Cloud platform
  • API Architecture: Create robust, scalable APIs that connect ClickHouse's database capabilities with modern AI/ML inference systems and external/internal AI services
  • UI/UX Implementation: Build responsive, intuitive user interfaces that make complex AI functionalities accessible and valuable to users of all technical backgrounds
  • Ecosystem Integrations: Implement and maintain integrations with the broader AI/ML ecosystem and standards, ensuring that ClickHouse as a technology works seamlessly with popular frameworks and tools
  • Technical Integration: Integrate models into production systems with proper monitoring, versioning, observability, and evaluation
What we offer
What we offer
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Read More
Arrow Right

Senior AI Application Developer

We are seeking an experienced Senior AI Application Developer to design and buil...
Location
Location
Canada , Toronto
Salary
Salary:
145000.00 USD / Year
realign-llc.com Logo
Realign
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience in Python, Java, or Node.js
  • Hands-on experience with AI/ML frameworks (TensorFlow, PyTorch, etc.)
  • Experience with Generative AI and LLM integrations
  • Knowledge of REST APIs and microservices architecture
  • Experience with cloud platforms (AWS, Azure, or GCP)
  • Familiarity with Docker and Kubernetes
Job Responsibility
Job Responsibility
  • Design and develop AI-driven applications and services
  • Integrate ML/LLM models into backend and frontend systems
  • Build scalable APIs and microservices for AI applications
  • Optimize model performance and inference efficiency
  • Implement MLOps practices for model deployment and monitoring
  • Collaborate with Data Scientists, Cloud, and DevOps teams
  • Ensure security, scalability, and reliability of AI solutions
  • Provide technical leadership and mentor junior developers
  • Fulltime
Read More
Arrow Right

Software Engineer: ML Optimization

We internally call this team MBMB (More Big More Better). You will own optimizat...
Location
Location
United States , San Mateo; Somerville
Salary
Salary:
200000.00 - 350000.00 USD / Year
generalistai.com Logo
Generalist AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficient and stay current with the latest ML techniques for training and inference optimizations in transformer and diffusion based architectures
  • Will chase ML optimizations anywhere: From the CUDA kernels, to ML architecture, to frontend or backend network bottlenecks, CPU bottlenecks, NVLink and comms, to torch, numpy, and Python inefficiencies
Job Responsibility
Job Responsibility
  • Making GPUs go brrrrr
  • Implementing ML, hardware, and software changes that lead to step-function gains
  • Optimizing both the inference and training stacks
What we offer
What we offer
  • Offers Equity
  • Fulltime
Read More
Arrow Right