CrawlJobs Logo

Sr. Deployment Engineer, AI Inference

cerebras.net Logo

Cerebras Systems

Location Icon

Location:
United States; Canada , Sunnyvale

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Cerebras Systems builds the world's largest AI chip, 56 times larger than GPUs. Our novel wafer-scale architecture provides the AI compute power of dozens of GPUs on a single chip, with the programming simplicity of a single device. This approach allows Cerebras to deliver industry-leading training and inference speeds and empowers machine learning users to effortlessly run large-scale ML applications, without the hassle of managing hundreds of GPUs or TPUs. We are seeking a highly skilled and experienced Sr. Deployment Engineer to build and operate our cutting-edge inference clusters. These clusters would provide the candidate an opportunity to work with the world's largest computer chip, the Wafer-Scale Engine (WSE), and the systems that harness its unparalleled power. You will play a critical role in ensuring reliable, efficient, and scalable deployment of AI inference workloads across our global infrastructure.

Job Responsibility:

  • Deploy AI inference replicas and cluster software across multiple datacenters
  • Operate across heterogeneous datacenter environments undergoing rapid 10x growth
  • Maximize capacity allocation and optimize replica placement using constraint-solver algorithms
  • Operate bare-metal inference infrastructure while supporting transition to K8S-based platform
  • Develop and extend telemetry, observability and alerting solutions to ensure deployment reliability at scale
  • Develop and extend a fully automated deployment pipeline to support fast software updates and capacity reallocation at scale
  • Translate technical and customer needs into actionable requirements for the Dev Infra, Cluster, Platform and Core teams
  • Stay up to date with the latest advancements in AI compute infrastructure and related technologies.

Requirements:

  • 5-7 years of experience in operating on-prem compute infrastructure (ideally in Machine Learning or High-Performance Compute) or in developing and managing complex AWS plane infrastructure for hybrid deployments
  • Strong proficiency in Python for automation, orchestration, and deployment tooling
  • Solid understanding of Linux-based systems and command-line tools
  • Extensive knowledge of Docker containers and container orchestration platforms like K8S
  • Familiarity with spine-leaf (Clos) networking architecture
  • Proficiency with telemetry and observability stacks such as Prometheus, InfluxDB and Grafana
  • Strong ownership mindset and accountability for complex deployments
  • Ability to work effectively in a fast-paced environment.
What we offer:
  • Build a breakthrough AI platform beyond the constraints of the GPU
  • Publish and open source their cutting-edge AI research
  • Work on one of the fastest AI supercomputers in the world
  • Enjoy job stability with startup vitality
  • Our simple, non-corporate work culture that respects individual beliefs.

Additional Information:

Job Posted:
February 17, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Sr. Deployment Engineer, AI Inference

Sr. Distinguished AI Engineer

At Capital One, we are creating responsible and reliable AI systems, changing ba...
Location
Location
United States , Cambridge, Massachusetts; New York, New York; Richmond, Virginia; San Jose, California; McLean, Virginia; San Francisco, California
Salary
Salary:
280600.00 - 384200.00 USD / Year
capitalone.com Logo
Capital One
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 10 years of experience developing AI and ML algorithms or technologies, or a Master's degree in Computer Science, AI, Electrical Engineering, Computer Engineering, or related fields plus at least 8 years of experience developing AI and ML algorithms or technologies
  • At least 10 years of experience programming with Python, Go, Scala, or Java
Job Responsibility
Job Responsibility
  • Partner with a cross-functional team of engineers, research scientists, technical program managers, and product managers to deliver AI-powered products
  • Design, develop, test, deploy, and support AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability
  • Leverage a broad stack of Open Source and SaaS AI technologies such as AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, PyTorch, and more
  • Invent and introduce state-of-the-art LLM optimization techniques to improve the performance — scalability, cost, latency, throughput — of large scale production AI systems
  • Contribute to the technical vision and the long term roadmap of foundational AI systems at Capital One
What we offer
What we offer
  • comprehensive, competitive, and inclusive set of health, financial and other benefits that support your total well-being
  • performance based incentive compensation, which may include cash bonus(es) and/or long term incentives (LTI)
  • Fulltime
Read More
Arrow Right

Sr. Engineer, ML Platform

As the leading delivery platform in the region, we have a unique responsibility ...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
deliveryhero.com Logo
Delivery Hero
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering background with experience in building distributed systems or platforms designed for machine learning and AI workloads
  • Expert-level proficiency in Python and familiarity with ML frameworks (TensorFlow, PyTorch), infrastructure tooling (MLflow, Kubeflow, Ray), and popular APIs (Hugging Face, OpenAI, LangChain)
  • Experience implementing modern MLOps practices, including model lifecycle management, CI/CD, Docker, Kubernetes, model registries, and infrastructure-as-code tools (Terraform, Helm)
  • Demonstrated experience working with cloud infrastructure, ideally AWS or GCP, including Kubernetes clusters (GKE/EKS), serverless architectures, and managed ML services (e.g., Vertex AI, SageMaker)
  • Proven experience with generative AI technologies: transformers, embeddings, prompt engineering strategies, fine-tuning vs. prompt-tuning, vector databases, and retrieval-augmented generation (RAG) systems
  • Experience designing and maintaining real-time inference pipelines, including integrations with feature stores, streaming data platforms (Kafka, Kinesis), and observability platforms
  • Familiarity with SQL and data warehouse modeling
  • capable of managing complex data queries, joins, aggregations, and transformations
  • Solid understanding of ML monitoring, including identifying model drift, decay, latency optimization, cost management, and scaling API-based genAI applications efficiently
  • Bachelor’s degree in Computer Science, Engineering, or a related field
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable, reusable, and reliable ML platforms and tooling that support the entire ML lifecycle, including data ingestion, model training, evaluation, deployment, and monitoring for both traditional and generative AI models
  • Develop standardized ML workflows and templates using MLflow and other platforms, enabling rapid experimentation and deployment cycles
  • Implement robust CI/CD pipelines, Docker containerization, model registries, and experiment tracking to support reproducibility, scalability, and governance in ML and genAI
  • Collaborate closely with genAI experts to integrate and optimize genAI technologies, including transformers, embeddings, vector databases (e.g., Pinecone, Redis, Weaviate), and real-time retrieval-augmented generation (RAG) systems
  • Automate and streamline ML and genAI model training, inference, deployment, and versioning workflows, ensuring consistency, reliability, and adherence to industry best practices
  • Ensure reliability, observability, and scalability of production ML and genAI workloads by implementing comprehensive monitoring, alerting, and continuous performance evaluation
  • Integrate infrastructure components such as real-time model serving frameworks (e.g., TensorFlow Serving, NVIDIA Triton, Seldon), Kubernetes orchestration, and cloud solutions (AWS/GCP) for robust production environments
  • Drive infrastructure optimization for generative AI use-cases, including efficient inference techniques (batching, caching, quantization), fine-tuning, prompt management, and model updates at scale
  • Partner with data engineering, product, infrastructure, and genAI teams to align ML platform initiatives with broader company goals, infrastructure strategy, and innovation roadmap
  • Contribute actively to internal documentation, onboarding, and training programs, promoting platform adoption and continuous improvement
  • Fulltime
Read More
Arrow Right
New

Sales Consultant

In this customer-facing role, you will combine instinct, empathy, and commercial...
Location
Location
Australia , Cannington
Salary
Salary:
Not provided
plush.com.au Logo
Plush Think Sofas
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A natural hunter with a strong drive to achieve and exceed sales targets
  • high in emotional intelligence and able to build rapport quickly
  • experienced in consultative selling and solution-based approaches
  • proven ability to close sales in high-value categories such as furniture, jewellery, automotive, or luxury goods
  • energetic, self-motivated, and adaptable, even in high-traffic periods
  • organized, professional, and collaborative with a growth mindset
Job Responsibility
Job Responsibility
  • Deliver exceptional, emotionally aware customer service that inspires loyalty and repeat business
  • engage customers through emotive storytelling to help envision their dream space
  • identify new sales opportunities and close deals confidently and consistently
  • maintain accurate sales records and ensure timely processing of all customer orders
  • collaborate with the Showroom Manager to maintain high presentation and merchandising standards
What we offer
What we offer
  • Competitive salary with generous, uncapped commissions
  • ongoing training and professional development opportunities
  • a supportive, growth-focused team environment within an ASX-listed company
  • Fulltime
Read More
Arrow Right
New

Survey Crew Chief

As a survey crew chief, you will engage in diverse survey tasks across various p...
Location
Location
United States , South Sioux City
Salary
Salary:
Not provided
olsson.com Logo
Olsson
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong communication skills
  • Ability to contribute and work well on a team
  • 4 years of experience working with a licensed surveyor
  • Experience with construction staking, boundary surveys, topography surveys, and ALTA surveys, is preferred
  • Valid driver's license with a good driving history
  • Must be able to lift and carry up to 50 pounds
  • The ability to work in a constant state of alertness and safe manner
Job Responsibility
Job Responsibility
  • Engage in diverse survey tasks across various projects, including construction staking, boundary surveys, topography surveys, and ALTA surveys
  • Work in all types of terrain and weather conditions
  • Engage in project management responsibilities including project research, boundary calculations, and collaborate on project planning and coordination
What we offer
What we offer
  • Competitive 401(k) match
  • Tailored development paths
  • Possibility for flexible work arrangements
  • Wellness program promoting balanced lifestyles
  • Traditional benefits package (health care, vision, dental, paid time off, etc.)
  • Opportunity to participate in a bonus system that rewards performance
  • Fulltime
Read More
Arrow Right
New

JTAC Instructor / Simulator Operator – AFSOC Support

Barbaricum is seeking an experienced Joint Terminal Attack Controller (JTAC) Ins...
Location
Location
United States , Clovis
Salary
Salary:
Not provided
barbaricum.com Logo
Barbaricum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active DoD Secret clearance with ability to obtain Top Secret
  • current TS/SCI highly preferred
  • JTAC Instructor (JTAC-I) qualified and current within the last 4 years
  • Must have held JTAC-I and/or JTAC-SEE rating
  • Minimum of 2 years of SOF background, or equivalent augmentation in operational deployments as a JTAC
  • Proficient in written and spoken English
  • Demonstrated ability to work in austere field environments under variable weather and operational conditions
  • Experience operating standard office systems and military automation platforms in both classified and unclassified settings
  • Familiarity with Microsoft Office suite and ability to produce training briefs, instructional content, and technical reports
  • Willingness to work long hours, including nights, weekends, and holidays in support of mission training cycles
Job Responsibility
Job Responsibility
  • Deliver JTAC, JTAC-I, and JTAC-SEE instruction in classroom, field, and simulator-based settings
  • Operate, maintain, and troubleshoot simulator systems including JTC-TRS and other government-furnished systems
  • Support and track JTAC progression through training records, qualification folders, and reporting documentation
  • Implement and update CAS and SUAS training scenarios in accordance with service doctrine and unit guidance
  • Perform software updates, scenario loads, and database management for simulators as certified
  • Provide simulator calibration, system diagnostics, and scripted software execution in coordination with help desks
  • Produce monthly simulator usage and status reports and communicate discrepancies to unit leadership
  • Assist with range and air asset scheduling, deconfliction, and inter-unit coordination for live training execution
  • Act as Opposing Force (OPFOR) and deliver instruction in complex tactical environments
  • Support no-notice evaluations, sustain training documentation, and maintain unit academics programs
What we offer
What we offer
  • This position offers relocation assistance and is bonus eligible after personnel complete their onboarding period
  • Fulltime
Read More
Arrow Right
New

Sales Consultant

We are seeking an enthusiastic and motivated Sales Consultant to join our Fortit...
Location
Location
Australia , Fortitude Valley
Salary
Salary:
Not provided
plush.com.au Logo
Plush Think Sofas
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Previous sales experience, ideally selling furniture or high-value items such as jewellery, cars, bedding, luxury goods / travel, etc.
  • Positive attitude and enthusiasm, especially during busy periods
  • Strong interpersonal skills with a focus on teamwork and collaboration
  • Open to feedback and eager to learn, demonstrating a growth mindset
  • Excellent organizational skills and the ability to manage multiple responsibilities
Job Responsibility
Job Responsibility
  • Deliver outstanding customer service to create the optimal Nick Scali experience
  • Utilize your product knowledge and selling skills to achieve daily and weekly sales targets
  • Ensure accurate completion of sales order paperwork and internal documentation for timely order processing
  • Maximize sales through effective selling techniques, including room solutions and add-on sales
  • Collaborate with the Showroom Manager to uphold showroom standards, including visual merchandising and pricing accuracy
What we offer
What we offer
  • Flexible working models available, ranging from 2 to 5 days a week, promoting work-life balance
  • Competitive salary with generous uncapped commission
  • Continuous training and career development opportunities
  • A supportive team environment that values innovation and improvement
  • Fulltime
Read More
Arrow Right
New

Data & Analytics Developer

If you enjoy working with data and automating business processes, and have exper...
Location
Location
United Kingdom
Salary
Salary:
Not provided
necsws.com Logo
NEC Software Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years’ experience in Power BI Development including DAX queries, Power Query and data modelling
  • Proficiency in Power Apps and Power Automate development.
  • You have a keen interest in data analytics and workflow automation.
  • Strong diligence and critical thinking skills.
  • You have excellent interpersonal, communication and organisational skills.
  • Able and willing to travel throughout the UK.
  • Knowledge of Power Tools (Power BI, Power Apps and Power Automate).
  • Experience of creating reports, dashboards and using M query.
  • Familiarity with Microsoft 365 tools (SharePoint, Teams, etc)
  • Exposure to AI concept or willingness to learn.
Job Responsibility
Job Responsibility
  • Contribute to the data discovery and design of data model for analytical projects.
  • Create and maintaining Power BI reports and dashboards.
  • Develop and automate workflows using Power Automate to streamline and optimize business processes.
  • Build and deploy custom business applications using Power Apps to solve business challenges and enhance productivity.
  • Maintain reporting datasets and dataflows.
  • Help with documentation and optimising analytical reporting delivery process.
  • Collaborate with other team members to ensure reporting assets are complaint and secured.
  • Stay current with industry trends and emerging technologies related to Power BI, Power Apps and Power Automate, and SharePoint to recommend enhancements.
  • Identifying new areas of influence for the Data & Analytics division.
What we offer
What we offer
  • Private Medical Cover funded by NEC for Employees (with the option to add family members at an additional cost)
  • 25 days paid holiday with the option to buy/sell (FTE)
  • 4 x basic salary life assurance cover funded by NEC (with the option to increase cover at an additional cost)
  • A Group Pension Plan with fantastic employer contributions up to a maximum of 8.5%
  • A selection of flexible benefits to suit your individual needs
  • All colleagues get free access to LinkedIn Learning. Over 15000 courses covering a huge breadth of subjects. Learn about what you like, when you like, how you like.
  • Fulltime
Read More
Arrow Right
New

Resin Material Handler

12-hour Day Shift - 6am - 6pm. Assist warehouse activities during times when ask...
Location
Location
United States , Buffalo Grove, Illinois
Salary
Salary:
21.00 - 23.00 USD / Hour
nemera.net Logo
Nemera
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • High school diploma or equivalent
  • Ability to frequently lift up to 55 lbs.
  • Sitting and Standing over differing long periods of time
  • Daily warehouse cleaning tasks and 5s activities
  • Excellent communication skills
  • Well organized with strong attention to detail
  • Ability to operate a forklift (experience with other powered equipment a plus)
  • Commitment to getting tasks done in a timely, accurate manner.
  • Ability to perform material handler and packer responsibilities
Job Responsibility
Job Responsibility
  • Assist warehouse activities during times when asked
  • Identify issues and perform corrective actions.
  • Work in accordance with set departmental/company 12-hour schedule.
  • Assist colleagues when necessary to ensure a smooth running of the warehouse
  • Provide recommendations on process improvements
  • Assist with continuous improvement initiatives implementation
  • Perform specific approved operator level equipment maintenance
  • Execute PPE safety standards while ensuring escalation of issues such as the adherence to PPE.
  • Monitor departmental 5S activities, ensuring compliance and standards are met
  • Ensure all warehouse documentation applicable to the job is completed in a timely, accurate manner
  • Fulltime
Read More
Arrow Right