Principal Engineer, AI Inference Reliability Job at Cerebras Systems (Sunnyvale)

Senior Software Engineer and Principal Software Engineer - Power Point AI Team

The PowerPoint team is embarking on an exciting new chapter - evolving a product...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
8+ years of experience in backend service engineering, including work on high-scale infrastructures
Proficiency in one or more systems programming languages such as C#, C++
1+ years of experience in software engineering, designing and developing systems (and APIs) that deploy and integrate with AI models
2+ years of experience working with rich telemetry, making data driven decisions, and carrying out rapid experimentation
2+ years of experience building software for scale, performance, and reliability
Academic or industry experience with building, finetuning, deploying or building eval-driven systems utilizing the models (any category)

Job Responsibility

Lead design and delivery of complex, scalable AI features ensuring resilience and exceptional user experience
Drive technical strategy and architecture decisions across multiple services, influencing partner teams and aligning with compliance and security requirements
Champion modern engineering practices, including AI-driven approaches, automation, and cloud-native patterns, across the full development lifecycle
Mentor and guide engineers, fostering technical excellence and continuous improvement in security, reliability, and performance
Collaborate cross-org to solve challenging technical problems, streamline processes, and reduce operational costs while improving live-site health
Design and implement scalable backend services optimized for machine learning workflows and large language model integration
Develop and maintain evaluation-driven systems that leverage text and multimodal inputs (e.g., images) to power visual-creation experiences
Build and optimize APIs and infrastructure to support high-performance model inference and experimentation at scale
Collaborate with product, ML, and design teams to integrate models into user-facing features, ensuring seamless functionality and performance
Conduct model evaluations and experiments, analyze results, and iterate on improvements to enhance accuracy and user experience

Fulltime

Principal AI Engineer

We are seeking a highly accomplished Principal AI Engineer to define and drive t...

Location

Ireland , Dublin 18

Salary:

Not provided

Mastercard

Expiration Date

Until further notice

Requirements

Demonstrated experience designing and building AI/ML systems in production at scale, ideally across multiple problem domains
Expert-level proficiency in Python and deep experience with modern AI frameworks such as PyTorch and TensorFlow
Strong experience with cloud-native architectures and AI infrastructure on platforms such as AWS, Azure, or GCP
Deep understanding of machine learning, deep learning, NLP, generative AI, and transformer-based architectures (e.g., BERT, GPT-style models, ViTs)
Proven expertise in MLOps, including model versioning, deployment strategies, monitoring, evaluation, and lifecycle management
Strong systems-thinking mindset, with experience designing resilient, scalable, and cost-efficient AI services
Experience working with large-scale data architectures, streaming and batch processing, and model inference optimization
Excellent communication skills with the ability to explain complex technical concepts to both technical and non-technical stakeholders
Track record of technical mentorship and influence without relying on formal line management
Comfortable operating in high-ambiguity environments and making sound technical judgments with incomplete information

Job Responsibility

Define and drive the technical direction of our AI platforms and solutions
Architect, build, and scale production-grade AI systems that deliver durable business impact
Lead through deep hands-on expertise, influence technical strategy across teams, and raise the engineering bar for AI development across the organization
Design, implement, and operate advanced AI systems that support critical business and client needs in a scalable, secure, and reliable manner
Partner closely with product, engineering, and data leaders to translate business intent into robust AI architectures and platforms

Fulltime

Principal Software Engineer - Agentix AI (Cortex XSIAM)

Your career: In this role, you will act as the primary architect of the "nervous...

Location

Israel , Tel Aviv

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

Keeps up with the latest research and stays on top of the fast-moving AI space, with a real passion for what’s happening in Generative AI
Regularly tries out different AI tools and sees how they’re useful in everyday work and life
Strong understanding of advanced prompting techniques like Chain-of-Thought, ReAct, and few-shot prompting
Experience working on model quantization or finding ways to optimize inference costs and token usage at scale
Hands-on experience with Python (FastAPI, Django, or Flask) or Go, with a solid grasp of async programming and microservices
Experience turning a vague product idea (e.g., "let's add a smart assistant") into clear, concrete technical requirements
Hands-on experience using frameworks like LangChain to build more complex LLM flows and agents
Experience working with vector databases
Comfortable building and using RESTful and GraphQL APIs, especially when dealing with low-latency streaming (WebSockets, Server-Sent Events)
Enjoys digging into "non-deterministic" systems - when an LLM fails, comfortable figuring out whether it’s the prompt, the retrieval, or the data

Job Responsibility

Act as the primary architect of the "nervous system" that bridges the gap between sophisticated AI models and real-world business logic
Design and maintain the critical infrastructure that enables intelligent, autonomous features to function reliably at scale
Take ownership of the entire data flow, where you will develop high-performance RAG (Retrieval-Augmented Generation) pipelines and complex agentic workflows
Champion system stability by implementing rigorous evaluation and monitoring frameworks

Fulltime

Principal Software Engineer - CoreAI Model Inference & Serving

Join our team within CoreAI, where we are building the AI data-plane that powers...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, or Java
OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Job Responsibility

Be a hands-on technical leader, designing, coding, and shipping core serving systems, smart routing, and request distribution for a broad portfolio of LLMs, including OpenAI, Mistral, Grok, DeepSeek, and others
Build large-scale AI services and platform capabilities that power new products and customer experiences
Drive cutting-edge innovation in AI systems alongside world-class engineers and cross-functional partners
Lead through architecture, code reviews, mentorship, and technical excellence while staying close to implementation
Improve reliability, scalability, observability, efficiency, and performance across mission-critical services

Fulltime

Principal Engineer - Marketplace

Principal Engineer role in the Marketplace Engineering team to lead breakthrough...

Location

United States , San Francisco; Sunnyvale

Salary:

302000.00 - 336000.00 USD / Year

Uber

Expiration Date

Until further notice

Requirements

PhD in Computer Science, Machine Learning, Operations Research, or related quantitative field OR Master’s degree with 12+ years of industry experience
10+ years of experience building and deploying ML models in large-scale production environments
Expert-level proficiency in modern ML frameworks (TensorFlow, PyTorch, JAX) and distributed computing platforms (Spark, Ray)
Deep expertise across multiple areas including: Deep Learning, Causal Inference, Reinforcement Learning, Multi-objective Optimization, Algorithmic Game Theory, and Large-scale Ads Ranking/Auction Systems
Proven track record of leading complex ML projects from research through production with significant measurable business impact
Strong programming skills in Python, Java, or Go with experience building production ML systems
Experience with feature engineering, model serving, and ML infrastructure at scale (handling millions of predictions per second)
Technical leadership experience including mentoring senior engineers and driving cross-team technical initiatives
Advanced Deep Learning and Neural Network architectures
Scalable ML architecture and distributed model training

Job Responsibility

Lead the design and implementation of advanced ML systems for dynamic pricing algorithms serving millions of drivers across 70+ countries around the world
Architect real-time ML infrastructure handling 1M+ pricing decisions per second with sub-50ms latency requirements
Drive breakthrough research in causal ML, reinforcement learning, algorithmic game theory, and multi-objective optimization for marketplace optimization with strategic agents
Own end-to-end ML model lifecycle from research through production deployment and continuous optimization
Develop and enforce best practices in system design, ensuring data integrity, security, and optimal performance
Serve as a representative for the Marketplace organization to the broader internal and external technical community
Contribute to the eng brand for Marketplace and serve as a talent magnet to help attract and retain talent for the team
Stay abreast of industry trends and emerging technologies in software engineering, focused particularly on ML/AI, to enhance our systems and processes continually
Build scalable ML architecture and feature management systems supporting Driver Pricing and broader Marketplace teams
Design experimentation frameworks enabling rapid testing of pricing algorithms using A/B, Switchback, Synthetic Control, and other experimental methodologies

What we offer

Eligible to participate in Uber's bonus program
May be offered an equity award & other types of comp
Eligible to participate in a 401(k) plan
Eligible for various benefits (details at provided link)

Fulltime

Principal AI Network Architect

Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 7+ years technical engineering experience
OR Bachelor's Degree in Electrical Engineering, Computer Engineering, Mechanical Engineering, or related field AND 8+ years technical engineering experience
OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check
5+ years of experience in designing AI backend networks and integrating them into large-scale GPU systems
Proven expertise in system architecture across compute, networking, and accelerator domains
Deep understanding of RDMA protocols (RoCE, InfiniBand), congestion control (DCQCN), and Layer 2/3 routing
Experience with optical interconnects (e.g., PSM, WDM), link budget analysis, and transceiver integration
Familiarity with signal integrity modeling, link training, and physical layer optimization

Job Responsibility

Spearhead architectural definition and innovation for next-generation GPU and AI accelerator platforms, with a focus on ultra-high bandwidth, low-latency backend networks
Drive system-level integration across compute, storage, and interconnect domains to support scalable AI training workloads
Partner with silicon, firmware, and datacenter engineering teams to co-design infrastructure that meets performance, reliability, and deployment goals
Influence platform decisions across rack, chassis, and pod-level implementations
Cultivate deep technical relationships with silicon vendors, optics suppliers, and switch fabric providers to co-develop differentiated solutions
Represent Microsoft in joint architecture forums and technical workshops
Evaluate and articulate tradeoffs across electrical, mechanical, thermal, and signal integrity domains
Frame decisions in terms of TCO, performance, scalability, and deployment risk
Lead design reviews and contribute to PRDs and system specifications
Shape the direction of hyperscale AI infrastructure by engaging with standards bodies (e.g., IEEE 802.3), influencing component roadmaps, and driving adoption of novel interconnect protocols and topologies

Fulltime

New

Principal Software Engineer

We are developing Manufacturing and Engineering AI tools that help employees gai...

Location

India , Hyderabad

Salary:

Not provided

Amgen

Expiration Date

Until further notice

Requirements

13-17 years of engineering experience building or platforming cloud services or developer platforms, with 3+ years leading engineering teams or technical programs
Proven experience designing and operating cloud-native platforms using Kubernetes, containers, microservices, and related distributed system patterns
Hands-on experience with LLM serving or adjacent model-serving patterns, including inference endpoints, routing, scaling, batching, and latency/cost optimization
Practical knowledge of API gateway patterns, authentication and authorization, and secure integrations
Familiarity with cost attribution and FinOps concepts for cloud and AI workloads
Strong track record partnering with product managers and senior technical stakeholders to deliver platform capabilities and roadmaps
Excellent communication skills with the ability to explain technical tradeoffs clearly to both technical and non-technical audiences
Experience with observability and SRE practices, including metrics, tracing, logging, incident management, and production support
Master's / Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience

Job Responsibility

Define the technical vision and reference architecture for AI platforms supporting chatbots, agents, orchestration, and related enterprise services
Translate product and business requirements into scalable platform capabilities, including agent hosting, model access, AI gateways, observability, and operational tooling
Drive platform decisions around LLM serving, model endpoints, caching, batching, latency-versus-cost tradeoffs, and multi-model support
Lead architecture for manufacturing integrations and industrial data connectivity, including patterns for SCADA, Data Historian, MES, ERP, LIMS, APIs, event streams, and document-based knowledge sources
Own platform reliability, scalability, and cost by defining SLIs/SLOs, capacity planning, cost attribution, and FinOps practices
Collaborate with Product Owners, Principal Engineers, and stakeholders to define roadmap, acceptance criteria, and delivery milestones
Lead and mentor engineers delivering platform services, integrations, CI/CD for agents and models, and marketplace/catalog capabilities
Establish standards for security, compliance, and model governance, including data handling, access controls, logging, auditability, and traceability
Be hands-on when needed to prototype architectures, review designs, troubleshoot production incidents, and participate in code and design reviews

What we offer

In addition to the base salary, Amgen offers competitive and comprehensive Total Rewards Plans that are aligned with local industry standards

Fulltime

Principal Software Engineer

Microsoft’s Azure Data engineering team is leading the transformation of analyti...

Location

United States , Multiple Locations

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Job Responsibility

Design and lead the development of core AI capabilities in PostgreSQL including vector indexing, approximate nearest neighbor search, semantic query operators, and graph-native features
Architect in-database support for embedding pipelines and model integration to enable retrieval, reasoning, and inference
Lead system-level design efforts that span the PostgreSQL engine, extension frameworks, storage abstractions, and control plane surfaces
Collaborate with product managers, applied AI researchers, and platform teams to define use cases and translate them into scalable and intuitive capabilities
Contribute to production-grade implementation of complex systems, ensuring performance, reliability, and operability
Set technical direction and engineering quality standards through code and design reviews, prototyping, and mentorship
Act as a technical connector across teams, driving alignment on design, extensibility patterns, and developer experience
Stay current on trends in vector databases, graph systems, and AI workloads, applying academic and open-source innovation to real-world engineering
Embody our culture and values

Fulltime

Select Country

Principal Engineer, AI Inference Reliability

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?