Principal AI Tooling Engineer Job at Palo Alto Networks (Santa Clara)

Principal AI Engineer

Location

Canada , Mississauga

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Experience: Extensive experience in designing and building AI/ML solutions, with a significant focus on generative AI and Large Language Models (LLMs)
Gen AI Expertise: Deep understanding of modern AI architectures and techniques, including Retrieval-Augmented Generation (RAG), fine-tuning, function calling, and AI agentic workflows
Programming Proficiency: Expert-level skills in Python and extensive experience with core AI/ML libraries such as PyTorch, TensorFlow
System Design: Proven ability to architect and develop large-scale, distributed, multi-tier applications. Strong knowledge of microservices, API design, and system integration
MLOps: Solid understanding of MLOps principles and experience with tools for model versioning, deployment, monitoring, and lifecycle management
Leadership: Demonstrated experience serving as a technical lead, architect, or principal engineer, with a track record of mentoring team members and driving projects to completion

Job Responsibility

Architectural Leadership: Design and architect end-to-end generative AI solutions, from proof-of-concept to production, ensuring scalability, performance, and reliability
Technical Strategy: Develop and maintain a comprehensive strategic roadmap for generative AI adoption, evaluating new models, techniques, and platforms to keep our capabilities at the forefront of the industry
Solution Development: Lead the hands-on development of complex AI systems, including Retrieval-Augmented Generation (RAG) pipelines, autonomous AI agents, fine-tuning workflows, and custom model integrations
Best Practices & Standards: Establish and govern best practices for the full AI development lifecycle, including prompt engineering, model evaluation, MLOps, and data management
Cross-Functional Partnership: Collaborate closely with multiple management teams and business units to identify high-impact use cases and ensure the successful integration of AI solutions to meet business goals
Mentorship & Guidance: Serve as a senior advisor and coach to other engineers and analysts, fostering a culture of innovation and technical excellence. Allocate work and provide technical direction to the team
Risk & Compliance: Appropriately assess risk when business decisions are made, demonstrating consideration for the firm's reputation and safeguarding its clients and assets. Drive compliance with all applicable laws, rules, and regulations, particularly those related to AI ethics, data privacy, and model bias
Innovation and Research: Stay abreast of the latest advancements in generative AI research, and translate state-of-the-art developments into practical, innovative solutions

Fulltime

Principal AI Engineer – Vivado EDA Tools

We are seeking a skilled AI expert to join our new team focused on integrating A...

Location

United States , San Jose

Salary:

240000.00 - 360000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

Extensive experience with AI/ML initiatives and a proven track record of deploying production AI systems
Deep expertise in Large Language Models, fine-tuning, and prompt engineering
Experience designing agentic AI solutions and multi-agent systems
Strong background in EDA tools and workflows, preferably with Synopsys experience
Proficiency in MLOps, model deployment, and lifecycle management
Advanced skills in Python, C++, and AI/ML orchestration frameworks
In-depth knowledge of FPGA design methodologies and tool development

Job Responsibility

Design and implement Large Language Model integrations for FPGA design assistance
Develop Agentic AI systems for automated design optimization
Conduct research and development of novel AI applications in EDA tools

What we offer

Benefits offered are described: AMD benefits at a glance

Fulltime

Principal Ai Engineer (Prisma Browser - Agents Platform)

The Prisma Browser group is building an agentic development lifecycle, an infras...

Location

Israel , Tel Aviv

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

At least 8+ years of experience in software development, architecture, or owning operational systems in production
Computer Science B.Sc. or equivalent education or equivalent military experience required
A product builder's mindset: you can extract requirements, talk to stakeholders, and tell the difference between what's important and what's noise
Experience in building production grade agents. Deep understanding of the agent loop, its states and transitions. You know how to build it correctly, not just use it
Positive 'can-do' mindset, able to work independently and within a team
Hands-on experience with LLM APIs, including a practical, highly-skeptical understanding of token costs, caching, context windows, and model failure points
You know how to build the right context for a task, including memory systems, session storage, and vector databases
You understand where LLMs fail and how to design around those failure points
You've used traces or observability tooling to diagnose and improve agent behavior
A systems-level background that touches reliability, observability, or platform engineering, with a strong preference for writing narrow, deterministic code over building hypothetical abstractions

Job Responsibility

Design and implement automated evaluation loops, static analysis, and rigorous quality gates to ensure the ADLC process doesn't just write code, but consistently produces great, production-ready code
Help the team tackle complex, hard problems to elevate our autonomous development product from 'good' to 'excellent'
Lead complex initiatives in Context Engineering and Prompt Engineering
Manage and orchestrate the complex ecosystem of autonomous agents utilized for internal development
Serve as a leading individual in a very strong team professionally and personally
Find space for growth to push the entire team or group forward
View prompt engineering as a core engineering discipline—where rewriting agent behavior is a versioned, reviewed, and tested code change
Act with a debugging temperament
conduct deep-dive analyses of raw agent transcripts to diagnose non-deterministic failures and ascertain root causes instead of merely working around them

Fulltime

Principal AI Engineer – Vivado EDA Tools

We are seeking a skilled AI expert to join our new team focused on integrating A...

Location

United States , Boxborough

Salary:

212000.00 - 318000.00 USD / Year

AMD

Expiration Date

Until further notice

Requirements

Extensive experience with AI/ML initiatives and a proven track record of deploying production AI systems
Deep expertise in Large Language Models, fine-tuning, and prompt engineering
Experience designing agentic AI solutions and multi-agent systems
Strong background in EDA tools and workflows, preferably with Synopsys experience
Proficiency in MLOps, model deployment, and lifecycle management
Advanced skills in Python, C++, and AI/ML orchestration frameworks
In-depth knowledge of FPGA design methodologies and tool development

Job Responsibility

Design and implement Large Language Model integrations for FPGA design assistance
Develop Agentic AI systems for automated design optimization
Conduct research and development of novel AI applications in EDA tools

Fulltime

Principal AI Engineer

As a Principal AI Engineer on the AI Foundations team, you are an established su...

Location

Singapore , Singapore

Salary:

Not provided

Mastercard

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science, Engineering, Data Science, Applied Mathematics, or related technical field
advanced degree preferred
Strong foundation in software engineering, distributed systems, and applied machine learning relevant to production AI systems
Demonstrated understanding of responsible AI, model/system risk, privacy/security considerations, and governance requirements for enterprise deployments
Demonstrated, sustained ownership of production AI/ML systems, including design, build, deployment, and ongoing lifecycle operations
Real-world experience shipping complex agentic systems into production, including multi-agent coordination and multi-tool integration with safe action policies
Hands-on experience building production pipelines for evaluation, monitoring, versioning, and continuous improvement (including retraining or policy/guardrail updates)
Proven ability to define and operationalize observability and reliability practices for agentic systems (traceability, telemetry, SLOs, incident management)
Track record of influencing architecture and standards across multiple teams or programs, and mentoring engineers to raise overall engineering rigor

Job Responsibility

Serve as an established subject matter expert in AI Engineering, influencing stakeholders and shaping technical direction across multiple initiatives
Architect, design, develop, and maintain advanced AI/ML systems, with emphasis on complex agentic solutions (multi-agent orchestration, tool/function-calling, memory, reflection/self-correction, and autonomy policies)
Lead production implementation of agentic AI systems, including scalable training and evaluation pipelines, deployment frameworks, and runtime orchestration patterns
Define and implement safe tool-use patterns: structured outputs, robust error handling, permissioning and auditability, human-in-the-loop (HITL) approval steps for sensitive actions, and guardrail enforcement
Establish end-to-end AgentOps/LLMOps practices for agentic systems: release pipelines for prompts/tools/policies, canary strategies, safe rollback mechanisms, and continuous regression/safety evaluations as release gates
Build and optimize data ingestion, preprocessing, feature/embedding engineering, and retrieval/memory workflows to improve grounding quality and reduce failure modes
Own production observability for agentic systems: trace capture, cost/token telemetry, latency and reliability SLOs, and incident response practices for agent failures
Implement drift detection and performance decay monitoring (data drift, concept drift), and automate model/agent retraining, policy updates, and redeployment to maintain output quality over time
Drive measurable improvements in system effectiveness, safety, and efficiency by defining success metrics (task success, intervention rate, policy violations, cost and latency per task) and continuously improving evaluation coverage
Mentor and grow senior and junior engineers through design reviews, code reviews, hands-on coaching, and the creation of reusable patterns, playbooks, and standards for agentic delivery

Fulltime

Principal AI Engineer

Lead the evolution of T-Mobile’s Agentic Next Best Action (ANBA) capabilities—po...

Location

United States , Overland Park; Frisco; Bellevue

Salary:

150700.00 - 271900.00 USD / Year

T-Mobile

Expiration Date

Until further notice

Requirements

Deep expertise in LLM fine-tuning and prompt engineering (e.g., OpenAI APIs, Hugging Face, Anthropic Claude, Google Gemini)
Strong experience with AI orchestration tools (e.g., LangChain, LlamaIndex, vector databases for retrieval augmentation)
Hands-on knowledge of function calling and API-based reasoning models (e.g., using structured outputs to drive automated workflows)
Proficiency in Python and AI development frameworks for building scalable AI applications
Understanding of multi-agent architectures and best practices in agentic AI design
Experience with real-world AI evaluation techniques, including golden sets, synthetic data generation, and interactive testing
Ability to collaborate across teams, working with engineers, product managers, and conversational designers to refine AI-driven solutions
Bachelor's degree in Computer Science, Artificial Intelligence, or equivalent
At least 18 years of age
Legally authorized to work in the United States

Job Responsibility

Architects agentic AI systems that accomplish sophisticated tasks by invoking AI models as well as internal and third-party tools using APIs, ensuring flawless data flow in production environments
Optimizes performance of agentic AI systems through prompt engineering, fine-tuning and reinforcement learning using T-Mobile’s customer interaction data
Develops and maintains supporting software components and scripts to enable AI model deployment, testing, evaluation and monitoring
Implement retrieval-augmented generation (RAG) techniques to ensure AI responses are contextually accurate and grounded in real-time data
Stay at the forefront of LLM advancements, incorporating the latest techniques in prompt design, few-shot learning, Tool integration protocols like MCP and AI orchestration frameworks like Agent SDK
Collaborates with backend engineers, business experts and conversation designers ensuring AI-driven enhancements are optimally integrated into production environments
Defining success metrics that aligns with business requirements and continuously evaluate and improve model quality based on those metrics
Participates in other duties or projects as assigned by business management as needed

What we offer

Annual stock grant
Employee stock purchase plan
401(k)
Access to free, year-round money coaches
Medical, dental and vision insurance
Flexible spending account
Paid time off and up to 12 paid holidays
Paid parental and family leave
Family building benefits
Back-up care

Fulltime

Principal AI Engineer

The Principal AI Engineer will serve as a technical cornerstone of VideoAmp's AI...

Location

United States , Los Angeles; Boulder; New York; St. Petersburg

Salary:

175000.00 - 200000.00 USD / Year

VideoAmp

Expiration Date

Until further notice

Requirements

Bachelor's degree in Computer Science, Engineering, or a related field preferred
equivalent practical experience considered
8+ years of software engineering experience
3+ years in AI/ML infrastructure, LLM platform engineering, or agentic systems
Deep hands-on experience with LLM APIs (Anthropic, OpenAI, or equivalent)
Familiarity with prompt engineering, tool use / function calling, and multi-step agent orchestration
Strong background in resource-based API design
Experience building or consuming developer-facing platform APIs at scale
Experience with MCP or equivalent tool-layer abstractions for exposing platform capabilities to AI agents
Proficiency in Golang, Python, and SQL

Job Responsibility

Design, build, and operate VideoAmp's AI infrastructure and its universal tool layers
Lead the development of scenario evaluation frameworks
Architect and implement efficient tool discovery systems
Partner with internal engineering teams to negotiate and promote API-first designs
Own full SDLC of new Agent APIs from design through production, testing, releases, and enhancement
Facilitate weekly AI office hours
Contribute to multi-provider LLM abstraction layers
Author, review, and drive clear, technical requirements documentation for new solutions

What we offer

Equity participation included
Discretionary & flexible PTO + Spring, Summer & Winter company breaks
Inclusive and comprehensive medical, dental & vision
401(k) with matching
HSA & FSA
Paid Maternity & Parental Leave for all family additions
Cell phone & wifi reimbursement
Commuter benefits

Fulltime

Principal Engineer - AI Platform Development (Azure PostgreSQL)

Microsoft’s Azure Data engineering team is leading the transformation of analyti...

Location

Spain , Barcelona

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND extensive technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Job Responsibility

Lead the design and development of AI Store capabilities in Azure PostgreSQL, including vector search, semantic indexing, and AI-optimized database features to power the next generation of intelligent applications
Architect intuitive, scalable APIs, SDKs, and extensibility layers that bring advanced database and AI capabilities into the hands of developers
Create seamless developer experiences by integrating PostgreSQL services with modern development tools, frameworks, and cloud platforms accelerating application development on Azure PostgreSQL
Partner closely with database engine engineers, product managers, and developer advocates to translate developer needs into deep system and platform innovations
Design and deliver high-quality interfaces, SDKs, samples, and documentation that make building AI-powered applications on PostgreSQL accessible, powerful, and joyful
Engage with open-source communities, technology partners, and developer ecosystems to amplify impact, gather feedback, and inform platform evolution
Champion a developer-first mindset while advancing technical excellence, scalability, and innovation across the stack from database internals to developer workflows

Fulltime

Select Country

Principal AI Tooling Engineer

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?