Platform Engineer, Agents Job at Hebbia (New York City)

Senior ML Engineer - AI Platform & Agents

We are building agentic AI into the core of our product and need someone who can...

Location

France, Bordeaux

Salary:

Not provided

PhantomBuster

Expiration Date

Until further notice

Requirements

5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
Experience with cloud platforms for model training and deployment, especially AWS
Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
Fluency in English

Job Responsibility

Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
Identify, source, and refine datasets for tuning models, powering retrieval pipelines, or expanding agentic workflows
Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
Monitor, debug, and continuously improve deployed models and AI tools
Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy

What we offer

International team
Fun team building events
€40/month for remote work
Flexible working time
Home office budget up to €1500
100% of an Alan Blue subscription
Lunch vouchers: €8 per worked day (50% covered by The Phantom Company)
Partnership with MokaCare
€70 a month benefit for entertainment expenses
Book Allowance and Sharing Program

Senior ML Engineer - AI Platform & Agents

Join PhantomBuster as a Senior ML Engineer to build agentic AI with AWS Bedrock,...

Location

France; Spain; Portugal

Salary:

Not provided

PhantomBuster

Expiration Date

Until further notice

Requirements

5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
Experience with cloud platforms for model training and deployment, especially AWS
Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
Fluency in English

Job Responsibility

Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
Identify, source, and refine datasets for tuning models, powering retrieval pipelines, or expanding agentic workflows
Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
Monitor, debug, and continuously improve deployed models and AI tools
Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy

What we offer

Fully remote working environment (France, Spain, or Portugal)
Real ownership: you will define how agentic AI is built at PhantomBuster, not follow someone else's decisions
Freedom to research and adopt new technologies as the space evolves & to make an impact at a small, self-funded, and profitable tech startup by laying the foundation for machine learning and AI
Collaborative and open-minded culture based on rationality, humility, honesty, and long-term thinking
International team
Fun team building events
€40/month for remote work
Flexible working time
Home office budget up to €1500
100% of an Alan Blue subscription (French-based contracts)

Fulltime

QA Engineer – Agents & AI Platform

QA Engineer – Agents & AI Platform. We are looking for a QA Engineer – Agents & ...

Location

India, Chennai

Salary:

Not provided

OptiSol Business Solutions

Expiration Date

Until further notice

Requirements

QA experience testing complex web platforms, workflow systems, or enterprise applications
Strong knowledge of QA methodologies, test case design, and defect lifecycle management
Hands-on experience with API testing tools such as Postman and exploratory testing techniques
Ability to build light automation scripts using Python, JavaScript, or testing frameworks for regression and API testing
Understanding of workflow-driven platforms or enterprise process systems
Awareness of security and privacy considerations, including RBAC, PHI/PII protection, and audit logging
Strong communication skills and a proactive mindset toward owning product quality

Job Responsibility

Agent Platform Testing & System Validation: Understand product requirements and workflows across vertical agents and design end-to-end test scenarios for agents, platform components, observability systems, and prompt management
Create and maintain comprehensive test plans, test cases, and test data for agent orchestration, prompts, and integrations
Build sample agents, mock tools, and demo workflows to simulate real-world scenarios and identify edge cases
Execute manual and automated tests including API, regression, and smoke testing for web UI, APIs, and background workflows
Perform basic security testing such as access control validation, data privacy checks, input validation, and misuse scenarios
Use logs, metrics, and traces to investigate defects and validate system behavior
Ensure platform reliability, performance, and quality across the testing lifecycle
Collaboration, Automation & Release Support: Collaborate with engineers and product managers to triage issues and improve product testability
Build light automation scripts for API and regression testing where applicable
Participate in release validation and quality sign-offs before deployments

What we offer

Opportunity to work on next-generation AI agent platforms and intelligent automation systems
Hands-on experience testing LLM-driven products and AI-powered workflows
A collaborative engineering culture focused on innovation, experimentation, and continuous learning
Competitive compensation and strong opportunities for career growth in AI and platform engineering

Fulltime

QA Engineer - Agents & AI Platform

We are looking for a skilled and innovative AI / Machine Learning Engineer to de...

Location

India, Chennai

Salary:

Not provided

OptiSol Business Solutions

Expiration Date

Until further notice

Requirements

Strong understanding of Machine Learning and Deep Learning fundamentals
Knowledge of Transformer architectures and language models
Experience working with Small Language Models (SLMs) or fine-tuned models
Hands-on experience with LangChain, LangGraph, CrewAI, AutoGen, OpenAI Agent SDK, or similar frameworks
Strong Python programming skills
Experience with FastAPI or similar backend frameworks
Knowledge of Relational and NoSQL databases
Familiarity with Git, Docker, and CI/CD pipelines
Experience with testing frameworks such as pytest or unittest

Job Responsibility

Design, develop, and deploy AI applications powered by Small Language Models (SLMs) and fine-tuned language models
Build and manage agent orchestration workflows using frameworks such as LangGraph, CrewAI, AutoGen, or OpenAI Agent SDK
Develop multi-agent systems that coordinate tasks through planning, reasoning, and tool interaction
Build Retrieval-Augmented Generation (RAG) pipelines integrating vector databases, APIs and enterprise data sources
Apply strong Python programming and data structure knowledge to build scalable AI systems
Develop backend services and APIs using Python frameworks such as FastAPI
Integrate AI solutions with relational or NoSQL databases and external services
Write clean, maintainable, and testable code following software engineering best practices

What we offer

Supportive and professional work environment
Competitive salary as per market standards
Opportunity to work on advanced AI and multi-agent technologies
Career growth and learning opportunities in AI engineering

Fulltime

Principal AI Engineer (Prisma Browser - Agents Platform)

The Prisma Browser group is building an agentic development lifecycle, an infras...

Location

Israel, Tel Aviv

Salary:

Not provided

Palo Alto Networks

Expiration Date

Until further notice

Requirements

8+ years of experience in software development, architecture, or owning operational systems in production
Computer Science B.Sc. or equivalent education or equivalent military experience required
A product builder's mindset: you can extract requirements, talk to stakeholders, and tell the difference between what's important and what's noise
Experience in building production grade agents. Deep understanding of the agent loop, its states and transitions. You know how to build it correctly, not just use it
Positive 'can-do' mindset, able to work independently and within a team
Hands-on experience with LLM APIs, including a practical, highly-skeptical understanding of token costs, caching, context windows, and model failure points
You know how to build the right context for a task, including memory systems, session storage, and vector databases
You understand where LLMs fail and how to design around those failure points
You've used traces or observability tooling to diagnose and improve agent behavior
A systems-level background that touches reliability, observability, or platform engineering, with a strong preference for writing narrow, deterministic code over building hypothetical abstractions

Job Responsibility

Design and implement automated evaluation loops, static analysis, and rigorous quality gates to ensure the ADLC process doesn't just write code, but consistently produces great, production-ready code
Help the team tackle complex, hard problems to elevate our autonomous development product from 'good' to 'excellent'
Lead complex initiatives in Context Engineering and Prompt Engineering
Manage and orchestrate the complex ecosystem of autonomous agents utilized for internal development
Serve as a leader, both professionally and personally, within a very strong team
Find space for growth to push the entire team or group forward
View prompt engineering as a core engineering discipline—where rewriting agent behavior is a versioned, reviewed, and tested code change
Act with a debugging temperament: conduct deep-dive analyses of raw agent transcripts to diagnose non-deterministic failures and ascertain root causes instead of merely working around them

Fulltime

Platform Engineer, Agent Collaboration Platform

Platform engineering at Hebbia is about excellent, scalable enablement. You are ...

Location

United States, New York City; San Francisco

Salary:

160000.00 - 300000.00 USD / Year

Hebbia

Expiration Date

Until further notice

Requirements

Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field
5+ years software development experience at a venture-backed startup or top technology firm, with a focus on distributed systems and platform engineering
Proficiency in building backend and distributed systems using technologies such as Python, Java, or Go
Deep understanding of scalable system design, performance optimization, and resilience engineering
Extensive experience with cloud platforms (e.g., AWS)
Working experience with one or more of the following: Kafka, ElasticSearch, PostgreSQL, and/or Redis
Knowledge of workflow orchestration and execution platforms like Airflow, Temporal or Prefect
Proven experience enabling observability patterns
Ability to analyze complex problems, propose innovative solutions, and effectively communicate technical concepts
Proven experience in leading software development projects and collaborating with cross-functional teams

Job Responsibility

Own critical system components: Take complex requirements and turn them into robust, scaled solutions that solve real customer needs
Unlock O(1) universal indexing: Build and iterate on our high-scale document build system that enables constant time latency for indexing any content in the world, regardless of data volume
Drive performance optimization: Architect and implement performance-tuning solutions to ensure our systems operate efficiently at scale, minimizing latency and maximizing throughput across millions of documents
Mentor and guide: Provide technical leadership, mentorship, and guidance to junior engineers, fostering a culture of learning and growth

What we offer

PTO: Unlimited
Insurance: Medical + Dental + Vision + 401K
Eats: Catered lunch daily + DoorDash dinner credit if you ever need to stay late
Parental leave policy: 3 months non-birthing parent, 4 months for birthing parent
Fertility benefits: $15k lifetime benefit
New hire equity grant: competitive equity package with unmatched upside potential

Fulltime

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...

Location

Poland, Kraków

Salary:

Not provided

TeamQuest Sp. z o. o.

Expiration Date

Until further notice

Requirements

GCP Expertise (verifiable - ask for production examples): GCP is their primary cloud, not secondary experience alongside AWS/Azure. Production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity. Has designed for GCP from scratch, not migrated from another cloud, with end-to-end ownership
AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only. Strong track record building distributed systems and integrating LLMs.
Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer. RAG pipelines shipped to production. Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration. AI agent workflows, ReAct prompting, and Function Calling in production environments
Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed. Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls.
API Design & Integrations: Proven ability to create secure, high-performance APIs capable of asynchronously managing traffic and communication between multiple decoupled services.
Enterprise Security: Practical knowledge of data isolation in multi-tenant SaaS architectures, IAM, and securing cloud-based environments.
Vector Databases: Hands-on experience with semantic search and at least one of: Pinecone, Weaviate, pgvector, or Vertex Matching Engine.

Job Responsibility

System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
Hands-on Core Development: Actively contribute to core system development - coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources

What we offer

Attractive salary
Full remote work
Social benefits: sport card, healthcare insurance

Fulltime

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...

Location

Poland, Katowice

Salary:

Not provided

TeamQuest Sp. z o. o.

Expiration Date

Until further notice

Requirements

GCP Expertise (verifiable - ask for production examples): production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity
Has designed for GCP from scratch, not migrated from another cloud, with end-to-end ownership
AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only
Strong track record building distributed systems and integrating LLMs
Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer
RAG pipelines shipped to production
Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration
AI agent workflows, ReAct prompting, and Function Calling in production environments
Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed
Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls

Job Responsibility

System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
Hands-on Core Development: Actively contribute to core system development - coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources

What we offer

Attractive salary
Full remote work
Social benefits: sport card, healthcare insurance

Fulltime
