Platform Engineer, Agents

United States, New York City · 160000.00 - 300000.00 USD / Year · Job Posted December 09, 2025

Job Description

Platform engineering at Hebbia is about excellent, scalable enablement. You are responsible for the core distributed systems that power billions of tokens across millions of dollars of AUM. You will deploy efficient systems and build software tightly coupled with state-of-the-art infrastructure and system design. Hebbia’s advantage is built on operating at the edge of the tokenomics curve, and you will serve as a key contributor in this area. We value engineers who think on their feet, innovate, and can solve for exponential scale.

Job Responsibility

  • Own critical system components: Take complex requirements and turn them into robust, scaled solutions that solve real customer needs
  • Unlock O(1) universal indexing: Build and iterate on our high-scale document build system that enables constant time latency for indexing any content in the world, regardless of data volume
  • Drive performance optimization: Architect and implement performance-tuning solutions to ensure our systems operate efficiently at scale, minimizing latency and maximizing throughput across millions of documents
  • Mentor and guide: Provide technical leadership, mentorship, and guidance to junior engineers, fostering a culture of learning and growth

Requirements

  • Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field. A strong academic background with coursework in data structures, algorithms, and software development is preferred
  • 5+ years software development experience at a venture-backed startup or top technology firm, with a focus on distributed systems and platform engineering
  • Proficiency in building backend and distributed systems using technologies such as Python, Java, or Go
  • Deep understanding of scalable system design, performance optimization, and resilience engineering
  • Extensive experience with cloud platforms (e.g., AWS)
  • Working experience with one or more of the following: Kafka, ElasticSearch, PostgreSQL, and/or Redis
  • Knowledge of workflow orchestration and execution platforms like Airflow, Temporal or Prefect
  • Proven experience enabling observability patterns
  • Ability to analyze complex problems, propose innovative solutions, and effectively communicate technical concepts to both technical and non-technical stakeholders
  • Proven experience in leading software development projects and collaborating with cross-functional teams. Strong interpersonal and communication skills to foster a collaborative and inclusive work environment
  • Enthusiasm for continuous learning and professional growth. A passion for exploring new technologies, frameworks, and software development methodologies
  • Autonomous and excited about taking ownership over major initiatives

Nice to have

  • Experience building distributed systems leveraging technologies such as etcd or Apache Zookeeper
  • Frequent user of AI products, especially during the development lifecycle (e.g., Cursor, Claude Code)

What we offer

  • PTO: Unlimited
  • Insurance: Medical + Dental + Vision + 401K
  • Eats: Catered lunch daily + DoorDash dinner credit if you ever need to stay late
  • Parental leave policy: 3 months non-birthing parent, 4 months for birthing parent
  • Fertility benefits: $15k lifetime benefit
  • New hire equity grant: competitive equity package with unmatched upside potential

Similar Jobs for Platform Engineer, Agents

8 matching positions

Senior ML Engineer - AI Platform & Agents

We are building agentic AI into the core of our product and need someone who can...
Location: France, Bordeaux
Salary: Not provided
Company: PhantomBuster (phantombuster.com)
Expiration Date: Until further notice
Requirements
  • 5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
  • Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
  • Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
  • Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
  • Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
  • Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
  • Experience with cloud platforms for model training and deployment, especially AWS
  • Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
  • Fluency in English
Job Responsibility
  • Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
  • Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
  • Identify, source, and refine datasets to allow tuning models, powering retrieval pipelines, or expanding agentic workflows
  • Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
  • Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
  • Monitor, debug, and continuously improve deployed models and AI tools
  • Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
  • Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy
What we offer
  • International team
  • Fun team building events
  • €40/month for remote work
  • Flexible working time
  • Home office budget up to €1500
  • 100% of an Alan Blue subscription
  • Lunch vouchers: €8 per worked day (50% paid by The Phantom Company)
  • Partnership with MokaCare
  • €70 a month benefit for entertainment expenses
  • Book Allowance and Sharing Program

Senior ML Engineer - AI Platform & Agents

Join PhantomBuster as a Senior ML Engineer to build agentic AI with AWS Bedrock,...
Location: France; Spain; Portugal
Salary: Not provided
Company: PhantomBuster (phantombuster.com)
Expiration Date: Until further notice
Requirements
  • 5+ years of experience as an ML Engineer, AI Engineer, or Software Engineer with a strong AI focus
  • Hands-on experience building AI agents using frameworks such as LangChain, Amazon Bedrock AgentCore, or similar
  • Strong understanding of LLM-based systems: prompt engineering, agent orchestration, tool use, and multi-agent workflows
  • Familiarity with MCP (Model Context Protocol) and experience integrating agents with external APIs or data sources
  • Experience working with Agents for Amazon Bedrock AgentCore or similar agent setups
  • Strong understanding of machine learning algorithms, statistical methods, and data preprocessing techniques
  • Experience with cloud platforms for model training and deployment, especially AWS
  • Proficiency in Python, including LangChain, and standard data libraries (Pandas, NumPy, etc.)
  • Fluency in English
Job Responsibility
  • Define and evolve our infrastructure to allow for better ML and AI capabilities, with a focus on LLM-based and agentic systems
  • Contribute to the development and expansion of our agentic AI framework powered by AWS Bedrock, enabling both internal tools and customer-facing features
  • Identify, source, and refine datasets to allow tuning models, powering retrieval pipelines, or expanding agentic workflows
  • Pre-process data by using techniques such as data cleaning, feature engineering, and transformation
  • Train, evaluate, and deploy both LLM-based systems and traditional machine learning models into production
  • Monitor, debug, and continuously improve deployed models and AI tools
  • Support machine learning usage throughout the company, including selecting the right modeling approach for the use case (LLM vs. traditional ML)
  • Support the integration and use of LLMs, including approaches such as fine-tuning, prompt tuning, and retrieval-augmented generation (RAG), to improve accuracy
What we offer
  • Fully remote working environment (France, Spain, or Portugal)
  • Real ownership: you will define how agentic AI is built at PhantomBuster, not follow someone else's decisions
  • Freedom to research and adopt new technologies as the space evolves & to make an impact at a small, self-funded, and profitable tech startup by laying the foundation for machine learning and AI
  • Collaborative and open-minded culture based on rationality, humility, honesty, and long-term thinking
  • International team
  • Fun team building events
  • €40/month for remote work
  • Flexible working time
  • Home office budget up to €1500
  • 100% of an Alan Blue subscription (French-based contracts)
  • Full-time

QA Engineer – Agents & AI Platform

QA Engineer – Agents & AI Platform. We are looking for a QA Engineer – Agents & ...
Location: India, Chennai
Salary: Not provided
Company: OptiSol Business Solutions (optisolbusiness.com)
Expiration Date: Until further notice
Requirements
  • QA experience testing complex web platforms, workflow systems, or enterprise applications
  • Strong knowledge of QA methodologies, test case design, and defect lifecycle management
  • Hands-on experience with API testing tools such as Postman and exploratory testing techniques
  • Ability to build light automation scripts using Python, JavaScript, or testing frameworks for regression and API testing
  • Understanding of workflow-driven platforms or enterprise process systems
  • Awareness of security and privacy considerations, including RBAC, PHI/PII protection, and audit logging
  • Strong communication skills and a proactive mindset toward owning product quality
Job Responsibility
  • Agent Platform Testing & System Validation: Understand product requirements and workflows across vertical agents and design end-to-end test scenarios for agents, platform components, observability systems, and prompt management
  • Create and maintain comprehensive test plans, test cases, and test data for agent orchestration, prompts, and integrations
  • Build sample agents, mock tools, and demo workflows to simulate real-world scenarios and identify edge cases
  • Execute manual and automated tests including API, regression, and smoke testing for web UI, APIs, and background workflows
  • Perform basic security testing such as access control validation, data privacy checks, input validation, and misuse scenarios
  • Use logs, metrics, and traces to investigate defects and validate system behavior
  • Ensure platform reliability, performance, and quality across the testing lifecycle
  • Collaboration, Automation & Release Support: Collaborate with engineers and product managers to triage issues and improve product testability
  • Build light automation scripts for API and regression testing where applicable
  • Participate in release validation and quality sign-offs before deployments
What we offer
  • Opportunity to work on next-generation AI agent platforms and intelligent automation systems
  • Hands-on experience testing LLM-driven products and AI-powered workflows
  • A collaborative engineering culture focused on innovation, experimentation, and continuous learning
  • Competitive compensation and strong opportunities for career growth in AI and platform engineering
  • Full-time

QA Engineer - Agents & AI Platform

We are looking for a skilled and innovative AI / Machine Learning Engineer to de...
Location: India, Chennai
Salary: Not provided
Company: OptiSol Business Solutions (optisolbusiness.com)
Expiration Date: Until further notice
Requirements
  • Strong understanding of Machine Learning and Deep Learning fundamentals
  • Knowledge of Transformer architectures and language models
  • Experience working with Small Language Models (SLMs) or fine-tuned models
  • Hands-on experience with LangChain, LangGraph, CrewAI, AutoGen, OpenAI Agent SDK, or similar frameworks
  • Strong Python programming skills
  • Experience with FastAPI or similar backend frameworks
  • Knowledge of Relational and NoSQL databases
  • Familiarity with Git, Docker, and CI/CD pipelines
  • Experience with testing frameworks such as pytest or unittest
Job Responsibility
  • Design, develop, and deploy AI applications powered by Small Language Models (SLMs) and fine-tuned language models
  • Build and manage agent orchestration workflows using frameworks such as LangGraph, CrewAI, AutoGen, or OpenAI Agent SDK
  • Develop multi-agent systems that coordinate tasks through planning, reasoning, and tool interaction
  • Build Retrieval-Augmented Generation (RAG) pipelines integrating vector databases, APIs and enterprise data sources
  • Apply strong Python programming and data structure knowledge to build scalable AI systems
  • Develop backend services and APIs using Python frameworks such as FastAPI
  • Integrate AI solutions with relational or NoSQL databases and external services
  • Write clean, maintainable, and testable code following software engineering best practices
What we offer
  • Supportive and professional work environment
  • Competitive salary as per market standards
  • Opportunity to work on advanced AI and multi-agent technologies
  • Career growth and learning opportunities in AI engineering
  • Full-time

Principal AI Engineer (Prisma Browser - Agents Platform)

The Prisma Browser group is building an agentic development lifecycle, an infras...
Location: Israel, Tel Aviv
Salary: Not provided
Company: Palo Alto Networks (paloaltonetworks.com)
Expiration Date: Until further notice
Requirements
  • 8+ years of experience in software development, architecture, or owning operational systems in production
  • Computer Science B.Sc. or equivalent education or equivalent military experience required
  • A product builder's mindset: you can extract requirements, talk to stakeholders, and tell the difference between what's important and what's noise
  • Experience in building production grade agents. Deep understanding of the agent loop, its states and transitions. You know how to build it correctly, not just use it
  • Positive 'can-do' mindset, able to work independently and within a team
  • Hands-on experience with LLM APIs, including a practical, highly-skeptical understanding of token costs, caching, context windows, and model failure points
  • You know how to build the right context for a task, including memory systems, session storage, and vector databases
  • You understand where LLMs fail and how to design around those failure points
  • You've used traces or observability tooling to diagnose and improve agent behavior
  • A systems-level background that touches reliability, observability, or platform engineering, with a strong preference for writing narrow, deterministic code over building hypothetical abstractions
Job Responsibility
  • Design and implement automated evaluation loops, static analysis, and rigorous quality gates to ensure the ADLC process doesn't just write code, but consistently produces great, production-ready code
  • Help the team tackle complex, hard problems to elevate our autonomous development product from 'good' to 'excellent'
  • Lead complex initiatives in Context Engineering and Prompt Engineering
  • Manage and orchestrate the complex ecosystem of autonomous agents utilized for internal development
  • Serve as a leading individual in a very strong team professionally and personally
  • Find space for growth to push the entire team or group forward
  • View prompt engineering as a core engineering discipline—where rewriting agent behavior is a versioned, reviewed, and tested code change
  • Act with a debugging temperament: conduct deep-dive analyses of raw agent transcripts to diagnose non-deterministic failures and ascertain root causes instead of merely working around them
  • Full-time

Platform Engineer, Agent Collaboration Platform

Platform engineering at Hebbia is about excellent, scalable enablement. You are ...
Location: United States, New York City; San Francisco
Salary: 160000.00 - 300000.00 USD / Year
Company: Hebbia (hebbia.ai)
Expiration Date: Until further notice
Requirements
  • Bachelor's or Master's degree in Computer Science, Data Science, Statistics, or a related field
  • 5+ years software development experience at a venture-backed startup or top technology firm, with a focus on distributed systems and platform engineering
  • Proficiency in building backend and distributed systems using technologies such as Python, Java, or Go
  • Deep understanding of scalable system design, performance optimization, and resilience engineering
  • Extensive experience with cloud platforms (e.g., AWS)
  • Working experience with one or more of the following: Kafka, ElasticSearch, PostgreSQL, and/or Redis
  • Knowledge of workflow orchestration and execution platforms like Airflow, Temporal or Prefect
  • Proven experience enabling observability patterns
  • Ability to analyze complex problems, propose innovative solutions, and effectively communicate technical concepts
  • Proven experience in leading software development projects and collaborating with cross-functional teams
Job Responsibility
  • Own critical system components: Take complex requirements and turn them into robust, scaled solutions that solve real customer needs
  • Unlock O(1) universal indexing: Build and iterate on our high-scale document build system that enables constant time latency for indexing any content in the world, regardless of data volume
  • Drive performance optimization: Architect and implement performance-tuning solutions to ensure our systems operate efficiently at scale, minimizing latency and maximizing throughput across millions of documents
  • Mentor and guide: Provide technical leadership, mentorship, and guidance to junior engineers, fostering a culture of learning and growth
What we offer
  • PTO: Unlimited
  • Insurance: Medical + Dental + Vision + 401K
  • Eats: Catered lunch daily + DoorDash dinner credit if you ever need to stay late
  • Parental leave policy: 3 months non-birthing parent, 4 months for birthing parent
  • Fertility benefits: $15k lifetime benefit
  • New hire equity grant: competitive equity package with unmatched upside potential
  • Full-time

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...
Location: Poland, Kraków
Salary: Not provided
Company: TeamQuest Sp. z o. o. (teamquest.pl)
Expiration Date: Until further notice
Requirements
  • GCP Expertise (verifiable - ask for production examples): GCP is their primary cloud, not secondary experience alongside AWS/Azure. Production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity. Has designed for GCP from scratch, not migrated from another cloud, with end-to-end ownership
  • AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only. Strong track record building distributed systems and integrating LLMs.
  • Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer. RAG pipelines shipped to production. Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration. AI agent workflows, ReAct prompting, and Function Calling in production environments
  • Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed. Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls.
  • API Design & Integrations: Proven ability to create secure, high-performance APIs capable of asynchronously managing traffic and communication between multiple decoupled services.
  • Enterprise Security: Practical knowledge of data isolation in multi-tenant SaaS architectures, IAM, and securing cloud-based environments.
  • Vector Databases: Hands-on experience with semantic search and at least one of: Pinecone, Weaviate, pgvector, or Vertex Matching Engine.
Job Responsibility
  • System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
  • AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
  • Hands-on Core Development: Actively contribute to core system development, including coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
  • Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
  • Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources
What we offer
  • Attractive salary
  • Full remote work
  • Social benefits: sport card, healthcare insurance
  • Full-time

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...
Location: Poland, Katowice
Salary: Not provided
Company: TeamQuest Sp. z o. o. (teamquest.pl)
Expiration Date: Until further notice
Requirements
  • GCP Expertise (verifiable - ask for production examples): production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity
  • Has designed for GCP from scratch, not migrated from another cloud, end-to-end ownership
  • AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only
  • Strong track record building distributed systems and integrating LLMs
  • Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer
  • RAG pipelines shipped to production
  • Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration
  • AI agent workflows, ReAct prompting, and Function Calling in production environments
  • Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed
  • Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls
Job Responsibility
  • System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
  • AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
  • Hands-on Core Development: Actively contribute to core system development, including coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
  • Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
  • Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources
What we offer
  • Attractive salary
  • Full remote work
  • Social benefits: sport card, healthcare insurance
  • Full-time