AI Platform Engineer, Backend Job at Brain Co. (San Francisco Bay Area)

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...

Location

Poland , Kraków

Salary:

Not provided

TeamQuest Sp. z o. o.

Expiration Date

Until further notice

Requirements

GCP Expertise (verifiable - ask for production examples): GCP is their primary cloud not secondary experience alongside AWS/Azure. Production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity. Has designed for GCP from scratch, not migrated from another cloud, end-to-end ownership
AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only. Strong track record building distributed systems and integrating LLMs.
Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer. RAG pipelines shipped to production. Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration. AI agent workflows, ReAct prompting, and Function Calling in production environments
Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed. Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls.
API Design & Integrations: Proven ability to create secure, high-performance APIs capable of asynchronously managing traffic and communication between multiple decoupled services.
Enterprise Security: Practical knowledge of data isolation in multi-tenant SaaS architectures, IAM, and securing cloud-based environments.
Vector Databases: Hands-on experience with semantic search and at least one of: Pinecone, Weaviate, pgvector, or Vertex Matching Engine.

Job Responsibility

System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
Hands-on Core Development: Actively contribute to core system development-coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources

What we offer

Attractive salary
Full remote work
Social benefits:sporto card,healthcare insurance

Fulltime

GCP AI Platform Architect / Lead AI Platform Engineer

Our client is an innovative technology company specializing in the development o...

Location

Poland , Katowice

Salary:

Not provided

TeamQuest Sp. z o. o.

Expiration Date

Until further notice

Requirements

GCP Expertise (verifiable - ask for production examples): production deployments across most of: Vertex AI, Cloud Run or GKE, Pub/Sub, BigQuery, Secret Manager, VPC Service Controls, IAM + Workload Identity
Has designed for GCP from scratch, not migrated from another cloud, end-to-end ownership
AI / Backend Engineering: Python is the primary language - production-grade service/API development, not scripting or data science only
Strong track record building distributed systems and integrating LLMs
Agentic Architecture (must be production, not PoC): Hands-on production experience with at least one: LangGraph, Google ADK, CrewAI, or custom multi-agent orchestration layer
RAG pipelines shipped to production
Google ADK: candidate must be able to explain what it is, when to use it, and how it compares to LangGraph and custom orchestration
AI agent workflows, ReAct prompting, and Function Calling in production environments
Multi-Tenant Architecture: Has designed a multi-tenant SaaS platform end-to-end - not just contributed
Can articulate tenant isolation strategies: IAM boundary design, data isolation per tenant, VPC controls

Job Responsibility

System Architecture: Design and develop a scalable, cloud-native architecture on Google Cloud Platform (GCP) that meets enterprise security and multi-tenant data isolation requirements for a SaaS environment
AI Agent Orchestration: Architect and implement autonomous, multi-step AI workflows with a clear separation of agent responsibilities (retrieval, analysis, reasoning, response generation)
Hands-on Core Development: Actively contribute to core system development-coding orchestration logic, designing services, optimizing performance, and building secure API integrations for routing queries across internal and external agents
Frontend Enablement: Design the backend layer, streaming protocols, and APIs to seamlessly support and integrate with advanced conversational UIs
Data Management & Extensibility: Build a robust backend capable of processing qualitative and social data, ensuring the platform is easily extensible to incorporate new data sources

What we offer

Attractive salary
Full remote work
Social benefits: sport card, healthcare insurance

Fulltime

Backend Engineer (AI Platform)

Plaud is building the world's most trusted AI work companion for professionals t...

Location

Singapore , Singapore

Salary:

Not provided

Plaud

Expiration Date

Until further notice

Requirements

Minimum 3 years of backend or AI engineering experience
At least 1+ years specifically in LLM application architecture
Deep practical knowledge of advanced agent patterns (e.g., Plan-Act-Reflection)
Proven ability to design complex distributed systems
Experience defining API standards and data protocols for cross-team usage

Job Responsibility

Agent Architecture Design: Designed the AI Agent architecture and implemented the "Plan-Act-Reflection" agentic flow
Skill Design: Developed agent skills including Function Calling, MCP Server integration, and Streaming APIs
Design DAG (Directed Acyclic Graph) reasoning flows to break down ambiguous user requests into executable steps
Solve critical runtime challenges like "Context Rot" (context overflow) by designing strategies for context offloading, isolation, and intelligent compression
RFT: Architect the Automated Data Flywheel system. Design Reward Functions and LLM-as-a-judge pipelines to programmatically evaluate agent performance and drive reinforcement learning

What we offer

Market-competitive compensation
Global exposure
Vibrant, creativity-fueled work atmosphere

Fulltime

Staff Software Engineer, Backend (AI Platform)

Cresta is on a mission to turn every customer conversation into a competitive ad...

Location

United States

Salary:

Not provided

Cresta

Expiration Date

Until further notice

Requirements

5+ years writing production software
2+ years focused on ML platform or infra
Expert Python (async, typing, packaging, performance)
Working Golang knowledge for systems components
Proven experience with one or more serving frameworks (e.g., vLLM, Triton, TorchServe)
Kubernetes and cloud-native ops
Solid grasp of distributed systems, networking, and container security
Culture of rigorous testing, code review, and continuous delivery

Job Responsibility

Own model serving: Design, build, and maintain low-latency, highly-available serving stacks for in-house ML model serving and integrating with LLM serving partners
Automate training pipelines: Orchestrate data prep, training, evaluation, and registry workflows on Kubernetes with solid MLOps practices
Optimize at scale: Profile and tune throughput, memory, and cost
introduce caching, sharding, batching, and GPU/CPU autoscaling where it pays off
Build platform primitives: Create reusable SDKs, templates, and CLI tools that let research and product teams ship models independently and safely
Raise the bar: Instrument deep observability (tracing, metrics, alerts), drive blameless post-mortems, and mentor engineers on production ML best practices

What we offer

Comprehensive medical, dental, and vision coverage with plans to fit you and your family
Flexible PTO to take the time you need, when you need it
Paid parental leave for all new parents welcoming a new child
Retirement savings plan to help you plan for the future
Remote work setup budget to help you create a productive home office
Monthly wellness and communication stipend to keep you connected and balanced
In-office meal program and commuter benefits provided for onsite employees

Backend Engineer - AI Developer Platform

At N26, we are building the internal AI platform that will power the next genera...

Location

Germany , Berlin

Salary:

Not provided

N26

Expiration Date

Until further notice

Requirements

Backend engineer who enjoys building platforms and developer-facing systems
Solid experience building software products written in languages such as Kotlin, Go, Python, or TypeScript
Experience working with APIs and distributed systems
Interest in developer platforms, tooling, or internal products
Curiosity about AI and how it can improve software development workflows
Strong collaboration skills and the ability to work within a highly technical team
Curiosity and willingness to learn new things
Data driven mindset

Job Responsibility

Help build the internal AI platform used by engineering teams at N26
Developing the core services that connect internal tools to AI providers
Implementing routing, cost controls, security policies, and observability
Building tools that make AI capabilities easy and safe to consume
Contributing to a platform where teams can publish and reuse AI skills
Supporting discovery, versioning, and governance of AI capabilities
Enabling composability across different AI-powered tools
Building services that enable AI-assisted workflows for engineers
Integrating AI capabilities with internal developer platforms
Supporting experimentation and iteration on new AI-enabled developer experiences

What we offer

Accelerate your career growth by joining one of Europe’s most talked about disruptors
Employee benefits that range from a competitive personal development budget, work from home budget, discounts to fitness & wellness memberships, language apps and public transportation
Access to a Premium subscription on your personal N26 bank account
Subscriptions for friends and family members
Additional day of annual leave for each year of service
A high degree of autonomy and access to cutting edge technologies
A relocation package with visa support for those who need it

Senior Backend Python Engineer - AI Platform

Are you looking for a career move that will put you at the heart of a global fin...

Location

United Kingdom , London

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

Proficiency in core Python and FastAPI framework
Profound understanding of software design principles, architectural patterns, and an unwavering commitment to writing clean, maintainable, and production-grade code
Experience of the full lifecycle of design, implementation and running of enterprise software solutions involving cross functional team collaboration
Experience contributing to the architecture and design (architecture, design patterns, reliability, scaling) of new and current systems
Experience with containerized deployment (Kubernetes, OpenShift etc)
Experience with DevOps, CI/CD and agile methodology

Job Responsibility

You will design, implement, build and deploy backend systems to automate the analysis of data, code and documentation, and structure the extracted knowledge in a Credit Risk Domain aware knowledge graph

What we offer

Generous holiday allowance starting at 27 days plus bank holidays
increasing with tenure
A discretional annual performance related bonus
Private medical insurance packages to suit your personal circumstances
Employee Assistance Program
Pension Plan
Paid Parental Leave
Special discounts for employees, family, and friends
Access to an array of learning and development resources

Fulltime

Senior ML Platform Engineer, AI Platform

We are seeking a skilled and passionate ML Platform Engineer to join our team an...

Location

Singapore , Singapore

Salary:

Not provided

Airwallex

Expiration Date

Until further notice

Requirements

5+ years in backend software development
at least 2+ years focus on AI/ML Platform or MLOps infrastructure
deep expertise in MLOps practices, including automated deployment pipelines, model optimization, and production lifecycle management
proven experience designing and implementing low-latency model serving solutions
proficiency in Python
skill in writing high-quality, maintainable code
experience in design and development of large-scale distributed, high concurrency, low-latency inference, high availability systems
excellent communication and mentoring abilities
a relevant degree in Computer Science, Mathematics or related fields

Job Responsibility

Platform Development: Design, build, and maintain the end-to-end MLOps platform using Kubernetes and Cloud Services
Infrastructure as Code (IaC): Use Terraform or similar tools to manage, provision, and scale all ML-related infrastructure securely and efficiently
Pipeline Automation: Implement and optimize CI/CD/CT (Continuous Integration, Delivery, Training) pipelines to automate model training, testing, packaging, and deployment using tools like Argo and Kubeflow Pipelines
Serving Infrastructure: Build highly available, low-latency, and high-throughput model serving infrastructure
Observability: Implement robust monitoring, alerting, and logging solutions to track infrastructure health, model performance, and data/model drift
Tooling & Support: Evaluate, integrate, and support ML tools such as Feature Stores and distributed model training pipelines
Security & Compliance: Ensure platform security, implement RBAC (Role-Based Access Control), and manage secrets for sensitive data and production environments
Collaboration: Work closely with Data Scientists and ML Engineers to understand their needs and provide technical guidance on best practices for scaling their models

Fulltime

Senior Machine Learning Engineer, AI Platform

The AI Platform team is responsible for building the foundational infrastructure...

Location

United States; Canada

Salary:

139000.00 - 218000.00 USD / Year

Mozilla

Expiration Date

Until further notice

Requirements

Bachelor’s degree with 4–6 years of relevant industry experience, or Master’s degree with significant hands-on experience building and operating production ML systems, or work experience equivalent
Strong experience developing in Python for machine learning systems, backend services, or distributed data processing
Proven experience deploying and operating ML workloads in cloud environments, including production-grade infrastructure
Solid understanding of model serving architectures, inference pipelines, and performance tradeoffs (latency, throughput, cost, scaling strategies)
Hands-on experience working with GPU-based workloads and accelerated computing in production settings
Experience designing CI/CD pipelines and development workflows that support reliable ML system deployment
Ability to independently scope and drive technical initiatives while balancing product and operational priorities
Strong problem-solving skills and the ability to debug performance and reliability issues in distributed systems
Clear and effective communication skills, with experience collaborating across engineering, product, and infrastructure teams

Job Responsibility

Design, build, and operate core AI platform components used to train, deploy, and serve machine learning models in production environments
Own model serving and inference workflows end-to-end, driving improvements in reliability, scalability, performance, and operational excellence
Lead efforts to optimize inference systems for throughput, latency, and cost efficiency across CPU and GPU workloads
Design and manage GPU-based inference and training workloads, including performance tuning, capacity planning, and resource utilization optimization
Own and improve critical parts of the model lifecycle, including packaging, versioning, testing strategies, validation, and deployment automation
Implement and evolve observability practices (metrics, logging, tracing, alerting) to improve visibility and operational resilience of ML services and pipelines
Partner closely with product, infrastructure, security, and data teams to design scalable platform capabilities that enable AI-powered features
Contribute to technical design discussions, propose architectural improvements, and mentor junior engineers through code reviews and knowledge sharing
Participate in and help improve operational processes, including incident response, on-call rotations, and post-incident reviews

What we offer

Generous performance-based bonus plans
Rich medical, dental, and vision coverage
Generous retirement contributions with 100% immediate vesting
Quarterly all-company wellness days
Country specific holidays plus a day off for your birthday
One-time home office stipend
Annual professional development budget
Quarterly well-being stipend
Considerable paid parental leave
Employee referral bonus program

Fulltime

Select Country

AI Platform Engineer, Backend

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?