At Monarch, AI is the engine powering intelligent, personalized financial experiences for our users. We're looking for an AI Engineer to design, build, and own the features that help hundreds of thousands of users understand and manage their money. You'll work across the full spectrum of AI development, from prompt engineering and API integrations to building multi-agent systems and fine-tuning language models. You will be a force multiplier for Monarch's product, making critical decisions on everything from our conversational AI architecture to how we evaluate and ship AI features with confidence.
Job Responsibilities:
Apply AI to Real Financial Problems: Use GenAI and ML to help users make sense of their money, understand spending patterns, surface actionable insights, or automate tedious financial tasks
Choose the Right Tool for Each Problem: Navigate the AI toolkit thoughtfully, knowing when a well-crafted prompt suffices, when retrieval systems add value, and when custom models are worth the investment
Ship with Confidence: Leverage and enhance our sophisticated evaluation framework to ensure AI quality. Design test datasets, implement new scorers, and use our Braintrust-based eval system to validate changes before they reach users
AI feature development, agent design and orchestration, ML model improvements, evaluation datasets and scorers, prompt engineering, and feature-level quality
Requirements:
5+ years of experience in software engineering
at least 2 years focused on building and operating production ML/AI systems
proven track record of shipping LLM-powered features
deep, hands-on expertise in prompt engineering, RAG systems, and evaluation techniques
strong fundamentals in machine learning: embeddings, similarity search, classification, and probabilistic reasoning
demonstrated experience building and using AI evaluation tooling (e.g., golden sets, rubric scoring, LLM-as-judge)
excellent Python skills
history of building production-grade AI features and services
strong collaboration and communication skills with a sharp product sensibility
strategic mindset, comfortable making build-vs-buy decisions and designing features for long-term reliability
Nice to have:
Multi-Agent Systems: Designing and building complex LLM orchestration with frameworks like LangGraph, CrewAI, or AutoGen
Fine-Tuning: Hands-on experience with LoRA, RLHF, or full fine-tuning on platforms like Vertex AI
Fintech Domain: Background in personal finance, banking, or data-rich consumer financial applications
Vector Databases: Hands-on experience with OpenSearch, pgvector, Pinecone, or similar at scale
Safety & Evaluation: Experience with red-teaming exercises, adversarial testing, and implementing guardrails
What we offer:
Work wherever you want! We are a fully remote company
Competitive cash and equity compensation
Stipend to set up your ideal working environment
Competitive benefit plans based on your location (e.g., in the US we offer medical, dental, and vision benefits and the ability to contribute to a 401(k) plan)
Unlimited PTO
A 3-day weekend every month! We take off the “First Friday” of every month