CrawlJobs Logo

Ai Engineer, Quality

helpcare.ai Logo

Helpcare AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

Not provided

Job Description:

Fieldguide is building AI agents for the most complex audit and advisory workflows. We're a San Francisco-based Vertical AI company building in a $100B+ market undergoing rapid transformation. Over 50 of the top 100 accounting and consulting firms trust us to power their most mission-critical work. We're backed by Bessemer Venture Partners, 8VC, Floodgate, Y Combinator, Elad Gil, and other top-tier investors. As an AI Engineer, Quality , you will own the evaluation infrastructure that ensures our AI agents perform reliably at enterprise scale. This role is 100% focused on making evaluations a first-class engineering capability: building the unified platform, automated pipelines, and production feedback loops that let us evaluate any new model against all critical workflows within hours. You'll work at the intersection of ML engineering, observability, and quality assurance to ensure our agents meet the rigorous standards our customers demand.

Job Responsibility:

  • Design and build a unified evaluation platform that serves as the single source of truth for all of our agentic systems and audit workflows
  • Build observability systems that surface agent behavior, trace execution, and failure modes in production, and feedback loops that turn production failures into first-class evaluation cases
  • Own the evaluation infrastructure stack including integration with LangSmith and LangGraph
  • Translate customer problems into concrete agent behaviors and workflows
  • Integrate and orchestrate LLMs, tools, retrieval systems, and logic into cohesive, reliable agent experiences
  • Build automated pipelines that evaluate new models against all critical workflows within hours of release
  • Design evaluation harnesses for our most complex Agentic systems and workflows
  • Implement comparison frameworks that measure effectiveness, consistency, latency, and cost across model versions
  • Design guardrails and monitoring systems that catch quality regressions before they reach customers
  • Use AI as core leverage in how you design, build, test, and iterate
  • Prototype quickly to resolve uncertainty, then harden systems for enterprise-grade reliability
  • Build evaluations, feedback mechanisms, and guardrails so agents improve over time
  • Work with SMEs and ML Engineers to create evaluation datasets by curating production traces
  • Design prompts, retrieval pipelines, and agent orchestration systems that perform reliably at scale
  • Define and document evaluation standards, best practices, and processes for the engineering organization
  • Advocate for evaluation-driven development and make it easy for the team to write and run evals
  • Partner with product and ML engineers to integrate evaluation requirements into agent development from day one
  • Take full ownership of large product areas rather than executing on narrow tasks

Requirements:

  • Multiple years of experience shipping production software in complex, real-world systems
  • Experience with TypeScript, React, Python, and Postgres
  • Built and deployed LLM-powered features serving production traffic
  • Implemented evaluation frameworks for model outputs and agent behaviors
  • Designed observability or tracing infrastructure for AI/ML systems
  • Worked with vector databases, embedding models, and RAG architectures
  • Experience with evaluation platforms (LangSmith, Langfuse, or similar)
  • Comfort operating in ambiguity and taking responsibility for outcomes

Nice to have:

Experience with audit and accounting workflows

What we offer:
  • Competitive compensation packages with meaningful ownership
  • Flexible PTO
  • 401k
  • Wellness benefits, including a bundle of free therapy sessions
  • Technology & Work from Home reimbursement
  • Flexible work schedules

Additional Information:

Job Posted:
May 04, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Ai Engineer, Quality

AI Engineering Manager - Internal AI Agent

We are looking for an AI Engineering Manager to drive Mirakl's internal AI trans...
Location
Location
France , Paris
Salary
Salary:
Not provided
mirakl.com Logo
Mirakl
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in AI/ML or software engineering
  • Proven track record building AI agents using LLMs, RAG, MCP and related technologies
  • Strong technical proficiency in Python and multiple programming languages, with architectural design experience
  • Production deployment expertise - you've shipped AI solutions to real users
  • Technical pragmatism - ability to match the right technology to the use case
  • Curiosity and continuous learning - you stay current with AI/ML trends
  • 1+ years experience as a Lead or management roles (team management or technical leadership)
  • Strong leadership skills - you inspire and develop high-performing engineering teams
  • Cross-functional stakeholder management - you build relationships and excel at working with all organizational levels & functions
  • Strong communication & presentation skills - in both English and French
Job Responsibility
Job Responsibility
  • Partner closely with Mirakl teams & leadership to identify & prioritize opportunities, redesign workflows around AI agents, and drive adoption at scale
  • Lead and mentor a team of cross-functional AI engineers, defining your team’s roadmap to support strategic AI initiatives
  • Build advanced Mirakl-specific AI agents centrally, owning the complete delivery cycle from discovery to production deployment and operations
  • Foster organization-wide AI adoption by animating internal communities, providing self-service tools, training & support to empower teams as autonomous AI builders
  • Establish & scale technical standards & stack to ensure secure, compliant & high-quality deliverables across all internal AI projects
  • Explore emerging AI paradigms, evaluate new tools and technologies, and maintain active technology watch
Read More
Arrow Right

Senior Data Engineer – Data Engineering & AI Platforms

We are looking for a highly skilled Senior Data Engineer (L2) who can design, bu...
Location
Location
India , Chennai, Madurai, Coimbatore
Salary
Salary:
Not provided
optisolbusiness.com Logo
OptiSol Business Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on expertise in cloud ecosystems (Azure / AWS / GCP)
  • Excellent Python programming skills with data engineering libraries and frameworks
  • Advanced SQL capabilities including window functions, CTEs, and performance tuning
  • Solid understanding of distributed processing using Spark/PySpark
  • Experience designing and implementing scalable ETL/ELT workflows
  • Good understanding of data modeling concepts (dimensional, star, snowflake)
  • Familiarity with GenAI/LLM-based integration for data workflows
  • Experience working with Git, CI/CD, and Agile delivery frameworks
  • Strong communication skills for interacting with clients, stakeholders, and internal teams
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable ETL/ELT pipelines across cloud and big data platforms
  • Contribute to architectural discussions by translating business needs into data solutions spanning ingestion, transformation, and consumption layers
  • Work closely with solutioning and pre-sales teams for technical evaluations and client-facing discussions
  • Lead squads of L0/L1 engineers—ensuring delivery quality, mentoring, and guiding career growth
  • Develop cloud-native data engineering solutions using Python, SQL, PySpark, and modern data frameworks
  • Ensure data reliability, performance, and maintainability across the pipeline lifecycle—from development to deployment
  • Support long-term ODC/T&M projects by demonstrating expertise during technical discussions and interviews
  • Integrate emerging GenAI tools where applicable to enhance data enrichment, automation, and transformations
What we offer
What we offer
  • Opportunity to work at the intersection of Data Engineering, Cloud, and Generative AI
  • Hands-on exposure to modern data stacks and emerging AI technologies
  • Collaboration with experts across Data, AI/ML, and cloud practices
  • Access to structured learning, certifications, and leadership mentoring
  • Competitive compensation with fast-track career growth and visibility
  • Fulltime
Read More
Arrow Right

AI Engineering Manager - Internal AI Agent

We are looking for an AI Engineering Manager to drive Mirakl's internal AI trans...
Location
Location
France
Salary
Salary:
Not provided
mirakl.com Logo
Mirakl
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in AI/ML or software engineering
  • Proven track record building AI agents using LLMs, RAG, MCP and related technologies
  • Strong technical proficiency in Python and multiple programming languages, with architectural design experience
  • Production deployment expertise - you've shipped AI solutions to real users
  • Technical pragmatism - ability to match the right technology to the use case
  • Curiosity and continuous learning
  • 1+ years experience as a Lead or management roles (team management or technical leadership)
  • Strong leadership skills
  • Cross-functional stakeholder management
  • Strong communication & presentation skills - in both English and French
Job Responsibility
Job Responsibility
  • Partner closely with Mirakl teams & leadership to identify & prioritize opportunities, redesign workflows around AI agents, and drive adoption at scale
  • Lead and mentor a team of cross-functional AI engineers, defining your team’s roadmap to support strategic AI initiatives
  • Build advanced Mirakl-specific AI agents centrally, owning the complete delivery cycle from discovery to production deployment and operations
  • Foster organization-wide AI adoption by animating internal communities, providing self-service tools, training & support to empower teams as autonomous AI builders
  • Establish & scale technical standards & stack to ensure secure, compliant & high-quality deliverables across all internal AI projects
  • Explore emerging AI paradigms, evaluate new tools and technologies, and maintain active technology watch
Read More
Arrow Right

AI Engineering Manager - Internal AI Agent

We are looking for an AI Engineering Manager to drive Mirakl's internal AI trans...
Location
Location
France , Bordeaux
Salary
Salary:
Not provided
mirakl.com Logo
Mirakl
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in AI/ML or software engineering
  • Proven track record building AI agents using LLMs, RAG, MCP and related technologies
  • Strong technical proficiency in Python and multiple programming languages, with architectural design experience
  • Production deployment expertise - you've shipped AI solutions to real users
  • Technical pragmatism - ability to match the right technology to the use case
  • Curiosity and continuous learning
  • 1+ years experience as a Lead or management roles (team management or technical leadership)
  • Strong leadership skills
  • Cross-functional stakeholder management
  • Strong communication & presentation skills - in both English and French
Job Responsibility
Job Responsibility
  • Partner closely with Mirakl teams & leadership to identify & prioritize opportunities, redesign workflows around AI agents, and drive adoption at scale
  • Lead and mentor a team of cross-functional AI engineers, defining your team’s roadmap to support strategic AI initiatives
  • Build advanced Mirakl-specific AI agents centrally, owning the complete delivery cycle from discovery to production deployment and operations
  • Foster organization-wide AI adoption by animating internal communities, providing self-service tools, training & support to empower teams as autonomous AI builders
  • Establish & scale technical standards & stack to ensure secure, compliant & high-quality deliverables across all internal AI projects
  • Explore emerging AI paradigms, evaluate new tools and technologies, and maintain active technology watch
Read More
Arrow Right

Director of Quality Engineering

HMH is a learning technology company committed to delivering connected solutions...
Location
Location
India , Pune
Salary
Salary:
Not provided
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of experience in software quality engineering
  • At least 7+ years in leadership roles overseeing QA/QE or Platform Engineering
  • Proven experience managing large engineering teams across geographies, preferably 50+ team members
  • Demonstrated success leading AI/automation initiatives within engineering or quality organizations
  • Deep expertise in modern testing strategies, CI/CD pipelines, cloud-native architectures, and tooling
  • Strong understanding of SDLC, release governance, and large-scale program execution
  • Excellent communication, stakeholder management, and leadership presence
  • Ability to operate in a fast-paced, matrixed global environment
Job Responsibility
Job Responsibility
  • Provide on-the-ground leadership for all QE teams in Pune, ensuring strong engagement, alignment, and execution
  • Develop and execute a strategy to elevate engineering maturity, strengthen delivery discipline, and scale operational excellence
  • Serve as a key member of the global engineering leadership team, driving alignment across geographies and business units
  • Engage with cross-functional teams, including product managers and business leaders, to align technical efforts with company objectives
  • Lead adoption and integration of AI, autonomous testing solutions, and agentic workflows to improve efficiency, accuracy, and delivery velocity
  • Champion innovation in automation frameworks, intelligent observability, and predictive analytics across QE
  • Evaluate and drive implementation of advanced tooling and platforms to enhance productivity
  • Oversee roadmap delivery for Platform and Curriculum initiatives, ensuring commitments are met with speed, reliability, and quality
  • Establish strong governance, release management rigor, and continuous improvement practices
  • Partner closely with Product, Architecture, Engineering, and Program teams to ensure seamless end-to-end delivery
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

We are seeking an innovative AI Engineer to join a brand new team focused on pro...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience as an AI Engineer with a significant delivery history
  • Strong expertise in multiple programming languages & frameworks
  • Experience and proven experience in using quantitative testing practice applied to the field of AI/ML for actionable Go/No-Go decisions of delivering software to production
  • Demonstrated expertise of developing on a range of architectures, ideally up to and including container-based micro-services with focus on scalability, reliability, maintainability, and high performance
  • Good understanding of SQL and NoSQL databases
  • Excellent communication and collaboration skills
  • A growth mindset and willingness to learn and adapt in a fast-paced environment
  • Passion about site reliability engineering and its impact on product development
  • Being connected to latest technologies, like Generative AI, and keen to put them in practice.
Job Responsibility
Job Responsibility
  • Understand the landscape, tooling and procedures used by developers at Citi and look for opportunities to reduce toil and aid simplification using Gen AI based solutions
  • Apply classic AI and novel Gen AI evaluation methodology to raise the quality and reliability bar for the software that you will deliver, as well to manage and mitigate risks that are specific/inherent to this field
  • Advice on Evaluation metrics, devise and implement Quantitative Testing Plans, and help evolve the existing approaches to AI evaluation
  • Work with a wide variety of Citi technology teams and help them drive towards everything-as-code and a codified controls environment
  • Collaborate with product and engineering teams to design, build and maintain scalable and reliable web applications and services
  • Be hands-on with coding and software design to ensure adherence to high quality standards and best practices
  • Mentor and nurture other engineers to help them grow their skills and expertise
  • Support and drive cultural change, including instigating critical thinking about controls and processes and encouraging a culture of continuous improvement.
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance-related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources.
  • Fulltime
Read More
Arrow Right

Head of Engineering, Product AI

Atlassian is looking for a Head of Engineering to lead product AI feature develo...
Location
Location
United States , Seattle; Mountain View; San Francisco
Salary
Salary:
251700.00 - 400000.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Over 8 years of experience in building and scaling engineering teams, specializing in Artificial Intelligence or Machine Learning domains
  • Extensive experience leading engineering teams, including managing managers of managers, and efficiently running organizations with 30+ engineers across multiple geographies
  • A track record of delivering high-quality AI solutions in a product-driven organization
  • Deep expertise in AI/ML technologies, particularly in quality assurance, testing, and validation
  • Previous experience collaborating with Engineering, Product Management, and Platform teams
  • Exceptional communication and leadership skills, capable of inspiring and guiding teams effectively
  • Strong problem-solving abilities and adaptability to thrive in fast-paced, collaborative environments
  • Demonstrated ability to manage complex projects and foster cross-functional collaboration
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
Job Responsibility
Job Responsibility
  • Define and execute the engineering strategy for Product AI quality, ensuring alignment with Atlassian's AI vision and product roadmap
  • Lead and mentor a team of engineers, fostering a culture of innovation, collaboration, and technical excellence
  • Collaborate with cross-functional teams, including product managers, data scientists, and designers, to integrate personalized AI solutions into Atlassian's ecosystem
  • Oversee the development of frameworks and tools to evaluate and ensure the quality, fairness, and reliability of personalized AI models and systems
  • Drive the adoption of best practices in AI quality assurance, including rigorous testing, monitoring, and validation processes
  • Ensure the scalability, security, and ethical deployment of personalized AI technologies
  • Partner with product and engineering teams across geographies to integrate quality-focused AI solutions into Atlassian's suite of products
  • Act as a thought leader within the organization, advocating for responsible AI practices and continuous improvement in AI quality
  • Represent the Central AI team in external forums, showcasing Atlassian's commitment to AI quality and innovation
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

AI Engineer

AI Engineer position at Inetum, a European leader in digital services, focusing ...
Location
Location
Romania , Bucharest
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 3 to 5 years of experience in AI solutions deployment in enterprise environments
  • Understanding of how LLMs (GPT-4, Gemini, Claude, Llama, Mistral AI) work, their capabilities, and limitations
  • Familiarity with model architecture, tokenization, context windows, and prompt formatting
  • Crafting effective prompts for various tasks (text generation, summarization, Q&A, code generation, images and sound manipulations)
  • Techniques for prompt chaining, few-shot and zero-shot learning, and multi-turn conversations
  • Knowledge of prompt templates, system instructions, and role-based prompting
  • Understanding the concept of AI agents: autonomous entities that perceive, reason, and act to achieve goals
  • Familiarity with multi-agent systems, agent orchestration, and agentic workflows (e.g., using frameworks like Lang Chain, Crew AI, Auto Gen)
  • Ability to design, prompt, and coordinate groups of AI agents for collaborative or competitive tasks
  • Knowledge of agent communication, delegation, and task decomposition
Job Responsibility
Job Responsibility
  • Develop, refine, and optimize prompts for LLMs (GPT-4, Gemini, Claude, Llama, Mistral) to support a variety of tasks such as text generation, summarization, Q&A, and code generation
  • Design and implement prompt strategies for multi-turn conversations, prompt chaining, and role-based instructions
  • Build and coordinate groups of AI agents (multi-agent systems) for collaborative or competitive tasks using frameworks such as Lang Chain, Crew AI, or Auto Gen
  • Upgrade existing prompts while releases and solutions set evolve
  • Evaluate and improve the effectiveness of prompts and agent workflows through iterative testing, A/B experimentation, and performance analysis
  • Collaborate with cross-functional teams (developers, data scientists, product managers) to integrate prompt engineering and agentic workflows into enterprise solutions
  • Ensure compliance with data privacy, security, and regulatory standards (GDPR, NIS2) in all prompt and agent designs
  • Document prompt strategies, agent architectures, and best practices for internal knowledge sharing and training
  • Ensure provided solutions are ready for production, documented and monitored ensuring consistent delivery and quality
What we offer
What we offer
  • Full access to foreign language learning platform
  • Personalized access to tech learning platforms
  • Tailored workshops and trainings to sustain your growth
  • Medical Insurance
  • Meal tickets
  • Monthly budget to allocate on flexible benefit platform
  • Access to 7 Card services
  • Wellbeing activities and gatherings
  • Fulltime
Read More
Arrow Right