Applied AI Engineer

India, Bengaluru 2000000.00 - 5000000.00 INR / Year · Job Posted June 29, 2026

Job Description

Infer is building the operating system for insurance agencies. We make AI agents (including voice agents) that handle the work agencies have always done by hand: qualifying inbound leads, helping producers during live calls, auditing calls afterward, running renewals, and bringing churned customers back. Our long bet is that AI eventually sells insurance directly. Agencies are the wedge because that is where the work, the data, and the customer relationships actually live. Get good there, and the rest follows.

We are a YC company and have raised from Stellaris Venture Partners and others. Our founders are Vaibhav, Urvin, and Suneel. Vaibhav was an architect and AI researcher (at Purdue) and is now a licensed insurance agent. Urvin worked at BCG and is a surfer with six-pack abs. Suneel is an IITian and a philomath.

About the role

We're hiring an Applied AI Engineer to own the system that tells us whether our voice agents are getting better, and to keep them getting better on their own. Voice quality is the product. If an agent stutters, hallucinates a quote, or misses a disclosure, we lose trust, deals, and sometimes compliance footing. The system that catches all of that before customers do is the most important infrastructure we will build this year.

Today we run thousands of conversations a day with real prospects. We need a harness that scores every change end to end, a benchmark suite that runs against any new model the day it drops, a red-team pipeline that probes our agents for failure modes, and self-improvement loops that feed production failures back into the eval set.

This is an evals and infrastructure role with deep LLM work. You will touch audio, but the center of gravity is the harness and the loops around it. Think of the harness as CI for voice conversations: it runs synthetic and real calls through our stack and scores agent behavior at every layer (STT, LLM, tools, TTS, full call outcomes), so we catch regressions before customers do.

New models are coming out every few weeks, so the question is not just whether ours is good today, but whether we can tell within a week if a new open source release should replace it.

What success looks like

Day 30

  • You understand how our agents work across prompts, tools, evals, telephony, and customer systems.
  • You have shipped a v1 of evals with at least one end-to-end metric the team trusts.
  • You are sitting in on customer call reviews and tagging failure modes by hand to learn where the real problems live.
  • You have one new model (open or closed) benchmarked against our production stack with numbers we can defend.

Day 60

  • The eval system runs on updates and blocks merges that regress on a known set of cases.
  • We have a first red-team suite covering at least three classes of failure modes (jailbreaks, hallucinated quotes, compliance), running on a schedule.
  • Hard-case mining from production calls is automated, so the eval set grows without anyone triaging every example by hand.
  • At least one open source model (Qwen, DeepSeek, or similar) is benchmarked against our production stack with a defensible recommendation on whether to switch.

Day 90

  • We can swap in any new LLM and have a numbers-backed answer on whether to ship it within a week.
  • DSPy or GEPA-style prompt optimization is running over at least one production voice flow, and you have shown measurable lift.
  • Self-improvement v1 is live for at least one failure pattern. The same problem does not get solved twice because the system feeds the fix back into the platform.
  • You are spotting failure patterns across customer accounts and turning them into product fixes the rest of the team builds on.

Job Responsibility

  • Build and maintain the eval framework that scores voice agent quality across transcription, LLM reasoning, tool use, TTS, and full-conversation outcomes
  • Design voice agent behavior: system prompts, tool use, conversation flow, error recovery, and guardrails for real-time interactions
  • Drive STT and TTS accuracy improvements by comparing providers, tuning configurations, and running rigorous A/B experiments the team can act on
  • Drive TTS quality improvements: voice selection, latency vs. fidelity tradeoffs, prosody, edge cases
  • Curate and grow our evaluation datasets, including hard-case mining from production traffic
  • Build benchmarks we can run against any new model in days, and run a red-team pipeline that probes for jailbreaks, hallucinated quotes, and compliance failures
  • Partner with backend engineers to wire eval signals into CI so regressions get caught before they ship
  • Wire eval signals into CI so regressions block merges, and build self-improvement loops where hard cases from production auto-feed the eval set and our prompts optimize themselves over time

Requirements

  • ML engineering experience shipping production systems
  • Strong Python and a working ML stack (PyTorch, Hugging Face, pandas, scikit-learn)
  • Hands-on experience designing LLM-based agents: prompting, tool/function calling, multi-turn state, structured outputs
  • Hands-on experience building evals or eval frameworks for ML, LLM, or voice systems, including LLM-as-judge eval pipelines and knowledge of their failure modes
  • Practical experience with ASR/STT: comparing providers, fine-tuning, or running open models like Whisper
  • Practical experience with TTS systems (ElevenLabs or open models)
  • Comfortable working with audio data: sample rates, codecs, noise, alignment

Nice to have

  • Designed voice agents specifically: handled barge-in, interruption recovery, disfluencies, and natural turn-taking at the prompt/behavior layer
  • Experience with diarization, VAD, or endpointing models
  • Audio dataset curation, labeling, or annotation pipelines
  • Trained or fine-tuned ASR or TTS models from scratch or on domain audio
  • Experience with active learning or data-flywheel patterns over production traffic
  • Open-source contributions to AI/ML frameworks
  • Familiarity with cost/latency tradeoffs across model providers for real-time voice

Similar Jobs for Applied AI Engineer

8 matching positions

Applied AI Engineer

Build something bigger than your career as Applied AI Engineer. Are you a senior...
Location: United States, Boston
Salary: 88000.00 - 168000.00 USD / Year
Company: SOPHiA GENETICS
Expiration Date: Until further notice
Requirements
  • 5+ years of experience with a senior engineer or product manager background from a top tech company building complex products
  • Strong hands-on building skills — you ship working software, not just diagrams
  • AI fluency is essential (LLMs, automation, modern AI tooling)
  • Dialed into modern engineering and product practices
  • Confident operator - you can walk into any function, diagnose what's broken, and drive change
  • Strong change management instincts - you can win over skeptical teams and make new ways of working stick
  • Comfortable moving between strategy conversations and shipping code in the same week
Job Responsibility
  • Build internal products and automations
  • AI enablement
  • Process and tooling improvements
  • Internal consulting on tech and product practices
  • Tech reporting and monitoring
  • Cross-functional alignment
  • Hands-on delivery support
What we offer
  • Outstanding Medical, Dental & Vision with 90% Employer Contribution
  • Company matched 401K at 4%
  • Company-paid short & long-term disability insurance
  • FSA commuter benefits
  • 20 Days PTO, increasing to 25 with tenure
  • 5 Days Sick and 14 Public Holidays
  • Free EAP
  • Fulltime

Applied AI Engineer

Build and ship AI features end-to-end (model → system → user experience); Design...
Location: China, Shanghai
Salary: 600000.00 - 1500000.00 CNY / Year
Company: Randstad
Expiration Date: September 02, 2026
Requirements
  • Strong foundation in machine learning and modern neural network architectures
  • Hands-on experience with training, fine-tuning, or deploying ML models
  • Ability to write clean, production-quality code
  • Comfort working across abstraction layers (model → infra → product)
  • Strong problem-solving skills in ambiguous, fast-moving environments
  • Bias toward shipping, iteration, and continuous improvement
Job Responsibility
  • Build and ship AI features end-to-end (model → system → user experience)
  • Design and iterate on prompts, tools, memory, and agent workflows
  • Turn raw model outputs into structured, reliable, and predictable behaviors
  • Debug issues across the full stack (model, orchestration, infra, UX)
  • Optimize for latency, cost, and production reliability
  • Develop lightweight evaluation frameworks to measure real-world performance
  • Work closely with product and engineering to translate ambiguous problems into working systems
  • Fulltime

Applied AI Engineer

We’re partnering with a rapidly scaling, VC-backed fintech building an AI-first ...
Location: United States, New York
Salary: Not provided
Company: Orbis Consultants
Expiration Date: Until further notice
Requirements
  • Strong software engineering fundamentals with experience building production-grade applications and systems end-to-end
  • Proven startup experience – comfortable operating in fast-moving, ambiguous environments with high ownership
  • Hands-on experience developing user-facing applications, not just models (APIs, workflows, product features)
  • Experience with LLMs / GenAI (RAG, prompting, agents, evaluation, etc.)
  • Ability to move quickly from idea → prototype → production
  • Full-stack capability, or ability to leverage AI tools to fill gaps and ship complete features
  • Clear communicator with a strong product and user-centric mindset
Job Responsibility
  • Build and ship AI-powered product features (copilots, agents, automations, summarization, recommendations)
  • Design and scale LLM systems (RAG pipelines, tool use, orchestration layers, eval frameworks)
  • Rapidly prototype using cutting-edge tools (e.g. OpenAI, Claude, agent frameworks) and productionize what works
  • Own the full development lifecycle – from identifying opportunities to deploying and iterating in production
  • Partner closely with product, design, and end users to deliver meaningful outcomes
  • Continuously improve system quality across performance, reliability, and cost
What we offer
  • Real product impact
  • High ownership
  • AI-first environment
  • Exceptional team
  • Strong backing – $100M+ raised from top-tier investors
  • Massive growth – proven traction with rapid expansion ahead
  • Fulltime

Applied AI Engineer

Security represents the most critical priorities for our customers in a world aw...
Location: United States, Multiple Locations
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 2+ years related experience (e.g., statistics, predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft background and Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • 3+ years technical engineering experience with coding in languages including C#, Java AND Python
  • 2+ years of experience with LLMs and open-source GenAI frameworks, such as LangChain, LlamaIndex, Haystack, or equivalents (e.g., Transformers, AutoGen, DSPy), including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and fine-tuning and evaluation
  • 2+ years experience in shipping at least 2 large scale ML/AI-based services or applications on cloud platforms (Azure, AWS, GCP, etc.)
  • Proficiency in writing production-quality software code in one or more modern programming languages (Python, C#)
  • 2+ years experience developing software systems end-to-end, from design to implementation
Job Responsibility
  • Design, develop, and deploy end-to-end AI/ML systems, including data ingestion, model training, evaluation, and integration into production environments
  • Build and optimize applications leveraging LLMs and open-source GenAI frameworks such as LangChain, LlamaIndex, Haystack, Transformers, AutoGen, and DSPy
  • Implement advanced GenAI techniques including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and model fine-tuning
  • Write production-grade software in Python and C# or Java, ensuring maintainability, scalability, and performance
  • Collaborate with cross-functional teams to translate business requirements into technical solutions
  • Ship and maintain large-scale AI applications, with a focus on performance monitoring and continuous improvement
  • Conduct rigorous evaluation of AI models using appropriate metrics and benchmarks
  • Optimize models for latency, throughput, and accuracy in real-world scenarios
  • Work closely with data scientists, product managers, and other engineers to drive AI initiatives
  • Stay current with the latest advancements in GenAI, LLMs, and AI frameworks
  • Fulltime

Applied AI Engineer

We are building something extraordinary at Payhawk, and we’re looking for a pass...
Location: Bulgaria, Sofia
Salary: Not provided
Company: Payhawk
Expiration Date: Until further notice
Requirements
  • Background in Computer Science, Mathematics, or a related discipline
  • Hands-on experience in the development, deployment, and fine-tuning of GenAI solutions in production environments
  • Desirable: experience with autonomous agents and agentic workflows, covering the entire lifecycle from design and development to deployment in real-world scenarios
  • Strong understanding of text representation techniques, feature engineering, model development, and prompt engineering
  • Familiarity with emerging trends in autonomous agents is a plus
  • Strong problem-solving abilities and a capacity to work independently
  • Ability to multitask, organize, and prioritize work
  • Exceptional integrity and work ethic
Job Responsibility
  • Design and implement autonomous AI agents that leverage LLMs for reasoning, planning, and decision-making
  • Measure and track performance of autonomous AI agents in terms of speed and accuracy across various complex tasks
  • Optimize and deploy LLM-based agents in production environments, ensuring efficiency, security, maintainability and reliability
  • Build robust evaluation pipelines to help us iterate over new versions of our end-to-end solution
  • Implement multi-agent systems where LLMs collaborate to solve tasks
  • Develop memory and context management strategies for long-term agent interactions
  • Collaborate with cross-functional teams to integrate machine learning solutions into scalable products
  • Stay abreast of the latest advancements in large language models (e.g., GPT, Llama, Gemma) and Transformer-based architectures
  • Explore the evolving field of autonomous agents and identify opportunities for innovative applications
  • Leverage cloud platforms, particularly GCP, to deploy and maintain ML solutions efficiently
What we offer
  • Competitive compensation package based on experience
  • 30 days holiday paid leave
  • One week exchange policy to another Payhawk office (London, Berlin, Barcelona, Paris, Amsterdam and Vilnius)
  • Flexible working hours and opportunity to work from home
  • Regular team-wide events
  • Additional medical care
  • MultiSport card fully funded by us
  • Company office massages
  • Personal assistant service
  • Opportunity to use the Payhawk product (that is, essentially, built by you)
  • Fulltime

Applied AI Engineer

Join our team as an Applied AI Engineer, where you will drive real-world impact ...
Location: India, Noida
Salary: Not provided
Company: AquSag Technologies
Expiration Date: Until further notice
Requirements
  • Strong proficiency in Python for data analysis, modeling, and automation
  • Extensive hands-on experience with machine learning frameworks and libraries
  • Expertise in working with JSON data structures and API integrations
  • Ability to translate business requirements into deployable AI solutions
  • Excellent written and verbal communication skills, demonstrating a "care a lot" mindset for clarity and collaboration
  • Experience with version control systems and collaborative development tools
  • Self-driven, detail-oriented, and adaptable to a fast-paced remote work environment
Job Responsibility
  • Design, develop, and implement machine learning models to solve diverse business challenges
  • Collaborate cross-functionally to identify opportunities for AI integration and innovation
  • Preprocess, analyze, and interpret complex datasets using Python and related data tools
  • Develop APIs and data pipelines for seamless model integration and operationalization
  • Optimize models for performance, scalability, and robustness in production environments
  • Document methodologies, results, and workflows with clear, concise written communication
  • Communicate technical concepts to both technical and non-technical stakeholders effectively
  • Fulltime

Applied AI Engineer

A well-funded AI start-up is reimagining how software gets built – creating a pl...
Location: United States
Salary: Not provided
Company: Orbis Consultants
Expiration Date: Until further notice
Requirements
  • 5+ years of experience applying AI/ML to real-world, user-facing products
  • Strong foundation in machine learning, NLP, or gen AI – ideally with hands-on LLM experience
  • Proven software engineering ability – from model prototyping to scalable system design
  • Creative problem-solver comfortable moving from research to rapid implementation
  • Degree from a top-tier university (e.g. CMU, MIT, Stanford, Berkeley, etc.)
  • Publications in leading NLP or ML conferences (e.g. EMNLP, ACL, NeurIPS, ICML, ICLR, AAAI, etc.) are a strong plus
Job Responsibility
  • Build and refine LLM-powered systems for reasoning, command prediction, and workflow automation
  • Prototype and iterate intelligent agents that assist with software creation, debugging, and deployment
  • Develop prompt optimization and fine-tuning pipelines to improve performance and reliability
  • Design data-driven feedback loops that help the system continuously learn from usage
  • Work end-to-end: from research and experimentation to scalable deployment
What we offer
  • Be the first AI hire, with significant ownership and scope to shape the roadmap, technical vision and AI direction within the business
  • Work on problems at the intersection of AI research, developer experience, and product impact
  • Collaborate with exceptional engineers and researchers from leading tech companies
  • Fully remote across the US or Canada, with optional in-person meetups and offsites
  • Competitive compensation with meaningful early-stage equity
  • Backed by world-class investors and angels behind companies like Airbnb, Stripe, Salesforce, and OpenAI
  • Fulltime

Applied AI Engineer

At Multiverse, we believe technology should empower everyone to achieve their po...
Location: United Kingdom, London
Salary: Not provided
Company: Multiverse
Expiration Date: Until further notice
Requirements
  • 5+ years of Data Science, Machine Learning, or AI Engineering experience, with a proven track record of leading complex AI/ML projects from concept to production
  • Proficient in Python and its ecosystem (e.g. NumPy, Pandas, Scikit-Learn, PyTorch)
  • Deep experience with LLM orchestration (e.g. Langchain) and prompt engineering
  • Advanced knowledge of SQL and experience working with both structured and unstructured data
  • Strong proficiency in building APIs and microservices
  • Comfortable with GitHub, CI/CD, observability & evaluation practices, and Infrastructure as Code (e.g., Terraform)
  • Practical experience deploying and monitoring AI solutions within AWS, Azure or similar cloud environments
  • Analytical Rigour with exceptional attention to detail regarding data lineage and bias
  • User-Product first approach
  • Growth Mindset
Job Responsibility
  • Design & Deliver AI/ML solutions: Translate complex stakeholder queries and business hypotheses into actionable experiments and AI/ML model requirements
  • Architect LLM & Agentic Workflows: Design and integrate LLM-powered solutions (e.g., GPT, Claude, Gemini) for content generation and personalized learning
  • Own the End-to-End Lifecycle: Take full ownership of the journey from data lineage and preprocessing through to experimentation, deployment, evaluation and continuous iteration
  • Experimentation Rigour & Quality: Proactively monitor and refine models to optimise effectiveness while minimising sampling/analytical biases and operational challenges
  • Lead in MLOps & Infrastructure: Build and maintain scalable pipelines for model training and deployment using AWS and modern MLOps practices
  • Strategic Influence & Mentorship: Bridge the gap between technical concepts and business objectives by communicating actionable insights to stakeholders
What we offer
  • 27 days holiday, plus 5 additional days off: 1 life event day, 2 volunteer days, 2 company-wide wellbeing days and 8 bank holidays per year
  • Private medical insurance with Bupa
  • A medical cashback scheme
  • Life insurance
  • Gym membership & wellness resources through Wellhub
  • Access to Spill, an all-in-one mental health support service
  • Hybrid work offering - for most roles we collaborate in the office three days per week
  • Work-from-anywhere scheme - you'll have the opportunity to work from anywhere, up to 10 days per year
  • Space to connect: weekly catch-ups, seasonal celebrations, and a kitchen that’s always stocked
  • Fulltime