CrawlJobs Logo

QA LLM Engineer

talentica.com Logo

Talentica

Location Icon

Location:

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

A QA Automation Engineer with strong experience in LLMs and GenAI who can ensure the accuracy, stability, and performance of AI-driven applications.

Job Responsibility:

  • Design and execute QA strategies for LLM-based and search-driven products
  • Validate data pipelines involving indexing, chunking, embeddings, cosine similarity and keyword search
  • Evaluate retrieval-augmented generation (RAG) and recommendation system quality using precision, recall, and relevance metrics
  • Develop prompt test suites to measure accuracy, consistency, and bias
  • Monitor LLM observability metrics such as latency, token usage, hallucination rate and cost performance
  • Automate end-to-end test scenarios using Playwright and integrate with CI/CD pipelines
  • Collaborate with ML engineers and developers to improve model responses and user experience
  • Contribute to test frameworks and datasets for LLM regression and benchmark testing

Requirements:

  • BE/BTech in Computer Science, Data Engineering, or a related field from a top institute (like IIT, NIT, BITS, etc.)
  • 3.5 to 5.5 years of experience in QA engineering
  • At least 1+ years of experience in GenAI or LLM-based systems
  • Strong understanding of indexing, chunking, embeddings, similarity search, and retrieval workflows
  • Experience with prompt engineering, LLM evaluation, and output validation techniques
  • Proficiency with Playwright, API automation, and modern QA frameworks
  • Knowledge of observability tools for LLMs
  • Solid scripting experience in Python
  • Knowledge of different LLM providers (OpenAI, Gemini, Anthropic, Mistral, etc.)
  • Exposure to RAG pipelines, recommendation systems, or model performance benchmarking
  • Strong analytical and debugging skills, with a detail-oriented mindset
What we offer:
  • A culture of innovation
  • Endless learning opportunities
  • Talented peers
  • Work-life balance
  • Flexible schedules
  • Remote work options
  • A great culture
  • Recognition & rewards

Additional Information:

Job Posted:
January 02, 2026

Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for QA LLM Engineer

Middle QA Automation Engineer

We are seeking a motivated QA Automation Engineer to join our team and contribut...
Location
Location
Salary
Salary:
Not provided
maddevs.io Logo
Mad Devs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of hands-on manual testing experience
  • 2+ years of proven experience as a QA engineer with strong skills in Python-based test automation
  • Proficient in testing web applications and AI-powered platforms on both real devices and emulators
  • Familiar with CI/CD workflows, monitoring tools, and version control systems such as Git
  • Comfortable working in fast-paced, distributed startup environments, demonstrating the ability to work independently without micromanagement
  • Clear and effective communicator, capable of collaborating across teams and time zones while maintaining thorough documentation
  • Language skills: English proficiency at B2-C1 level and Russian at B2 level
Job Responsibility
Job Responsibility
  • Test AI/LLM-based client-server applications, focusing on functionality, performance, and reliability
  • Develop and maintain automated test scripts in Python
  • perform manual testing when necessary
  • Utilize QA tools such as Playwright, Selenium, Postman, and TestRail to ensure thorough testing coverage
  • Guarantee quality across infrastructure components, CI/CD pipelines, and integrations
  • Prepare and update detailed test documentation, test scenarios, and defect reports
  • Collaborate closely with developers, DevOps engineers, and product managers to align priorities and deliver high-quality software releases
What we offer
What we offer
  • Flexible working hours
  • Remote-first culture
  • Long-term projects
  • Salary in dollars
  • Professional communities
  • Onsite business trips
  • Training budget
  • Paid conferences
Read More
Arrow Right

Senior AI Software Developer

The Senior AI Engineer owns end-to-end delivery of AI features—from design to pr...
Location
Location
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 7-10 years’ experience
  • LLMs & Agents: Prompt engineering, function/tool calling, orchestration frameworks, RAG
  • ML/DS: Evaluation metrics (precision/recall, BLEU/ROUGE where relevant), error analysis
  • Data/RAG: Embeddings, similarity (cosine/IP), chunking, rerankers, vector DB operations
  • Backend: Python (FastAPI/Flask), microservices patterns
  • MLOps/Infra: Docker, Kubernetes, CI/CD, artifact management, GPU scheduling
  • Observability: Metrics/logging/tracing, dashboards, automated evaluation pipelines
  • Frameworks: PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex
  • Data: Pandas, SQL/NoSQL, Parquet/Arrow, Kafka/queues
Job Responsibility
Job Responsibility
  • Translate high-level designs into clear component contracts, APIs, and service boundaries
  • Implement LLM integrations, RAG pipelines, agents, tool/function calling, and prompt strategies
  • Own feature delivery for sprints/releases
  • maintain high code quality and documentation
  • Fine-tune models when needed
  • design evaluation harnesses and metrics
  • Build A/B testing setups
  • track accuracy, latency, robustness, and task success rates
  • Conduct error analysis
  • iterate using feedback efficacy loops and prompt refinement
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
Read More
Arrow Right
New

Senior QA Engineer with AI experience

N-iX is looking for a Senior QA Engineer with AI experience to join our team. We...
Location
Location
Ukraine
Salary
Salary:
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of experience in manual QA, with exposure to frontend, backend and AI testing
  • Solid experience with API testing using tools such as Postman, REST Client, or similar
  • Experience testing web UIs and understanding of cross-browser/cross-environment considerations
  • Ability to read and interpret API documentation, data schemas, and system architecture descriptions
  • Hands-on experience working on projects that included AI, ML, or NLP components — particularly validating outputs that are probabilistic or context-dependent
  • Familiarity with the concept of RAG (Retrieval-Augmented Generation) or LLM-based systems
  • Strong analytical thinking — especially the ability to assess whether an AI response is contextually correct, not just technically non-null
  • Good understanding of test documentation practices: test plans, test cases, bug reports, traceability
  • English level at least Upper-Intermediate
Job Responsibility
Job Responsibility
  • Follow a phased QA approach — begin with CMS and backend testing to establish a reliable baseline of expected system behavior, then apply those insights to validate AI agent outputs effectively
  • Design and execute test cases for REST APIs, covering functional correctness, edge cases, error handling, authentication, and data integrity
  • Perform UI testing across core user journeys, validating layout, behavior, and integration with backend services
  • Transition into AI output validation once the deterministic layers are stable — using your knowledge of business rules to identify inconsistencies, hallucinations, or degraded outputs in agent responses
  • Document and maintain test cases, test plans, and bug reports in a structured and traceable way
  • Participate in requirement reviews and technical discussions to identify testability gaps early
  • Collaborate with the Lead Big Data/AI Engineer and AI team to understand RAG pipeline behavior, document ingestion flows, and output quality expectations
  • Contribute to building reusable test assets and QA processes as the project scales
What we offer
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
Read More
Arrow Right

Prompt Engineer

Fullpath is a growing tech company in the automotive space with hubs across the ...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
fullpath.com Logo
Fullpath
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Native level verbal and written English with excellent communication, presentation, and interpersonal skills
  • 2+ years professional experience in product or R&D department
  • Strong product sense and empathy towards users, customers and their needs
  • Creative problem solver with a sense for business needs, while having some technical aptitude
  • Growth mindset
  • Strong analytical and troubleshooting skills with attention to detail
Job Responsibility
Job Responsibility
  • Design, prototype and iterate AI agents and LLM workflows from ideation to execution
  • Monitor live systems (dashboards, A/B tests, validation pipelines), triage issues, and implement fixes for hallucinations and regressions
  • Utilize prompt and Gen-AI engineering techniques and tools such as RAG, MCPs, prompt chaining, few-shot prompting
  • QA and evaluate end user-facing AI output
  • Collaborate with stakeholders across product, marketing, CX, and R&D teams
  • Generate and QA background and conversational agents
What we offer
What we offer
  • Family-friendly environment and flexible working hours
  • An awesome global team of forward-thinking, innovative go-getters
  • Integrate with tech titans: work directly with APIs from Google, Facebook, Microsoft, and more
  • Be part of a rapidly scaling company poised for the future
  • Learning and growth opportunities within a fast-paced tech startup environment
  • Clear career advancement path for strong performers
  • We are committed to setting each other up for success. As a member of our team, you will work in an environment that encourages growth, initiative taking, and continuous mutual feedback in order to reach your full potential
  • Cibus and lots of yummy treats
  • Fulltime
Read More
Arrow Right

Prompt Engineer

Fullpath is a growing tech company in the automotive space with hubs across the ...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
fullpath.com Logo
Fullpath
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Native level verbal and written English with excellent communication, presentation, and interpersonal skills
  • 2+ years professional experience in product or R&D department
  • Strong product sense and empathy towards users, customers and their needs
  • Creative problem solver with a sense for business needs, while having some technical aptitude
  • Growth mindset
  • Strong analytical and troubleshooting skills with attention to detail
Job Responsibility
Job Responsibility
  • Design, prototype and iterate AI agents and LLM workflows from ideation to execution
  • Monitor live systems (dashboards, A/B tests, validation pipelines), triage issues, and implement fixes for hallucinations and regressions
  • Utilize prompt and Gen-AI engineering techniques and tools such as RAG, MCPs, prompt chaining, few-shot prompting
  • QA and evaluate end user-facing AI output
  • Collaborate with stakeholders across product, marketing, CX, and R&D teams
  • Generate and QA background and conversational agents
What we offer
What we offer
  • Family-friendly environment and flexible working hours
  • An awesome global team of forward-thinking, innovative go-getters
  • Integrate with tech titans: work directly with APIs from Google, Facebook, Microsoft, and more
  • Be part of a rapidly scaling company poised for the future
  • Learning and growth opportunities within a fast-paced tech startup environment
  • Clear career advancement path for strong performers
  • We are committed to setting each other up for success. As a member of our team, you will work in an environment that encourages growth, initiative taking, and continuous mutual feedback in order to reach your full potential
  • Cibus and lots of yummy treats
  • Fulltime
Read More
Arrow Right

Lead Software Engineer, Front End

Location
Location
United States , San Francisco; New York
Salary
Salary:
Not provided
kiddom.co Logo
Kiddom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with modern FE Frameworks
  • Have used AI agents such as Cursor or Claude Code to build software
  • Have used markdown files within codebase to guide the behavior of the coding agent
  • Have built a system that uses large language models and/or RAG to solve a problem or answer a user query
  • Have lead a team of front end engineers previously
  • Have helped in task estimation, task assignment and ensuring the quality and timeliness of the teams output
  • Have lead teams developing user facing features
  • Have worked closely with Product managers, Designers and QA
  • Fulltime
Read More
Arrow Right

Prompt Engineer

Fullpath is a growing tech company in the automotive space with hubs across the ...
Location
Location
Israel , Tel Aviv; Jerusalem
Salary
Salary:
Not provided
fullpath.com Logo
Fullpath
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Native level verbal and written English with excellent communication, presentation, and interpersonal skills
  • 2+ years professional experience in product or R&D department
  • Strong product sense and empathy towards users, customers and their needs
  • Creative problem solver with a sense for business needs, while having some technical aptitude
  • Growth mindset
  • Strong analytical and troubleshooting skills with attention to detail
Job Responsibility
Job Responsibility
  • Design, prototype and iterate AI agents and LLM workflows from ideation to execution
  • Monitor live systems (dashboards, A/B tests, validation pipelines), triage issues, and implement fixes for hallucinations and regressions
  • Utilize prompt and Gen-AI engineering techniques and tools such as RAG, MCPs, prompt chaining, few-shot prompting
  • QA and evaluate end user-facing AI output
  • Collaborate with stakeholders across product, marketing, CX, and R&D teams
  • Generate and QA background and conversational agents
What we offer
What we offer
  • Family-friendly environment and flexible working hours
  • An awesome global team of forward-thinking, innovative go-getters
  • Integrate with tech titans: work directly with APIs from Google, Facebook, Microsoft, and more
  • Be part of a rapidly scaling company poised for the future
  • Learning and growth opportunities within a fast-paced tech startup environment
  • Clear career advancement path for strong performers
  • We are committed to setting each other up for success. As a member of our team, you will work in an environment that encourages growth, initiative taking, and continuous mutual feedback in order to reach your full potential
  • Cibus and lots of yummy treats
  • Fulltime
Read More
Arrow Right

Prompt Engineer

Fullpath is a growing tech company in the automotive space with hubs across the ...
Location
Location
Israel , Tel Aviv or Jerusalem
Salary
Salary:
Not provided
fullpath.com Logo
Fullpath
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Native level verbal and written English with excellent communication, presentation, and interpersonal skills
  • 2+ years professional experience in product or R&D department
  • Strong product sense and empathy towards users, customers and their needs
  • Creative problem solver with a sense for business needs, while having some technical aptitude
  • Growth mindset
  • Strong analytical and troubleshooting skills with attention to detail
Job Responsibility
Job Responsibility
  • Design, prototype and iterate AI agents and LLM workflows from ideation to execution
  • Monitor live systems (dashboards, A/B tests, validation pipelines), triage issues, and implement fixes for hallucinations and regressions
  • Utilize prompt and Gen-AI engineering techniques and tools such as RAG, MCPs, prompt chaining, few-shot prompting
  • QA and evaluate end user-facing AI output
  • Collaborate with stakeholders across product, marketing, CX, and R&D teams
  • Generate and QA background and conversational agents
What we offer
What we offer
  • Family-friendly environment and flexible working hours
  • An awesome global team of forward-thinking, innovative go-getters
  • Integrate with tech titans: work directly with APIs from Google, Facebook, Microsoft, and more
  • Be part of a rapidly scaling company poised for the future
  • Learning and growth opportunities within a fast-paced tech startup environment
  • Clear career advancement path for strong performers
  • We are committed to setting each other up for success. As a member of our team, you will work in an environment that encourages growth, initiative taking, and continuous mutual feedback in order to reach your full potential
  • Cibus and lots of yummy treats
Read More
Arrow Right