Technical Team Lead – LLM Systems

Balbix

Location:
India, Delhi NCR

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We’re hiring a hands-on Technical Team Lead to join our core LLM engineering team. You’ll work directly with the company’s leadership to design, build, and scale intelligent reasoning pipelines using LangGraph and AWS Bedrock. This is not a coordination role: you’ll lead by example, write production-grade code, solve complex problems, and help grow a high-performing technical team from day one.

Job Responsibility:

  • Architect and implement LangGraph-powered workflows and Bedrock-based inference
  • Collaborate closely with the founder and the head of AI on system design and product strategy
  • Build and manage stateful agent flows, tool orchestration, retries, and memory handling
  • Debug real-world issues across prompts, agent logic, and runtime behavior
  • Mentor and lead an initial team of 5 engineers, shaping engineering best practices
  • Own the performance, cost-efficiency, and observability of LLM pipelines

Requirements:

  • Strong CS fundamentals (B.Tech/M.Tech or equivalent)
  • 5+ years of backend or systems engineering experience
  • Experience with LLM orchestration tools like LangGraph, LangChain, or Bedrock agents
  • Deep Python skills with experience in async and event-driven programming
  • Proven track record shipping and maintaining production systems
  • Ability to work across layers: prompt logic, orchestration, and infrastructure

Nice to have:

  • Familiarity with Langfuse or tracing/observability tooling
  • Experience with vector stores, prompt versioning, or RAG architectures
  • Background in cybersecurity, risk reasoning, or enterprise software

What we offer:

  • Competitive salary
  • Meaningful equity
  • Fast-moving builder culture

Additional Information:

Job Posted:
December 23, 2025

Employment Type:
Fulltime
Work Type:
Hybrid work

Similar Jobs for Technical Team Lead – LLM Systems

AI/ML Technical Lead

We build breakthrough software products that power digital businesses. We are an...
Location:
India, Noida
Salary:
Not provided
3Pillar Global
Expiration Date:
Until further notice
Requirements:
  • 7+ years of experience in AI/ML development, including leading technical teams
  • Strong expertise in agentic AI concepts, multi-agent orchestration, and autonomous tool-using AI systems
  • Hands-on experience with LangChain (chains, agents, custom tools, memory, LLM orchestration)
  • Experience in Agentic AI
  • Experience in Computer Vision
  • Proficiency with modern LLMs (OpenAI, Anthropic, Llama, Mistral, etc.) and fine-tuning methods
  • Deep knowledge of Python and ML frameworks (PyTorch, TensorFlow, Hugging Face)
  • Experience building and deploying RAG systems using vector databases (Pinecone, Chroma, Weaviate, etc.)
  • Strong understanding of ML Ops, CI/CD, containerization (Docker), and cloud platforms (AWS, GCP, Azure)
Job Responsibility:
  • Lead the design, development, and deployment of advanced AI/ML solutions, including LLM-powered applications
  • Architect and implement agentic AI workflows, multi-agent systems, and autonomous reasoning pipelines
  • Build scalable applications using LangChain, retrieval-augmented generation (RAG), vector databases, and tool integrations
  • Mentor and lead a team of ML engineers, data scientists, and AI developers
  • Collaborate with cross-functional teams (Product, Engineering, Data) to define AI strategies and roadmaps
  • Optimize model performance, latency, reliability, and cost efficiency
  • Evaluate new AI frameworks, libraries, and models for integration into the stack
  • Ensure best practices for code quality, ML Ops, versioning, testing, and monitoring
Employment Type:
Fulltime

Technical Lead

At Spectro Cloud, we are in search of a talented individual to become an integra...
Location:
United States, San Jose
Salary:
Not provided
Spectro Cloud
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science or related technical field
  • 8+ years of software development experience (or 6+ years with a Master's degree)
  • Strong LLM/GenAI fundamentals: Solid understanding of large language models, prompt engineering, embeddings, vector search, RAG systems, and lightweight fine-tuning (LoRA/PEFT preferred)
  • Python expertise: Proficiency in Python and hands-on experience with AI/ML libraries such as Hugging Face, PyTorch, LangChain, LangGraph, FastAPI, or similar frameworks
  • LLM deployment experience: Familiarity with Kubernetes-based inference stacks including vLLM, llm-d, TensorRT, PyTorch Serve, or comparable deployment frameworks
  • Proficiency in at least one modern programming language such as Go, Java, or equivalent
  • Solid understanding of containerization and orchestration concepts, including Kubernetes
  • Deep understanding of microservices architecture and REST API design principles
  • Experience designing and building scalable, cloud-native applications
  • Analytical problem-solving: Ability to debug model outputs, improve retrieval accuracy, optimize latency, and iterate quickly through experiments
Job Responsibility:
  • Building production-grade AI systems - designing, implementing, and maintaining LLM-powered applications, agentic AI workflows, and RAG pipelines across multiple product use-cases
  • Actively participate in guided technical labs covering prompt engineering, vector databases, LLM deployment tooling, multi-agent orchestration, fine-tuning strategies, and evaluation techniques
  • Develop, refine, and operationalize LLM solutions, including prompt design, retrieval strategies, embedding pipelines, LangChain/LangGraph workflows, and API integrations using Python, Hugging Face, FastAPI, and similar frameworks
  • Ensuring the seamless operation of our platform through a combination of automation, scripting, and rigorous testing
  • Stay ahead of emerging AI trends - small models, efficient inference (vLLM/TensorRT), multimodal systems, on-device LLMs - and recommend tools, frameworks, or integrations that enhance our platform
  • Work closely with cross-functional teams to create scalable, dependable, and secure solutions that push boundaries
  • Stay current with industry trends and emerging technologies, thereby ensuring that our solutions remain innovative and ahead of the curve

Technical Lead - Full-Stack AI Engineer - Private Equity SaaS

Our client, a cutting-edge SaaS platform for private equity and venture capital,...
Location:
United States, Chicago
Salary:
170000.00 - 200000.00 USD / Year
Selby Jennings
Expiration Date:
Until further notice
Requirements:
  • 6+ years in full-stack development
  • Strong skills in Go, Python, Node.js, and modern frontend frameworks (React, Next.js).
  • Hands-on with LLMs (OpenAI, Anthropic preferred) and vector search (Pinecone, pgvector).
  • Experience deploying in cloud environments (AWS, Azure, GCP).
  • Startup-ready: thrives in ambiguity and owns projects end-to-end while moving fast and not overthinking.
Job Responsibility:
  • Build scalable full-stack applications integrating AI agents into a secure SaaS platform.
  • Develop and optimize LLM pipelines, prompts, and backend systems (Supabase, Postgres, vector DBs).
  • Architect APIs and microservices for data processing and retrieval.
  • Implement responsive, high-performance frontend interfaces for complex workflows.
  • Ensure compliance with SOC 2, GDPR, and SEC standards.
  • Collaborate with product and design teams to translate requirements into seamless user experiences.
What we offer:
  • equity
Employment Type:
Fulltime

Principal Software Engineer, AI Developer Tools

At Docker, we make app development easier so developers can focus on what matter...
Location:
United States, Seattle
Salary:
232000.00 - 319000.00 USD / Year
Docker
Expiration Date:
Until further notice
Requirements:
  • 10+ years software engineering experience with 3+ years in Staff or Principal Engineer roles
  • Deep expertise in AI/ML technologies with hands-on production experience building LLM-powered applications, AI agents, or AI-assisted developer tools
  • Strong understanding of LLM APIs (OpenAI, Anthropic, etc.), prompt engineering, agent orchestration frameworks, and practical applications of AI in software development workflows
  • Proven track record of architecting and building highly scalable distributed systems and developer-facing platforms
  • Production experience with modern cloud-native infrastructure including Kubernetes, GitOps deployment patterns, observability systems, and CI/CD pipelines
  • Proficiency in Go (preferred), Rust, Java, or Python with strong software engineering fundamentals
  • Experience designing developer tools, platform engineering systems, or internal tools that enable other teams
  • Exceptional product and platform mindset considering business outcomes, developer experience, and technical trade-offs
  • Strong communication skills with ability to influence technical and non-technical stakeholders across the organization
  • Track record of technical mentorship and elevating engineering teams' capabilities
Job Responsibility:
  • Define the long-term technical vision and architecture for AI-powered developer tools and the self-service platform that enables teams to build their own AI agents
  • Establish architectural patterns, technical standards, and best practices for LLM integration, AI agent development, and production AI systems serving developers
  • Lead technical strategy for platform capabilities including deployment frameworks (ArgoCD/GitOps), observability integration (Grafana), security controls, and operational tooling for AI developer tools
  • Design highly available, scalable infrastructure for hosting AI agents and developer tools with predictable performance and intelligent resource management
  • Drive technical decisions on AI technology choices, LLM provider strategies, prompt engineering approaches, and agent orchestration frameworks
  • Partner with Senior Manager and product leadership to align technical architecture with business objectives and productization opportunities
  • Architect and build production-ready AI agents for developer productivity including code review assistants, test generators, deployment diagnostics, and incident response automation
  • Design and implement the self-service platform infrastructure that reduces time-to-production for new AI tools from weeks to days
  • Build systems that accelerate adoption of AI-native development tools (Claude Code, Cursor, Warp) across Docker's engineering organization
  • Establish reliability, security, and performance standards for AI systems including SLOs, monitoring, incident response, and cost management
What we offer:
  • Freedom & flexibility: fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup: we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
Employment Type:
Fulltime

CTO / Technical Co-founder – AI

An elite AI engineering scale-up is launching a new enterprise-focused division ...
Location:
United States, Raleigh–Durham
Salary:
Not provided
Orbis Consultants
Expiration Date:
Until further notice
Requirements:
  • 8+ years leading technical teams, ideally building or scaling new ventures
  • Proven track record of recruiting and developing top talent
  • Hands-on expertise with AI/LLM systems in production
  • Comfort engaging with C-suite executives and navigating enterprise environments
  • Entrepreneurial mindset with strong communication skills
Job Responsibility:
  • Define the technical vision and delivery model for enterprise AI solutions
  • Recruit and lead a world-class engineering team
  • Partner with business leadership to secure large-scale contracts
  • Design and deliver production-grade AI systems that solve complex challenges
  • Stay ahead of the curve by turning emerging research into enterprise-ready applications
What we offer:
  • Competitive salary
  • 401k with matching
  • unlimited PTO
  • health coverage
  • significant equity
  • profit sharing model
Employment Type:
Fulltime

Machine Learning Team Lead

TradingView is the world’s #1 platform for all things investing. 100M+ users tru...
Location:
Cyprus; Georgia, Tbilisi
Salary:
Not provided
TradingView
Expiration Date:
Until further notice
Requirements:
  • 2+ years of experience in managing technical teams with the ability to organize workflows and build effective processes
  • Deep understanding of the ML project lifecycle: from idea and prototype to production and maintenance
  • Strong knowledge of NLP/LLM technologies: text generation and classification, embeddings, RAG, and other modern techniques
  • Excellent communication skills and experience working with various teams (ML, backend, QA, product, analytics)
  • Ability to define and maintain roadmaps and make system-level engineering decisions
  • Experience in prioritization, risk assessment, and managing technical debt
  • Proficiency in Python and modern development tools (Git, CI/CD, Docker, Kubernetes)
  • Experience in operating ML systems in production (monitoring, metrics, A/B testing, incident handling)
Job Responsibility:
  • Develop and enhance projects related to news processing (sentiment analysis, NER, classification, search, etc.)
  • Perform data analysis and preprocessing, prepare datasets, and build model pipelines
  • Design monitoring systems and evaluate the performance of ML systems
  • Lead a team of ML engineers working on NLP and LLM projects (news, content generation, recommendations, search, and chat systems)
  • Set tasks, prioritize work, manage deadlines, and ensure timely delivery
  • Collaborate with product and analytics teams to align goals and approaches
  • Support the technical growth of the team through mentoring, reviewing solutions, and assisting in system design
  • Improve development and deployment processes for ML solutions in production
  • Contribute to engineering efforts as a senior developer: design and implement key components, perform code reviews, and drive technical improvements
What we offer:
  • Flexible working hours and a hybrid work format
  • Well-equipped offices for focused and collaborative work
  • A global, distributed team of 500+ professionals
  • Learning, mentorship, and long-term career growth
  • Relocation support and private health insurance
  • Performance-based bonuses
  • TradingView Premium access
  • Regular team events and company-wide meetups

Senior Principal Technical Program Manager - ML Platform

Location:
Not provided
Salary:
231300.00 - 301975.00 USD / Year
Atlassian
Expiration Date:
Until further notice
Requirements:
  • 8+ years of experience on software teams as Development Manager, Technical Product Manager or TPM leading technical platforms areas
  • Deep domain experience in AI and/or Search. Example: Model Inference, Model Evaluation, Model Training, LLM Ops, Semantic Search, Search Relevance, etc.
  • Partner with Engineering in defining direction, strategy and execution at Platform level
  • Strategic thinking and ability to understand business objectives to translate them into technical problems and programs.
  • Technical understanding of systems involved. Willingness to develop domain expertise in the area they operate - storage, networking, authentication, capacity management, service deployments, etc.
  • TPMs are not expected to write or read code, but are expected to understand system flows, block architectures, APIs and such.
  • Experience defining and running end-to-end complex technical programs
  • Strong leadership, organizational, and communication skills
Job Responsibility:
  • Understand and stay up-to-date on latest innovations in AI and Search. Partner closely with engineering teams to translate these into practical platform evolution for Atlassian bringing value to our customers.
  • Analyze business objectives, customer needs, product adoption inhibitors and opportunities, industry trends, and based on these, in close collaboration with your stakeholders, define a long-term strategy and roadmap for your platform and product components.
  • Understand business objectives and translate them into technical systems problems that need to be prioritized and solved in the current business environment.
  • Define specific systems programs and create a plan of action for realizing those programs. Such programs could be around capacity planning, migration efforts, high availability, network architecture, performance optimization, reliability improvements and more.
  • Use your technical understanding of Atlassian and related systems to partner with and influence engineers and architects in making progress on these problems.
  • Responsible for taking a systematic approach to engineering problems. This includes: prioritizing tasks, scoping out the project, defining objectives, and making consistent progress against each of these.
  • Be accountable for the success of these technical programs by managing the entire lifecycle from initiation to forecasting, budgeting, scheduling, etc.
  • Manage complex dependencies and projects with a broad scope across the company
What we offer:
  • health and wellbeing resources
  • paid volunteer days

Senior Machine Learning Engineer (Team Lead)

As our Artificial Intelligence (AI) and Machine Learning (ML) Team Leader, you w...
Location:
Australia, South Bank
Salary:
Not provided
Flight Centre Brand
Expiration Date:
Until further notice
Requirements:
  • 7+ years delivering production grade ML or AI systems with proven commercial impact
  • 3+ years leading and mentoring engineers
  • Experience building AI agents, RAG systems or LLM powered applications in production
  • Demonstrated experience leading technical teams and managing complex AI programmes
  • Strong hands on experience across ML infrastructure, distributed systems and scalable AI architecture
  • Experience building and governing AI agent platforms including endpoints, gateways and tool orchestration
  • Familiarity with MCP servers and emerging agent communication standards and protocols
  • Experience defining evaluation frameworks, safety mechanisms and governance for LLM and agent based systems
  • Deep knowledge of Python, modern AI/ML frameworks and scalable AI platforms including Databricks
  • Strong expertise in Kubernetes and cloud native production environments
Job Responsibility:
  • Lead the development and productionisation of ML models, LLM powered systems and agent based applications
  • Define and build end to end MLOps including CI/CD, model registry, monitoring, drift detection and retraining for predictive ML systems
  • Establish LLMOps standards including context engineering, automated evaluation pipelines, red teaming, safeguards and policy guardrails
  • Architect and build AI agent workflows, endpoints, gateways and orchestration layers enabling secure tool access, structured reasoning and multi agent collaboration
  • Design and govern MCP servers and modern agent communication protocols to ensure interoperability, security and scalability
  • Implement strong observability across ML and GenAI systems including reliability, latency, evaluation metrics, usage tracking and cost control
  • Drive scalable ML infrastructure, feature stores and data platforms on Databricks
  • Oversee Kubernetes based deployments and cloud native AI infrastructure
  • Partner with senior stakeholders to prioritise and deliver multiple high impact AI initiatives
  • Coach and grow a high performing AI engineering team
What we offer:
  • Individualised, ongoing Learning & Development via communities of practice
  • Innovation Days
  • Dedicated Engineering Days
  • Access to 'LinkedIn Learning' for ongoing skills development
  • Women in PM&E group
  • Exclusive Staff Discounts
  • Travel Discounts
  • Career opportunities in a network of brands and businesses across the globe
  • Corporate Health Discounts
  • Mental Health Support and Employee Assistance Program for staff and family
Employment Type:
Fulltime