CrawlJobs Logo

Platform Architect - Search & Retrieval Systems

alpha-sense.com Logo

AlphaSense

Location Icon

Location:
India , Bengaluru

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

AlphaSense is seeking an experienced engineering leader to own and scale our search platform that powers market intelligence across billions of documents. You'll tackle the challenge of building distributed systems that handle hundreds of queries per second with millisecond latency, while establishing engineering excellence that ensures reliability for our enterprise customers. This role is perfect for a seasoned engineer who loves large-scale data challenges and has a track record of building robust, high-performance systems. While search experience is valuable, we believe great engineers can master new domains – what matters most is your ability to build systems that scale and don't break.

Job Responsibility:

  • Scale Distributed Systems: Architect and optimize infrastructure handling billions of documents and hundreds of queries per second
  • Lead Platform Evolution: Drive the migration from legacy systems to modern architecture, ensuring zero downtime and improved performance
  • Build Engineering Excellence: Establish comprehensive monitoring, testing, and deployment practices that catch issues before customers do
  • Optimize Performance: Profile and tune systems from the infrastructure to the application level, balancing cost and performance
  • Drive Technical Strategy: Own the platform roadmap, making architectural decisions that will scale 10x
  • Mentor and Lead: Elevate the team's expertise in distributed systems and large-scale data challenges

Requirements:

  • 12+ years building and operating distributed systems in production
  • Experience with large-scale data platforms (billions of records) or high-throughput systems (100+ QPS)
  • Track record of improving system reliability and performance at scale
  • Deep expertise in distributed systems fundamentals: sharding, replication, consistency, partition tolerance
  • Strong performance optimization skills - you can profile, diagnose, and fix bottlenecks across the stack
  • Experience with data pipeline architecture, real-time processing, or database internals
  • Excellence in building observable systems with comprehensive monitoring and alerting
  • History of leading technical initiatives and mentoring engineering teams

Nice to have:

  • Experience with search platforms (Vespa, Elasticsearch, Solr) or similar large-scale data systems
  • Deep knowledge of Kubernetes, CRDs, and infrastructure as code
  • Background in information retrieval, ranking systems, or recommendation engines
  • Familiarity with hybrid search approaches (lexical and vector)
  • Experience with JVM-based systems and tuning
  • Knowledge of modern engineering practices from high-growth companies

Additional Information:

Job Posted:
January 04, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Platform Architect - Search & Retrieval Systems

Staff AI Context Engineer

MagicSchool is seeking a Staff AI Context Engineer to architect and enhance the ...
Location
Location
United States
Salary
Salary:
205000.00 - 240000.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep Knowledge Systems Experience: 5+ years building large-scale information systems with at least 2+ years in staff/senior roles. Extensive hands-on experience with RAG systems, knowledge graphs, or semantic search platforms in production environments.
  • Graph Database Expertise: Deep experience with graph databases (Neo4j, Neptune, or similar), including schema design, query optimization (Cypher, Gremlin), and building graph-based applications.
  • RAG & Retrieval Mastery: Demonstrated expertise building production RAG systems including embedding selection, chunking strategies, hybrid search, reranking, and retrieval evaluation. Familiarity with vector databases (pgvector, Pinecone, Weaviate, Qdrant).
  • Embedding & NLP Background: Strong understanding of embedding models (sentence transformers, domain-specific embeddings), fine-tuning approaches, and semantic similarity. Experience with document processing, entity extraction, and text chunking for optimal retrieval.
  • Technical Stack: Strong coding skills in Python and/or TypeScript/Node.js. Experience with our stack (TypeScript, Node.js, PostgreSQL, NextJS, Supabase) plus graph databases and vector stores. Familiarity with LLM APIs and context management patterns.
  • Information Architecture: Deep understanding of information retrieval theory, semantic search, knowledge representation, and strategies for organizing complex domain knowledge for both human and AI consumption.
  • Leadership & Impact: Track record of architecting complex knowledge systems, making high-leverage technical decisions about information architecture, and mentoring engineers on sophisticated retrieval and graph concepts.
Job Responsibility
Job Responsibility
  • Knowledge Graph & Semantic Architecture: Architect and implement graph-based knowledge systems (Neo4j, Neptune, etc) that represent educational content relationships, standards alignments, prerequisite chains, curriculum coherence, learning progressions, and pedagogical connections.
  • Graph Schema & Ontology Development: Design and evolve ontologies and schemas for educational content, defining entity types (standards, concepts, skills, assessments), relationship semantics, and property models.
  • GraphRAG Implementation: Build GraphRAG systems that combine knowledge graph traversal with vector similarity, enabling agents to retrieve contextually connected educational materials.
  • Retrieval Pipeline Architecture: Architect and implement sophisticated retrieval-augmented generation pipelines including hybrid search (dense + sparse), multi-stage retrieval, reranking strategies, and query understanding.
  • Embedding & Vectorization Strategy: Design and operationalize embedding pipelines for educational content, selecting and fine-tuning embedding models, implementing chunking strategies, and managing vector stores at scale.
  • Retrieval Evaluation & Optimization: Design evaluation pipelines that measure retrieval precision, recall, MRR, and NDCG across educational content types. Continuously optimize retrieval quality.
  • Document Ingestion & Processing: Build robust ingestion systems that process structured and unstructured educational content, extracting entities, relationships, and metadata for knowledge base population.
  • Semantic Parsing & Extraction: Implement NLP pipelines for educational content that extract key concepts, prerequisite relationships, learning objectives, and pedagogical metadata.
  • Memory & Context Management: Invent and operationalize memory compaction mechanisms, session state management, and cross-conversation memory patterns that allow agents to maintain coherence across extended teaching workflows.
  • Context Evaluation & Monitoring: Design evaluation frameworks that measure retrieval precision, token relevance, attention allocation, and reasoning coherence as context evolves across sessions.
What we offer
What we offer
  • Flexibility of working from home.
  • Unlimited time off.
  • Choice of employer-paid health insurance plans. Dental and vision are also offered at very low premiums.
  • Generous stock options, vested over 4 years.
  • 401k match.
  • Monthly wellness stipend.
  • Fulltime
Read More
Arrow Right

Senior Product Manager, Data & Retrieval

Harvey is building the AI platform for the world’s top legal and professional se...
Location
Location
United States , San Francisco
Salary
Salary:
178500.00 - 241500.00 USD / Year
harvey.ai Logo
Harvey
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience building or managing search, retrieval, recommendation, or data platforms at scale
  • Experience working with complex, heterogeneous, or domain-specific datasets with structured + unstructured data
  • Understanding of modern retrieval methods, including hybrid search (lexical + vector), dense retrieval, re-ranking, embeddings, chunking strategies, and index optimization
  • Hands-on experience with LLMs or RAG frameworks (evaluation, grounding, hybrid pipelines, query rewriting, LLM-as-a-judge, retrieval metrics)
  • Ability to partner with engineers on technical architecture, with enough depth to challenge assumptions, propose solutions, and influence design
  • A product mindset for search—balancing user needs, domain complexity, and system constraints to propose high-leverage improvements
Job Responsibility
Job Responsibility
  • Drive the roadmap and strategy for Harvey’s “Data Factory”, ensuring we scale our data 100x through new platforms that build the ‘legal index’ of the world
  • Work with internal operations and external data providers to methodically expand coverage, accelerate execution, and improve dataset quality
  • Own and evolve Harvey’s end-to-end data architecture—from ingestion and transformation to storage, indexing, and retrieval—ensuring performance, reliability, and scalability for LLM-powered products
  • Partner with Applied AI engineers to build and optimize retrieval systems, embeddings, search models, and evaluation frameworks
  • Architect and oversee large-scale ingestion pipelines that aggregate, normalize, and continuously update millions of heterogeneous legal documents across global jurisdictions
  • Collaborate cross-functionally with Product Engineering, Applied AI, Research, and Platform teams to deliver high-quality production systems that support reasoning, summarization, and legal research workflows
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits (401k match up to 4%)
  • flexible PTO
  • equity plan
  • bonus
  • Fulltime
Read More
Arrow Right

Staff Software Development Engineer

We’re building a world of health around every individual — shaping a more connec...
Location
Location
United States
Salary
Salary:
118450.00 - 260590.00 USD / Year
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
April 30, 2026
Flip Icon
Requirements
Requirements
  • 7+ years of professional software engineering experience, with significant depth in search or data-intensive systems
  • 3+ years of experience in: Information Retrieval concepts (inverted indexes, scoring models, relevance tuning) Query understanding, Relevancy and Ranking
  • Experience designing and operating distributed, event-driven systems using Kafka or equivalent messaging platforms
  • Demonstrated execution excellence, from design through production support
  • Growth mindset with a track record of mentoring engineers and raising technical standards
  • Experience with Search platforms: OpenSearch, Elasticsearch, and/or Solr
  • Bachelor’s degree or equivalent experience (Highschool diploma plus 4 years relevant work experience)
Job Responsibility
Job Responsibility
  • Driving relevance and ranking improvements using techniques such as: Query rewriting, synonym management, intent detection, and field boosting
  • Learning-to-rank, behavioral signals, and offline/online evaluation frameworks
  • Provide technical leadership in solution design and architecture reviews, partnering with principal engineers, architects, and product teams
  • Design and optimize event-driven systems using Kafka (or similar) for real-time indexing, user interaction signals, and search telemetry
  • Ensure systems meet performance, scalability, and reliability goals, including: Sub-100ms query latency targets High-throughput ingestion and near-real-time indexing Fault tolerance and graceful degradation
  • Lead and participate in deep code reviews, enforcing best practices in search engineering, distributed systems, and data correctness
  • Influence the adoption of new search technologies, tooling, and frameworks to improve developer productivity and search quality
  • Serve as a technical mentor, providing guidance to senior and junior engineers and raising the overall search engineering bar across the organization
  • Act as a multiplier, identifying systemic issues in search quality or platform reliability and driving long-term architectural improvements
What we offer
What we offer
  • Affordable medical plan options
  • 401(k) plan (including matching company contributions)
  • Employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Colleague assistance programs
  • Tuition assistance
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (Search / Retrieval)

Workato transforms technology complexity into business opportunity. As the leade...
Location
Location
United States , Palo Alto
Salary
Salary:
Not provided
workato.com Logo
Workato
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelors/Masters/PhD degree in Statistics, Mathematics or Computer Science, or another quantitative field
  • 7+ years of backend engineering experience with 3+ years in search, information retrieval, or related fields
  • Strong proficiency in Python
  • Hands-on experience with search engines (Opensearch or Elasticsearch)
  • Strong understanding of information retrieval concepts spanning traditional methods (TF-IDF, BM25) and modern neural search techniques (vector embeddings, transformer models)
  • Experience with text processing, NLP, and relevance tuning
  • Experience with relevance evaluation metrics (NDCG, MRR, MAP)
  • Experience with large-scale distributed systems
  • Strong analytical and problem-solving skills
  • Strong communication abilities to explain technical concepts
Job Responsibility
Job Responsibility
  • Lead the design, development, and optimization of intelligent search systems that leverage machine learning at their core
  • Build end-to-end retrieval pipelines that incorporate advanced techniques in query understanding, ranking, and entity recognition
  • Lead the development of advanced our search cluster that can scale to millions of documents across customers and data sources
  • Deploy learning-to-rank models that optimize relevance using behavioral signals, embeddings, and structured feedback
  • Build and scale robust Entity Recognition pipelines that enhance document understanding, enable contextual disambiguation, and support entity-aware retrieval
  • Architect next-gen search infrastructure capable of supporting highly dynamic document corpora and real-time indexing
  • Drive improvements in query construction, indexing and search performance
  • Be up-to-date with the latest improvements in search and indexing technologies
  • Collaborate with product and applied research teams to translate user needs into data-informed search innovations
  • Produce clean, scalable code and influence system architecture and roadmap across the relevance and platform stack
Read More
Arrow Right

Search Machine Learning Research Engineer

Perplexity is seeking an experienced Senior Machine Learning Engineer to help bu...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
perplexity.ai Logo
Perplexity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of search and retrieval systems, including quality evaluation principles and metrics
  • Proven track record with large-scale search or recommender systems
  • Strong proficiency with PyTorch, including experience in distributed training techniques and performance optimization for large models
  • Expertise in representation learning, including contrastive learning and embedding space alignment for multilingual and multimodal applications
  • Strong publication record in AI/ML conferences or workshops (e.g., NeurIPS, ICML, ICLR, ACL, CVPR, SIGIR)
  • Self-driven, with a strong sense of ownership and execution
  • Minimum of 3 years (preferably 5+) working on search, recommender systems, or closely related research areas
Job Responsibility
Job Responsibility
  • Relentlessly push search quality forward — through models, data, tools, or any other leverage available
  • Architect and build core components of the search platform and model stack
  • Design, train, and optimize large-scale deep learning models using frameworks like PyTorch, leveraging distributed training (e.g., PyTorch Distributed, DeepSpeed, FSDP) and hardware acceleration, with a focus on retrieval and ranking models
  • Conduct advanced research in representation learning, including contrastive learning, multilingual, and multimodal modeling for search and retrieval
  • Deploy models — from boosting algorithms to LLMs — in a scalable and performant way
  • Build and optimize RAG pipelines for grounding and answer generation
  • Collaborate with Data, AI, Infrastructure, and Product teams to ensure fast and high-quality delivery
  • Fulltime
Read More
Arrow Right

Search Senior Machine Learning Engineer

Perplexity is seeking an experienced Senior Machine Learning Engineer to help bu...
Location
Location
Belgrade, London, Berlin
Salary
Salary:
Not provided
perplexity.ai Logo
Perplexity
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep understanding of search and retrieval systems, including quality evaluation principles and metrics
  • Proven track record with large-scale search or recommender systems
  • Self-driven, with a strong sense of ownership and execution
  • Minimum of 5 years of working on search or recsys-related projects
Job Responsibility
Job Responsibility
  • Relentlessly push search quality forward—through models, data, tools, or any other leverage available
  • Architect and build core components of our search platform and model stack
  • Train and evaluate retrieval, ranking and classification models, including LLMs
  • Deploy models - from boosting to LLMs - in a scalable and performant way
  • Build and optimize RAG pipelines for grounding and answer generation
  • Collaborate with Data, AI, Infrastructure and Product teams to ensure fast and high quality delivery
  • Fulltime
Read More
Arrow Right

Staff AI Engineer

As we work towards building out the Context Layer for the Agentic Enterprise, we...
Location
Location
United States , Palo Alto
Salary
Salary:
Not provided
workato.com Logo
Workato
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's/Master's/PhD degree in Statistics, Mathematics, Computer Science, or another quantitative field
  • 7+ years of backend engineering experience with 3+ years in search, information retrieval, or related fields
  • Strong proficiency in Python
  • Hands-on experience with search engines (Opensearch or Elasticsearch)
  • Strong understanding of information retrieval concepts spanning traditional methods (TF-IDF, BM25) and modern neural search techniques (vector embeddings, transformer models)
  • Experience with text processing, NLP, and relevance tuning
  • Experience with relevance evaluation metrics (NDCG, MRR, MAP)
  • Experience with large-scale distributed systems
  • Proficiency in Knowledge Graph construction and optimization is a plus
  • Strong analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Lead the development of advanced query understanding systems that parse natural language, resolve ambiguity, and infer user intent
  • Design and deploy learning-to-rank models that optimize relevance using behavioral signals, embeddings, and structured feedback
  • Build and scale robust Entity Recognition pipelines that enhance document understanding, enable contextual disambiguation, and support entity-aware retrieval
  • Architect next-gen search infrastructure capable of supporting highly dynamic document corpora and real-time indexing
  • Create and maintain graph-based knowledge systems that enhance LLM capabilities through structured relationship data
  • Drive improvements in query rewriting, intent classification, and semantic search, using both statistical and neural methods
  • Own the design of evaluation frameworks for offline/online relevance testing, A/B experimentation, and continual model tuning
  • Collaborate with product and applied research teams to translate user needs into data-informed search innovations
  • Produce clean, scalable code and influence system architecture and roadmap across the relevance and platform stack
What we offer
What we offer
  • vibrant and dynamic work environment
  • multitude of benefits they can enjoy inside and outside of their work lives
Read More
Arrow Right

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...
Location
Location
United States , Chevy Chase, MD; Palo Alto, CA
Salary
Salary:
130000.00 - 260000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8 or more years of professional software engineering or applied machine learning experience
  • 2 or more years working with Generative AI or LLM-based systems in production
  • Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
  • Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
  • Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
  • Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
  • Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
  • Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
  • Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
  • Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering
Job Responsibility
Job Responsibility
  • Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
  • Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
  • Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
  • Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
  • Question status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
  • Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
  • Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
  • Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
  • Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
  • Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right