Platform Architect - Search & Retrieval Systems Job at AlphaSense (Bengaluru)

Principal Search Architect

Join the team that powers one of the most heavily used and most visible capabili...

Location

India, Hyderabad

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python, OR equivalent experience
Extensive experience defining and evolving end-to-end Search architectures, with a proven track record of setting architectural direction, guiding multiple teams, and shaping long-term platform strategy across organizational boundaries, plus solid hands-on knowledge of the ranking, retrieval, graph-based, and substrate/platform layers that power large-scale discovery and reasoning experiences
Deep expertise in indexing, retrieval, ranking, and query processing in production environments
Solid systems programming background with languages such as C, C++, or C#
Proven ability to define and communicate architectural strategies that guide multi-year engineering investment
Demonstrated cross-org leadership, with the ability to influence without authority and align diverse stakeholders
Solid problem decomposition skills and comfort operating in ambiguous technical spaces
Advanced understanding of AI-assisted Search concepts, including semantic retrieval, embeddings, evaluation, and responsible AI use
Exceptional communication and technical storytelling skills

Job Responsibility

Define the long-term architectural direction for Windows Search, including indexing pipelines, retrieval systems, ranking, and semantic enrichment
Lead complex, cross-team technical efforts spanning OS components, cloud-assisted pipelines, and on-device AI systems
Serve as the architectural authority for Search-related design reviews, tradeoff discussions, and platform decisions
Drive architectural clarity across boundaries: Search Platform, Indexer, AI models, telemetry, reliability, and user-facing surfaces
Ensure Search systems meet high reliability, performance, and quality bars, informed by telemetry, RQV signals, and customer impact
Anticipate future needs for agentic and AI-driven Search, identifying capability gaps and guiding multi-year investments
Partner with Product Managers to translate customer scenarios into durable technical primitives and measurable quality signals
Mentor senior engineers and architects, raising the bar on design rigor, system thinking, and operational excellence
Influence engineering standards, design patterns, and best practices across Search and adjacent platform teams

Fulltime

Staff AI Context Engineer

MagicSchool is seeking a Staff AI Context Engineer to architect and enhance the ...

Location

United States

Salary:

205000.00 - 240000.00 USD / Year

EdTech Jobs

Expiration Date

Until further notice

Requirements

Deep Knowledge Systems Experience: 5+ years building large-scale information systems with at least 2+ years in staff/senior roles. Extensive hands-on experience with RAG systems, knowledge graphs, or semantic search platforms in production environments.
Graph Database Expertise: Deep experience with graph databases (Neo4j, Neptune, or similar), including schema design, query optimization (Cypher, Gremlin), and building graph-based applications.
RAG & Retrieval Mastery: Demonstrated expertise building production RAG systems including embedding selection, chunking strategies, hybrid search, reranking, and retrieval evaluation. Familiarity with vector databases (pgvector, Pinecone, Weaviate, Qdrant).
Embedding & NLP Background: Strong understanding of embedding models (sentence transformers, domain-specific embeddings), fine-tuning approaches, and semantic similarity. Experience with document processing, entity extraction, and text chunking for optimal retrieval.
Technical Stack: Strong coding skills in Python and/or TypeScript/Node.js. Experience with our stack (TypeScript, Node.js, PostgreSQL, NextJS, Supabase) plus graph databases and vector stores. Familiarity with LLM APIs and context management patterns.
Information Architecture: Deep understanding of information retrieval theory, semantic search, knowledge representation, and strategies for organizing complex domain knowledge for both human and AI consumption.
Leadership & Impact: Track record of architecting complex knowledge systems, making high-leverage technical decisions about information architecture, and mentoring engineers on sophisticated retrieval and graph concepts.

Job Responsibility

Knowledge Graph & Semantic Architecture: Architect and implement graph-based knowledge systems (Neo4j, Neptune, etc) that represent educational content relationships, standards alignments, prerequisite chains, curriculum coherence, learning progressions, and pedagogical connections.
Graph Schema & Ontology Development: Design and evolve ontologies and schemas for educational content, defining entity types (standards, concepts, skills, assessments), relationship semantics, and property models.
GraphRAG Implementation: Build GraphRAG systems that combine knowledge graph traversal with vector similarity, enabling agents to retrieve contextually connected educational materials.
Retrieval Pipeline Architecture: Architect and implement sophisticated retrieval-augmented generation pipelines including hybrid search (dense + sparse), multi-stage retrieval, reranking strategies, and query understanding.
Embedding & Vectorization Strategy: Design and operationalize embedding pipelines for educational content, selecting and fine-tuning embedding models, implementing chunking strategies, and managing vector stores at scale.
Retrieval Evaluation & Optimization: Design evaluation pipelines that measure retrieval precision, recall, MRR, and NDCG across educational content types. Continuously optimize retrieval quality.
Document Ingestion & Processing: Build robust ingestion systems that process structured and unstructured educational content, extracting entities, relationships, and metadata for knowledge base population.
Semantic Parsing & Extraction: Implement NLP pipelines for educational content that extract key concepts, prerequisite relationships, learning objectives, and pedagogical metadata.
Memory & Context Management: Invent and operationalize memory compaction mechanisms, session state management, and cross-conversation memory patterns that allow agents to maintain coherence across extended teaching workflows.
Context Evaluation & Monitoring: Design evaluation frameworks that measure retrieval precision, token relevance, attention allocation, and reasoning coherence as context evolves across sessions.

What we offer

Flexibility of working from home.
Unlimited time off.
Choice of employer-paid health insurance plans. Dental and vision are also offered at very low premiums.
Generous stock options, vested over 4 years.
401k match.
Monthly wellness stipend.

Fulltime

Senior Product Manager, Data & Retrieval

Harvey is building the AI platform for the world’s top legal and professional se...

Location

United States, San Francisco

Salary:

178500.00 - 241500.00 USD / Year

Harvey

Expiration Date

Until further notice

Requirements

5+ years of experience building or managing search, retrieval, recommendation, or data platforms at scale
Experience working with complex, heterogeneous, or domain-specific datasets with structured + unstructured data
Understanding of modern retrieval methods, including hybrid search (lexical + vector), dense retrieval, re-ranking, embeddings, chunking strategies, and index optimization
Hands-on experience with LLMs or RAG frameworks (evaluation, grounding, hybrid pipelines, query rewriting, LLM-as-a-judge, retrieval metrics)
Ability to partner with engineers on technical architecture, with enough depth to challenge assumptions, propose solutions, and influence design
A product mindset for search—balancing user needs, domain complexity, and system constraints to propose high-leverage improvements

Job Responsibility

Drive the roadmap and strategy for Harvey’s “Data Factory”, ensuring we scale our data 100x through new platforms that build the ‘legal index’ of the world
Work with internal operations and external data providers to methodically expand coverage, accelerate execution, and improve dataset quality
Own and evolve Harvey’s end-to-end data architecture—from ingestion and transformation to storage, indexing, and retrieval—ensuring performance, reliability, and scalability for LLM-powered products
Partner with Applied AI engineers to build and optimize retrieval systems, embeddings, search models, and evaluation frameworks
Architect and oversee large-scale ingestion pipelines that aggregate, normalize, and continuously update millions of heterogeneous legal documents across global jurisdictions
Collaborate cross-functionally with Product Engineering, Applied AI, Research, and Platform teams to deliver high-quality production systems that support reasoning, summarization, and legal research workflows

What we offer

Comprehensive health, dental and vision coverage
Retirement benefits (401k match up to 4%)
Flexible PTO
Equity plan
Bonus

Fulltime

Senior Software Engineer (Search / Retrieval)

Workato transforms technology complexity into business opportunity. As the leade...

Location

United States, Palo Alto

Salary:

Not provided

Workato

Expiration Date

Until further notice

Requirements

Bachelor's/Master's/PhD degree in Statistics, Mathematics, Computer Science, or another quantitative field
7+ years of backend engineering experience with 3+ years in search, information retrieval, or related fields
Strong proficiency in Python
Hands-on experience with search engines (Opensearch or Elasticsearch)
Strong understanding of information retrieval concepts spanning traditional methods (TF-IDF, BM25) and modern neural search techniques (vector embeddings, transformer models)
Experience with text processing, NLP, and relevance tuning
Experience with relevance evaluation metrics (NDCG, MRR, MAP)
Experience with large-scale distributed systems
Strong analytical and problem-solving skills
Strong communication abilities to explain technical concepts

Job Responsibility

Lead the design, development, and optimization of intelligent search systems that leverage machine learning at their core
Build end-to-end retrieval pipelines that incorporate advanced techniques in query understanding, ranking, and entity recognition
Lead the development of our advanced search cluster that can scale to millions of documents across customers and data sources
Deploy learning-to-rank models that optimize relevance using behavioral signals, embeddings, and structured feedback
Build and scale robust Entity Recognition pipelines that enhance document understanding, enable contextual disambiguation, and support entity-aware retrieval
Architect next-gen search infrastructure capable of supporting highly dynamic document corpora and real-time indexing
Drive improvements in query construction, indexing and search performance
Stay up to date with the latest improvements in search and indexing technologies
Collaborate with product and applied research teams to translate user needs into data-informed search innovations
Produce clean, scalable code and influence system architecture and roadmap across the relevance and platform stack

Search Machine Learning Research Engineer

Perplexity is seeking an experienced Senior Machine Learning Engineer to help bu...

Location

Germany, Berlin

Salary:

Not provided

Perplexity

Expiration Date

Until further notice

Requirements

Deep understanding of search and retrieval systems, including quality evaluation principles and metrics
Proven track record with large-scale search or recommender systems
Strong proficiency with PyTorch, including experience in distributed training techniques and performance optimization for large models
Expertise in representation learning, including contrastive learning and embedding space alignment for multilingual and multimodal applications
Strong publication record in AI/ML conferences or workshops (e.g., NeurIPS, ICML, ICLR, ACL, CVPR, SIGIR)
Self-driven, with a strong sense of ownership and execution
Minimum of 3 years (preferably 5+) working on search, recommender systems, or closely related research areas

Job Responsibility

Relentlessly push search quality forward — through models, data, tools, or any other leverage available
Architect and build core components of the search platform and model stack
Design, train, and optimize large-scale deep learning models using frameworks like PyTorch, leveraging distributed training (e.g., PyTorch Distributed, DeepSpeed, FSDP) and hardware acceleration, with a focus on retrieval and ranking models
Conduct advanced research in representation learning, including contrastive learning, multilingual, and multimodal modeling for search and retrieval
Deploy models — from boosting algorithms to LLMs — in a scalable and performant way
Build and optimize RAG pipelines for grounding and answer generation
Collaborate with Data, AI, Infrastructure, and Product teams to ensure fast and high-quality delivery

Fulltime

Search Senior Machine Learning Engineer

Perplexity is seeking an experienced Senior Machine Learning Engineer to help bu...

Location

Belgrade, London, Berlin

Salary:

Not provided

Perplexity

Expiration Date

Until further notice

Requirements

Deep understanding of search and retrieval systems, including quality evaluation principles and metrics
Proven track record with large-scale search or recommender systems
Self-driven, with a strong sense of ownership and execution
Minimum of 5 years of working on search or recsys-related projects

Job Responsibility

Relentlessly push search quality forward—through models, data, tools, or any other leverage available
Architect and build core components of our search platform and model stack
Train and evaluate retrieval, ranking and classification models, including LLMs
Deploy models - from boosting to LLMs - in a scalable and performant way
Build and optimize RAG pipelines for grounding and answer generation
Collaborate with Data, AI, Infrastructure, and Product teams to ensure fast and high-quality delivery

Fulltime

Staff AI Engineer

As we work towards building out the Context Layer for the Agentic Enterprise, we...

Location

United States, Palo Alto

Salary:

Not provided

Workato

Expiration Date

Until further notice

Requirements

Bachelor's/Master's/PhD degree in Statistics, Mathematics, Computer Science, or another quantitative field
7+ years of backend engineering experience with 3+ years in search, information retrieval, or related fields
Strong proficiency in Python
Hands-on experience with search engines (Opensearch or Elasticsearch)
Strong understanding of information retrieval concepts spanning traditional methods (TF-IDF, BM25) and modern neural search techniques (vector embeddings, transformer models)
Experience with text processing, NLP, and relevance tuning
Experience with relevance evaluation metrics (NDCG, MRR, MAP)
Experience with large-scale distributed systems
Proficiency in Knowledge Graph construction and optimization is a plus
Strong analytical and problem-solving skills

Job Responsibility

Lead the development of advanced query understanding systems that parse natural language, resolve ambiguity, and infer user intent
Design and deploy learning-to-rank models that optimize relevance using behavioral signals, embeddings, and structured feedback
Build and scale robust Entity Recognition pipelines that enhance document understanding, enable contextual disambiguation, and support entity-aware retrieval
Architect next-gen search infrastructure capable of supporting highly dynamic document corpora and real-time indexing
Create and maintain graph-based knowledge systems that enhance LLM capabilities through structured relationship data
Drive improvements in query rewriting, intent classification, and semantic search, using both statistical and neural methods
Own the design of evaluation frameworks for offline/online relevance testing, A/B experimentation, and continual model tuning
Collaborate with product and applied research teams to translate user needs into data-informed search innovations
Produce clean, scalable code and influence system architecture and roadmap across the relevance and platform stack

What we offer

Vibrant and dynamic work environment
Multitude of benefits to enjoy inside and outside of work

Senior Staff Engineer, Applied AI

GEICO is seeking a Senior Staff Engineer, Applied AI to provide technical archit...

Location

United States, Chevy Chase, MD; Palo Alto, CA

Salary:

130000.00 - 260000.00 USD / Year

Geico

Expiration Date

Until further notice

Requirements

8 or more years of professional software engineering or applied machine learning experience
2 or more years working with Generative AI or LLM-based systems in production
Proven track record of architecting and delivering complex AI/ML capabilities that span multiple teams and have measurable business impact
Deep hands-on expertise with Python and modern AI frameworks including LangChain, LangGraph, LangSmith, LlamaIndex, Hugging Face, OpenAI/Anthropic APIs, and emerging agentic frameworks
Demonstrated experience building and deploying production RAG (Retrieval-Augmented Generation) systems including document ingestion, chunking strategies, vector search, and context retrieval
Demonstrated experience designing and operating production AI systems including multi-agent architectures, intelligent automation, and workflow orchestration
Strong understanding of agent architectures, workflow orchestration, retrieval-augmented generation (RAG), vector databases, knowledge graphs, and semantic reasoning
Familiarity with Agent-to-Agent (A2A) communication protocols and Model Context Protocol (MCP) for building interoperable AI systems
Experience ensuring platform scalability, cross-domain coherence, and alignment with AI platform capabilities and strategy
Strong expertise in distributed systems, microservices architecture, service design, performance optimization, and reliability engineering

Job Responsibility

Specify architectures and system decompositions for AI/ML capabilities that involve significant integrations and cross-team collaboration across multiple product areas
Provide technical architecture and leadership for medium to large, complex, cross-functional AI initiatives with visibility at the tech VP level
Architect and lead implementation of advanced Generative AI solutions including agent-based systems, intelligent automation, document intelligence, and decision support systems that span multiple business domains
Design and implement sophisticated agentic workflows that orchestrate multiple AI agents, tools, APIs, reasoning steps, and business logic to automate complex enterprise processes at scale
Question status quo with an eye for simpler designs and more secure approaches, influencing tech VPs to set direction for multiple teams
Build systems and platforms that meet the highest standards for scalability, resilience, performance, availability, security, and compliance
Identify and scope opportunities for automating business processes using AI across multiple product areas and business domains
Advance the state-of-the-art in applied AI by integrating knowledge graphs, vector reasoning, retrieval architectures, and multi-agent systems to solve complex business problems
Drive innovation by exploring new models, frameworks, reasoning techniques, and AI architectures and applying them strategically to high-impact business challenges
Run rigorous experimentation programs including hypothesis definition, A/B testing, measurement frameworks, and iterative improvement across production AI systems

What we offer

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance
Access to additional benefits like mental healthcare as well as fertility and adoption assistance
Workplace flexibility, including our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year

Fulltime

Platform Architect - Search & Retrieval Systems

AlphaSense

Location:
India, Bengaluru

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
January 04, 2026


Similar Jobs for Platform Architect - Search & Retrieval Systems

Principal Search Architect

Staff AI Context Engineer

Senior Product Manager, Data & Retrieval

Senior Software Engineer (Search / Retrieval)

Search Machine Learning Research Engineer

Search Senior Machine Learning Engineer

Staff AI Engineer

Senior Staff Engineer, Applied AI
