Job Description:
Microsoft’s Azure Data engineering team is leading the transformation of analytics in the world of data with products like databases, data integration, big data analytics, messaging & real-time analytics, and business intelligence. The products our portfolio include Microsoft Fabric, Azure SQL DB, Azure Cosmos DB, Azure PostgreSQL, Azure Data Factory, Azure Synapse Analytics, Azure Service Bus, Azure Event Grid, and Power BI. Our mission is to build the data platform for the age of AI, powering a new class of data-first applications and driving a data culture. Within Azure Data, the databases team builds and maintains Microsoft's operational Database systems. We store and manage data in a structured way to enable multitude of applications across various industries. We are on a journey to enable developer friendly, mission-critical, AI enabled operational Databases across relational, non-relational and OSS offerings. We research and develop algorithms, tools and libraries to enable AI-based information retrieval for unstructured data in Azure Data products. For example, we build market-leading vector search in multiple data products, for which innovate both on the research and engineering (e.g., see the DiskANN project). We are looking for expertise in one or more of information retrieval, NLP, representation learning, agentic memories, unstructured data processing, interaction between LLM and retrieval systems. We are looking for a senior applied scientist to join and shape the research and product roadmap for the team in this space.