CrawlJobs Logo

Senior Researcher - Efficient AI

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
India , Bangalore

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Generative AI is transforming how people create, collaborate, and communicate—redefining productivity across Microsoft 365 for customers worldwide. At Microsoft, we operate one of the largest collaboration and productivity platforms in the world, serving hundreds of millions of consumer and enterprise users. Delivering these AI experiences at scale requires solving some of the hardest efficiency challenges in modern AI systems. We are an applied research team focused on advancing efficiency across the AI stack, spanning models, ML frameworks, cloud infrastructure, and hardware. We drive mid- and long-term product innovation through close collaboration with research and product teams across the company. We communicate our research both internally and externally through internal technical reports, academic conference publications, open-source releases, and patents. Beyond producing research, we take responsibility for driving ideas through prototyping, validation, and production, with a strong bias toward real-world impact. The ideal Senior Researcher candidate will work across the full stack—from large-scale serving systems to hardware- and kernel-level optimizations—exploring algorithmic, systems, and hardware/software co-design techniques. Areas of focus include batching, routing, scheduling, caching, endpoint configuration, and GPU architecture–aware optimizations. This role emphasizes end-to-end ownership, with responsibility for identifying high-impact problems and driving research ideas through prototyping, validation, and deployment to deliver measurable customer impact.

Job Responsibility:

  • Formulate, develop, and evaluate new algorithmic and system-level approaches for end-to-end AI serving, using analytical modeling and large-scale measurement to study token-level latency, tail latency (p95/p99), throughput-per-dollar, cold-start behavior, warm pool strategies, and capacity planning under multi-tenant SLOs and variable sequence lengths
  • Design and experimentally evaluate endpoint configuration and execution policies, including batching, routing, and scheduling strategies, tensor and pipeline parallelism, quantization and precision profiles, speculative decoding, and chunked or streaming generation, and drive the most promising approaches through robust rollout and validation into production
  • Perform hardware- and kernel-aware optimization by collaborating closely with model, kernel, compiler, and hardware teams to align serving algorithms with attention/KV innovations and accelerator capabilities
  • Build and benchmark experimental prototypes and large-scale measurements to validate research ideas and drive them toward production readiness
  • produce clear technical documentation, design reviews, and operational playbooks
  • Publish research results, file patents, and, where appropriate, contribute to open-source systems and serving frameworks

Requirements:

  • Doctorate in relevant field
  • OR Master's Degree in relevant field AND 3+ years related research experience
  • OR Bachelor's Degree in relevant field AND 4+ years related research experience
  • OR equivalent experience
  • Demonstrated expertise in areas of algorithmic optimization, parallel computing, queuing and scheduling theory, and practical request orchestration under strict SLO constraints
  • Strong understanding of GPU architecture and memory hierarchies
  • Proficiency in C++ and Python for high-performance systems, with strong code quality and profiling/debugging skills
  • Proven record of research impact through publications and/or patents, and experience carrying ideas through to systems that operate at scale in real production environments
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Nice to have:

  • Deep understanding of transformer inference efficiency techniques such as sharding strategies, attention optimizations, paged KV caches, speculative decoding, LoRA, sequence packing or continuous batching, and quantization
  • 3+ years of experience with machine learning frameworks (e.g., PyTorch, TensorFlow) and inference serving frameworks (e.g., vLLM, Triton Inference Server, TensorRT-LLM, ONNX Runtime, Ray Serve, DeepSpeed-MII)
  • 3+ years of experience in GPU programming and optimization, with expert knowledge of CUDA, ROCm, Triton, PTX, CUTLASS, or similar GPU programming frameworks
  • Background in cost and performance modeling, autoscaling, and multi-region deployment or disaster recovery

Additional Information:

Job Posted:
March 01, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Researcher - Efficient AI

AI Efficiency Intern

The AI Efficiency Intern will work on HPE's Corporate Affairs Living Progress te...
Location
Location
United States , Spring
Salary
Salary:
35.00 - 40.25 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • College junior, senior, or masters' student pursuing computer science, computer engineering, data engineering, software engineering, or technology efficiency
  • Strong desire to help drive technology and AI efficiency globally
  • Microsoft Office skills
Job Responsibility
Job Responsibility
  • Support engagements with customers, governments, stakeholders and investors to showcase HPE's leadership in AI/IT efficiency and sustainability
  • Align customer and stakeholder insights to cross-functional HPE organizations
  • Perform research on various aspects of AI efficiency with focus on data and software efficiency attributes
  • Collaborate with the Living Progress team to recommend and implement changes to HPE's technical point of view on AI efficiency
  • Project management including developing project timeline and monitoring execution
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime
Read More
Arrow Right

Senior Design & Research Operations Specialist

We are seeking a Senior Design & Research Operations Specialist to help us scale...
Location
Location
Canada; United States , Toronto; Calgary; Vancouver
Salary
Salary:
126200.00 - 170800.00 CAD / Year
clio.com Logo
Clio
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in design, research, or product operations
  • Strong organizational, communication, and program-management skills
  • Comfort navigating ambiguity and bringing structure to complex systems
  • Proficiency with design and research tools (Figma, Miro, Maze, Pendo)
  • Experience maintaining process documentation and operational metrics
  • A pragmatic, collaborative approach to problem-solving and continuous improvement
  • Experience working in fast-paced, iterative product development environments
  • Deep understanding of good user experience, IA, and visual design
  • Demonstrate a keen interest in improving your craft by using AI
Job Responsibility
Job Responsibility
  • Lead the operational foundation for Design & Research, managing tools, systems, and budgets
  • Shape and maintain the rhythms and rituals that connect our teams within Design and within the organization
  • Coordinate research operations by managing participant recruitment, scheduling, incentives, and insights management
  • Develop and maintain shared documentation and guidelines that strengthen collaboration between Design, Research, Product, and Engineering
  • Identify and implement process improvements that increase clarity, reduce friction, and enable faster, more informed decisions
  • Design and evolve onboarding programs that help new designers and researchers ramp quickly
  • Track and report on key operational metrics including team health, delivery velocity, and efficiency
  • Partner with Design & Research leadership to align on operational vision, priorities, and long-term goals
  • Support the evaluation and rollout of new tools and vendors
  • Contribute to building a collaborative, high-performing culture
What we offer
What we offer
  • Top-tier health benefits, dental, and vision insurance
  • Hybrid work environment
  • Flexible time off policy, with an encouraged 20 days off per year
  • $2000 annual counseling benefit
  • RRSP matching and RESP contribution
  • Clioversary recognition program with special acknowledgement at 3, 5, 7, and 10 years
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

We are seeking a Senior AI Engineer (L4, Individual Contributor) to design, buil...
Location
Location
India , Chennai
Salary
Salary:
Not provided
arcadia.com Logo
Arcadia
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of professional software engineering experience
  • 3+ years in AI/ML development
  • Strong expertise in Python, PyTorch/TensorFlow, scikit-learn, and ML tooling (MLflow, LangChain)
  • Proficiency with SQL, cloud services (AWS), containers (Docker, Kubernetes), and distributed systems
  • Understanding of modern AI research (LLMs, diffusion models, transformers)
  • Experience deploying ML models in production with CI/CD
  • Strong analytical skills, ability to balance speed and rigor in experimentation
  • A passion for sustainability and the clean-energy mission
  • Experienced with building agentic pipelines with the latest models from Anthropic, Google, OpenAI, and more
Job Responsibility
Job Responsibility
  • Integrate with LLMs and be an expert in prompt engineering to derive the right results from the models with limited hallucination
  • Design and train ML/AI models (forecasting, NLP, graph learning, generative AI) to improve data quality, cost effectiveness, and system scalability
  • Deploy and optimize models for large-scale production workloads using Python-based services in AWS/Kubernetes environments
  • Build robust, automated data pipelines and ML Ops workflows for continuous training and deployment
  • Research and experiment with modern AI methods (transformers, foundation models, reinforcement learning) and adapt them to energy-sector challenges not limited to utility statements
  • Drive performance improvements in model accuracy, latency, and cost efficiency
  • Collaborate with Product, SRE, and Analytics teams to deliver AI-enabled features across Arcadia’s platform
  • Write clean, maintainable code, contribute to architecture reviews, and mentor junior engineers
  • Build true agentic workflows with multi-step processing incorporating RAG pipelines and MCPs
What we offer
What we offer
  • Competitive compensation and employee stock options
  • Hybrid/remote-first working model (India-based role, with global collaboration)
  • Flexible leave policy
  • Comprehensive medical insurance (self + family members)
  • Annual performance cycle + quarterly recognition awards
  • A supportive, diverse engineering culture grounded in empathy, teamwork, and innovation
  • Fulltime
Read More
Arrow Right

Senior Generative AI Engineer

The Citi Innovation Lab is a leader in creating new ideas, innovative technology...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with transformer-based models and their applications
  • Strong understanding of LLM, LLM model selection, benchmarking, and optimization
  • Experience with RAG systems and vector databases
  • Proficiency in developing and deploying AI agents
  • Knowledge of open-source models and methods, including benchmarks for evaluating AI performance
  • Knowledge of security risks and mitigation strategies for autonomous AI agents, including OWASP guidelines
  • Proficiency in Python and experience with libraries such as Pandas, Tabula, and TensorFlow/PyTorch
  • Strong problem-solving skills and attention to detail
  • Excellent communication and documentation skills
Job Responsibility
Job Responsibility
  • Develop and implement enterprise scale cutting edge models such as visual document understanding and text2code
  • Implement and Optimize vector-based retrieval systems for RAG by covering embedding models, ANN indexing, hybrid search, and re-ranking
  • Implement autonomous AI agents to implement adaptive, error resistant data extraction, and content validation tasks
  • Develop and deploy enterprise software applications using state of the art practices, such as micro services, modular code, as well as proficiency in writing unit and integration tests to ensure the accuracy and reliability of the AI applications
  • Ensure data privacy and security in all AI-driven processes, adhering to OWASP guidelines and Citi’s stringent authentication and authorization policies
  • Collaborate with cross-functional teams to integrate AI solutions into existing workflows
  • Document the development process and create comprehensive technical specifications
  • Manage and maintain AI applications, ensuring best practices in model management and versioning
  • Deploy resulting AI applications using industrial strength framework and processes, including Kubernetes and OpenShift for scalable and efficient operations on-premises
  • Ability to research and develop and utilize transformer-based models for enhanced application performance
  • Fulltime
Read More
Arrow Right

Senior Data Scientist - AI Modeling

As a Senior Data Scientist - AI Modeling at Baxter, you will work on creating an...
Location
Location
United States
Salary
Salary:
104000.00 - 143000.00 USD / Year
https://www.baxter.com/ Logo
Baxter
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in STEM (science, technology, engineering, math) related field or a similar quantitative analytics field
  • 4+ years of professional experience with a variety of data products / data science model / algorithm development and implementing in production
  • Software development experience
  • Experience with healthcare data and working in a HIPAA regulated environment preferred
  • Experience with varying database structures and large datasets preferred
  • Experience with modern data science tools, such as Spark, Scala, Python, Databricks
  • Experience in Microsoft Azure cloud environment is preferred
  • Proficiency with developing data visualization technology and capabilities (i.e., Power BI, Tableau)
  • Brings a drive for creatively applying pragmatic and scalable approaches to Machine Learning to tackle difficult problems affecting patients and providers
  • Passionate about working on a high-performance team toward a multi-year vision with incremental deliverables
Job Responsibility
Job Responsibility
  • Responsible for the development and implementation of predictive modeling algorithms and techniques to address unmet needs, customer/business problems and optimize user experiences
  • Conduct in-depth research to stay at the forefront of AI advancements, exploring opportunities to integrate predictive and generative AI models into our products and services
  • Predictive and generative AI Modeling
  • Formulate problem statements and hypotheses for diverse business challenges (clinical, operational and business process optimization problems)
  • Create Spark & Python code in Databricks to retrieve data from across disparate data sources and create new innovative actionable insights
  • Prepare data for effective model training
  • Develop, train, and evaluate predictive AI models using various tailored to specific problems
  • Continuously refine and optimize models for performance, scalability, and efficiency
  • Deploy models into production environments and supervise their performance
  • Identify opportunities where generative AI models can add value
What we offer
What we offer
  • Comprehensive medical and dental coverage starting on day one
  • Insurance coverage for basic life, accident, short-term and long-term disability, and business travel accident
  • Employee Stock Purchase Plan (ESPP) with discount
  • 401(k) Retirement Savings Plan with employee contributions and company matching
  • Flexible Spending Accounts
  • Educational assistance programs
  • Paid holidays
  • Paid time off ranging from 20 to 35 days based on length of service
  • Family and medical leaves of absence
  • Paid parental leave
  • Fulltime
Read More
Arrow Right

Senior Computer Vision and Machine Learning Research Scientist

We are seeking a skilled and innovative Senior Research Scientist to join one of...
Location
Location
United States , Seattle
Salary
Salary:
159750.00 - 234300.00 USD / Year
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD and 3+ years experience in Computer Science or a related field with a focus on computer vision, machine learning, artificial intelligence or related technical fields
  • Proven track record of research in machine learning, computer vision or related fields
  • Experience driving the ML development lifecycle and leveraging state-of-the-art research to deliver high quality models at scale
  • Strong computer vision fundamentals such as image processing, feature extractions, object detection, semantic segmentation, video analysis or action recognition
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Proficiency in Python and frameworks such as PyTorch, TensorFlow or Keras
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibility
Job Responsibility
  • Collaborate with other scientists, engineers and product managers to build proof-of-concepts to shape the Axon of tomorrow
  • Lead end-to-end research efforts in advanced computer vision, machine learning and gen-AI techniques for cloud and devices from multimodal data sources, including scene understanding, action recognition and anomaly detection
  • Design and implement responsible, privacy-preserving, efficient and scalable models for inference and analysis of visual data
  • Develop performance and quality metrics for CVML models and systems, and validate their effectiveness in real-world settings
  • Optimize algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Stay up-to-date with the latest research and advances in CVML and translate relevant findings into shipping Axon products
  • Contribute to academic publications, technical documentation, and patent disclosures to share insights and findings with the broader community
  • Coach and mentor junior scientists
What we offer
What we offer
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Fulltime
Read More
Arrow Right

Senior Staff Machine Learning Engineer (AI Agent)

At Cresta, the AI Agent team is on a mission to create state-of-the-art AI Agent...
Location
Location
United States; Canada
Salary
Salary:
Not provided
cresta.com Logo
Cresta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Mathematics, or a related field
  • Master’s or Ph.D. preferred, or equivalent professional experience
  • 7+ years of hands-on industry experience with AI and machine learning
  • 3+ years of experience working with LLMs in large-scale production environments
  • Expert knowledge of machine learning concepts and methods, especially those related to NLP, Generative AI, and working with LLMs
  • Proven leadership in designing and deploying AI solutions at scale
  • Extensive practical knowledge of modern machine learning frameworks and technologies (e.g., PyTorch, Tensorflow, Hugging Face, NumPy)
  • Experience with distributed systems and cloud-based AI infrastructure
  • Strong problem-solving and strategic thinking abilities
  • Proven ability to lead cross-functional teams and work collaboratively to deliver innovative AI solutions in production
Job Responsibility
Job Responsibility
  • Design, develop, and deploy Cresta’s AI Agent solutions and proprietary models
  • Focus on practical AI challenges such as improving reasoning, planning capabilities, and evaluation in real-world scenarios
  • Collaborate with cross-functional teams including front-end and back-end software engineers to integrate AI Agents into Cresta’s customer solutions
  • Lead initiatives to scale AI systems for production environments, ensuring performance and reliability across use cases
  • Contribute to solving cutting-edge problems in AI and help define the future roadmap for Cresta’s AI Agents
  • Innovate and research ways to improve security, cost-efficiency, and reliability of AI systems
What we offer
What we offer
  • Variety of medical, dental, and vision plans
  • Paid parental leave
  • Monthly Health & Wellness allowance
  • Work from home office stipend
  • Lunch reimbursement for in-office employees
  • PTO: 3 weeks in Canada
  • Base salary, equity, and a variety of benefits
  • Fulltime
Read More
Arrow Right

User Research Director

Mentimeter is an engagement tool with a clear goal in mind. To turn presentation...
Location
Location
Sweden , Stockholm
Salary
Salary:
Not provided
mentimeter.com Logo
Mentimeter
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of proven experience leading user research in fast-moving, product-led organisations
  • Collaborative leadership style, skilled at building trust across disciplines
  • Deep knowledge across multiple research methodologies, with the ability to align and integrate them effectively
  • Track record of influencing senior leaders and shaping product and service strategy through evidence-based recommendations
  • Exceptional communication and storytelling skills
  • Future-oriented mindset, committed to advancing the craft and experimenting with new approaches
  • Strong grounding in building scalable user research operations and infrastructure
  • Practical expertise in applying AI and automation to user research workflows
Job Responsibility
Job Responsibility
  • Define and own the user research agenda: Shape a proactive, company-wide view of the questions that matter most and ensure we have the evidence to answer them
  • Scale ResearchOps: Build the systems, processes, and infrastructure that make user research efficient, repeatable, and continuously compounding. Create frameworks that make insights easy to access, reuse, and activate across teams, ensuring research becomes a shared capability that grows in value over time
  • Advance mixed-method practice: Establish a mixed-method insights framework that combines behavioral data with qualitative understanding to tell the full story of our users
  • Embed insight where work happens: Operationalize user journey insights directly within product and design workflows, making them visible where decisions happen
  • Span product and service: Develop an integrated perspective that connects product experience with service delivery into a single view of the users
  • Develop the craft and team: Recruit, coach, and grow a strong user research team that operates collaboratively across product, design, engineering, and commercial
  • Influence at strategic level: Translate insights into clear recommendations that shape priorities, roadmaps, and company bets
  • Evolve practice through technology: Push the boundaries of user research by integrating emerging technologies like AI and automation to scale insight generation and continuously raise the standard of practice
What we offer
What we offer
  • Diverse and inclusive work environment supported by smart and driven colleagues
  • Continuous professional development
  • Access to a leadership program (including external personal coach) and relevant education
  • Growing company with lots of career opportunities
  • Healthy view on work-life balance
  • Competitive compensation and benefits package, including pension contributions
Read More
Arrow Right