Principal Researcher - Cloud and AI Infrastructure Job at Microsoft Corporation (Vancouver)

Senior Principal Researcher - Cloud and AI Infrastructure

Microsoft Research Asia – Vancouver lab, located in the vibrant city of Vancouve...

Location

Canada , Vancouver

Salary:

163000.00 - 296400.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in relevant field AND 6+ years related research experience
OR Master's Degree in relevant field AND 7+ years related research experience
OR Bachelor's Degree in relevant field AND 9+ years related research experience
OR equivalent experience
3+ years’ experience in research related to infrastructure design, computer architecture, or artificial intelligence
Experience publishing academic papers as a lead author or essential contributor
Experience participating in a top conference in relevant research domain
Experience in optimizing or designing hardware components and architectures to enhance performance, reliability, efficiency

Job Responsibility

Investigate and analyze emerging hardware technologies, trends, and advancements
Design and optimize hardware components, systems, and architectures to enhance performance, reliability, and efficiency
Conduct simulations, tests, and validations to ensure hardware designs meet required specifications and performance goals
Develop prototypes and proof-of-concept models to demonstrate new hardware technologies and applications
Identify opportunities for hardware improvements and cost reductions by staying informed about industry best practices and standards
Collaborate with cross-functional teams, including software researchers, designers, and engineers, to identify hardware requirements and develop innovative solutions
Partner with manufacturing vendors and production teams to transition innovative designs and concepts into deployable systems
Document research findings, design decisions, and technical specifications to facilitate knowledge sharing and collaboration within the organization

Fulltime

Principal AI Security Researcher

Microsoft Sentinel Platform NEXT R&D labs is the strategic incubation engine beh...

Location

United States , Multiple Locations

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in Statistics, Mathematics, Computer Science, Computer Security, or related field AND 3+ years experience in software development lifecycle, large-scale computing, threat analysis or modeling, cybersecurity, vulnerability research, and/or anomaly detection
OR Master's Degree in Statistics, Mathematics, Computer Science, Computer Security, or related field AND 4+ years experience in software development lifecycle, large-scale computing, threat analysis or modeling, cybersecurity, vulnerability research, and/or anomaly detection
OR Bachelor's Degree in Statistics, Mathematics, Computer Science, Computer Security, or related field AND 6+ years experience in software development lifecycle, large-scale computing, threat analysis or modeling, cybersecurity, vulnerability research, and/or anomaly detection
OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check
5+ years of experience in cybersecurity, AI, software development lifecycle, large-scale computing, modeling, and/or anomaly detection
5+ years of professional experience in security operations, pen-testing, researching cyber threats, understanding attacker methodology, tools, and infrastructure
Demonstrated autonomy and success driving zero-to-one (0→1) initiatives
ML background and hands-on experience

Job Responsibility

Security AI Research: be the security expert to our AI-focused team, helping evaluate our systems on real data, improve system inputs, triage and investigate AI-based findings, leverage AI and security experience to incubate and transform our products, educate applied scientists in cybersecurity
Collaboration: Partner with engineering, product, and research teams to translate scientific advances into robust, scalable, and production-ready solutions
AI/ML Research: design, development, and analysis of novel AI and machine learning models and algorithms for security and enterprise-scale applications
Experimentation & Evaluation: Design and execute AI experiments, simulations, and evaluations to validate models and system performance, ensuring measurable improvements
Customer Impact: Engage with enterprise customers and field teams to co-design solutions, gather feedback, and iterate quickly based on real-world telemetry and outcomes

Fulltime

Principal Data Infrastructure Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling, or data engineering
OR equivalent experience
4+ years in Big Data Infrastructure, DevOps, SRE, or Platform Engineering
3+ years of hands-on experience managing and scaling distributed systems—from bare-metal to cloud-native environments
2+ years deploying containerized applications using Kubernetes and Helm/Kustomize
Solid scripting and automation skills using Python, Bash, or PowerShell
Proven success in CI/CD pipeline management, release automation, and production troubleshooting
Experience working with Databricks for scalable data processing and analytics
Familiarity with security practices in infrastructure environments, including IAM, OAuth, and Kerberos administration

Job Responsibility

Architect and maintain scalable, reliable, and observable Big Data Infrastructure for mission-critical AI applications
Champion DevOps and SRE best practices—automated deployments, service monitoring, and incident response
Build a self-service big data platform that empowers data and platform engineers and researchers
Develop robust CI/CD pipelines and automate infrastructure provisioning using Infrastructure as Code tools (Bicep, Terraform, ARM)
Collaborate with Data Engineers, Data Scientists, AI Researchers, and Developers to deliver secure, seamless big data workflows
Lead technical design reviews and uphold a clean, secure, and well-documented codebase
Proactively identify and resolve bottlenecks in data pipelines and infrastructure
Optimize system performance across storage, compute, and analytics layers
Partner with Security teams to enhance system security (IAM, OAuth, Kerberos)
Embody and promote Microsoft’s values: Respect, Integrity, Accountability, and Inclusion

Fulltime

Principal Applied Researcher AI/NLP

At PointClickCare our mission is simple: to help providers deliver exceptional c...

Location

United States

Salary:

195800.00 - 217500.00 USD / Year

PointClickCare

Expiration Date

Until further notice

Requirements

PhD or comparable level of experience in Computer Science, Math, Physics, Engineering or a related field
4-10+ year industry experience building solutions in commercial SaaS, including at least 4 years working in applications of NLP, Search or AI/ML technologies for healthcare
Strong interest in applying AI/ML/NLP to healthcare related problems and data
Expert-level practical, hands-on experience developing and applying a wide range of techniques in Natural Language Processing, including fine tuning of LLMs and other Transformer models, plus one or more additional AI/ML or Search related areas of expertise to solve real-world problems at scale
Demonstrated ability to lead and perform research and experimentation to select appropriate approaches, algorithms, evaluation methods, and frameworks, as well as tasks such as feature selection, language modeling, evaluation and fine tuning or training models, applying standard approaches or developing new tools or workflows as needed to meet project requirements
Significant experience building and deploying AI/machine learning and NLP models for large-scale SaaS products, including familiarity with industry standard software development concepts such as scaling issues, version control, CI/CD pipelines, and security
Solid understanding and experience with transformer models and multiple kinds of NLP and ML models and approaches including logistic regression, random forest, ensemble methods, SVM, KNN, reinforcement learning, and other ML techniques
Proficiency in Python and Java required. Proficiency in JavaScript or TypeScript and modern UI frameworks for building prototype or tool front ends desired
Proficiency doing data engineering for ML and NLP applications, including exposure to database systems and proficiency with SQL
Proficiency building models from big data using modern packages, models and data analysis stacks such as NumPy, SciPy, Pandas, Scikit-learn, PyTorch, Keras, LightGBM, fastText, NLTK, and spaCy. Proficiency fine tuning Hugging Face Transformers required

Job Responsibility

You will be applying NLP including GenAI and other AI/ML techniques to develop model systems and solutions, collaborating across functions to scale and integrate advanced solutions into successful end user experiences in large-scale cloud based SaaS production environments for healthcare
You will be working with product leaders, clinical informaticists, data scientists, UI/UX researchers and designers, other AI and machine learning and domain experts, engineering teams and others, including work with customers and users who are healthcare professionals
Design, build and evaluate solutions that may involve structured or unstructured data including speech or natural language for healthcare use cases, delivering capabilities such as summarization, predictive models, recommenders, semantic search, extraction, classification or other NLP, AI or machine learning based techniques
You will be performing research and experimentation to select appropriate approaches, algorithms, evaluation methods and frameworks and doing the R&D to deliver model systems
You will perform, oversee and assist in data collection, data cleaning, data analysis, algorithm selection or design, prompt tuning, parameter fine tuning, training, development and evaluation of systems that deliver responsible AI solutions at scale, using existing or developing new tools or workflows as needed
As a principal applied researcher, you will bring deep technical expertise and also provide mentorship on advanced AI, NLP, data science, statistical and machine learning methods and technologies, helping the organization develop new capabilities for innovative solutions
You will have substantial independence and responsibility from day one

What we offer

Benefits starting from Day 1
Retirement Plan Matching
Flexible Paid Time Off
Wellness Support Programs and Resources
Parental & Caregiver Leaves
Fertility & Adoption Support
Continuous Development Support Program
Employee Assistance Program
Allyship and Inclusion Communities
Employee Recognition … and more

Fulltime

Principal Applied Researcher AI/NLP

At PointClickCare our mission is simple: to help providers deliver exceptional c...

Location

Canada , Mississauga

Salary:

176000.00 - 195000.00 CAD / Year

PointClickCare

Expiration Date

Until further notice

Requirements

PhD or comparable level of experience in Computer Science, Math, Physics, Engineering or a related field
4-10+ year industry experience building solutions in commercial SaaS, including at least 4 years working in applications of NLP, Search or AI/ML technologies for healthcare
Strong interest in applying AI/ML/NLP to healthcare related problems and data
Expert-level practical, hands-on experience developing and applying a wide range of techniques in Natural Language Processing, including fine tuning of LLMs and other Transformer models, plus one or more additional AI/ML or Search related areas of expertise to solve real-world problems at scale
Demonstrated ability to lead and perform research and experimentation to select appropriate approaches, algorithms, evaluation methods, and frameworks, as well as tasks such as feature selection, language modeling, evaluation and fine tuning or training models, applying standard approaches or developing new tools or workflows as needed to meet project requirements
Significant experience building and deploying AI/machine learning and NLP models for large-scale SaaS products, including familiarity with industry standard software development concepts such as scaling issues, version control, CI/CD pipelines, and security
Solid understanding and experience with transformer models and multiple kinds of NLP and ML models and approaches including logistic regression, random forest, ensemble methods, SVM, KNN, reinforcement learning, and other ML techniques
Proficiency in Python and Java required
Proficiency in JavaScript or TypeScript and modern UI frameworks for building prototype or tool front ends desired
Proficiency doing data engineering for ML and NLP applications, including exposure to database systems and proficiency with SQL

Job Responsibility

Applying NLP including GenAI and other AI/ML techniques to develop model systems and solutions, collaborating across functions to scale and integrate advanced solutions into successful end user experiences in large-scale cloud based SaaS production environments for healthcare
Working with product leaders, clinical informaticists, data scientists, UI/UX researchers and designers, other AI and machine learning and domain experts, engineering teams and others, including work with customers and users who are healthcare professionals
Design, build and evaluate solutions that may involve structured or unstructured data including speech or natural language for healthcare use cases, delivering capabilities such as summarization, predictive models, recommenders, semantic search, extraction, classification or other NLP, AI or machine learning based techniques
Performing research and experimentation to select appropriate approaches, algorithms, evaluation methods and frameworks and doing the R&D to deliver model systems
Perform, oversee and assist in data collection, data cleaning, data analysis, algorithm selection or design, prompt tuning, parameter fine tuning, training, development and evaluation of systems that deliver responsible AI solutions at scale, using existing or developing new tools or workflows as needed
As a principal applied researcher, you will bring deep technical expertise and also provide mentorship on advanced AI, NLP, data science, statistical and machine learning methods and technologies, helping the organization develop new capabilities for innovative solutions
You will have substantial independence and responsibility from day one

What we offer

Benefits starting from Day 1
Retirement Plan Matching
Flexible Paid Time Off
Wellness Support Programs and Resources
Parental & Caregiver Leaves
Fertility & Adoption Support
Continuous Development Support Program
Employee Assistance Program
Allyship and Inclusion Communities
Employee Recognition

Fulltime

Principal Software Engineer - AI Ads

Microsoft AI is looking for a Principal Software Engineer - AI Ads, to shape the...

Location

United States , Mountain View, CA or Redmond, WA

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
4+ years of industrial experience building large scale systems and supporting AI models
2+ years of experience with deep learning frameworks (e.g., PyTorch, TensorFlow), LLMs/SLMs, and AI Agents
2+ years of experience with cloud services, large-scale big data platforms, and streaming/real-time frameworks (e.g., Kafka, Flink, Spark Streaming), and AI infrastructure development
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Job Responsibility

Lead the design, development, and optimization of large-scale shopping ads infrastructure and algorithms
Build and maintain the universal product graph spanning billions of products across multiple languages
Develop scalable systems for data ingestion, storage, retrieval, and real-time serving at global scale
Apply machine learning (ML), nature language processing (NLP), and deep learning (DL) models to improve ad relevance, personalization, and selection
Collaborate with scientists and engineers across Microsoft AI to translate research into production systems
Drive innovation by identifying technical opportunities that align with Microsoft’s Commerce Strategy
Mentor and guide engineers, fostering technical competence and collaboration across the team

Fulltime

Principal Product Manager - DevOps AI - CoreAI

Microsoft’s mission is to empower every person and every organization to achieve...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree AND 8+ years experience in product/service/program management or software development OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
10+ years of product management experience building and shipping platform, infrastructure, or developer tooling products
Proven experience working across multiple teams and systems to deliver outcomes in complex, highly matrixed organizations
Ability to define and drive ambiguous problem spaces, turning strategy and research into concrete, actionable product investments
Experience with AI assisted developer workflows, automation, or intelligent systems applied to software engineering
Understanding of developer workflows and DevOps systems, including build, test, release, and operational feedback loops
Excellent communication and stakeholder management skills, with the ability to align diverse partners around shared goals, tradeoffs, and success metrics
Familiarity with enterprise requirements around security, compliance, privacy, and reliability, especially in largescale engineering environments

Job Responsibility

Drive crosscutting platform investments across the GitHub / Azure DevOps / 1ES ecosystem, with a focus on AI assisted developer productivity across the full engineering lifecycle (inner loop,CI/CD ,operations, governance)
Identify high leverage opportunities where AI can meaningfully reduce friction and toil for developers while improving quality, reliability, and consistency of outcomes
Define clear problem statements, product bets, and success metrics that balance speed of iteration with trust, safety, and operational requirements
Operate in a startup mode, moving quickly from hypothesis to MVP to scaled rollout through rapid experimentation and iteration
Partner closely with engineering, security, compliance, privacy, and AI platform teams to ensure solutions are production ready and scalable, not experimental or one off
Drive execution across multiple systems and teams in a highly matrixed environment, influencing roadmaps and priorities without direct authority
Ensure AI assisted workflows are designed end-to-end, with clear ownership, feedback loops, and failure modes—not isolated point solutions
Use data, developer feedback, and operational signals to evaluate impact and continuously improve platform investments
Act as a thought partner to engineering and leadership teams on how AI should be applied responsibly within Microsoft’s engineering system

Fulltime

Principal Software Engineers - Applied AI for Microsoft Threat Protection

The Microsoft Security Organization is building the next generation of security ...

Location

United States , Redmond

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, Python, C#, Go, or Java OR equivalent experience
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
6+ years of experience designing, building and operating scalable ML systems, including ML infrastructure and pipelines (Azure ML, Kubernetes), model versioning, observability, and secure deployment, with hands‑on experience in MLOps/AIOps/SecDevOps practices
6+ years of experience building secure, reliable software systems, with applied knowledge of authentication, data protection, access control, and secure coding practices
6+ years of experience designing, building, and operating distributed or cloud‑scale systems (Azure, AWS, or GCP), including production ownership, CI/CD integration, and operating highly available services
2+ years of experience designing and building applications with LLM orchestration frameworks (e.g., LangChain, AutoGen), including agent‑based workflows, RAG pipelines, prompt engineering, and model fine‑tuning/evaluation

Job Responsibility

Design and evolve AI‑driven security systems leveraging large language models, multimodal models, and frontier capabilities to address complex security challenges
Develop contextual knowledge systems, including security graphs, semantic representations, memory frameworks, and high‑quality reasoning over security data
Collaborate across disciplines with Security Engineers, domain experts, and Product Managers to define inclusive, AI‑native security experiences
Partner with AI Infrastructure and Platform teams, Research, and Model Engineering groups to translate security workflows into AI‑optimized architectures
Enable automation, augmentation, and responsible autonomy to drive measurable functional improvements across security solutions
Prototype, validate, and deploy solutions in live production environments while upholding Microsoft standards for security, reliability, privacy, and trust
Shape technical direction for AI Security by influencing architecture, tooling, engineering practices, and shared best practices
Lead cross‑team initiatives spanning security products, platforms, and business units through collaboration and shared ownership
Mentor and sponsor engineers at multiple levels, fostering inclusive technical dialogue, sound engineering judgment, and continuous growth
Contribute to a culture of learning, accountability, and impact across the broader engineering and security community

Fulltime

Select Country

Principal Researcher - Cloud and AI Infrastructure

Job Description

Job Responsibility

Requirements

Looking for more opportunities?