Senior Principal Researcher - Cloud and AI Infrastructure Job at Microsoft Corporation (Vancouver)

Senior Principal Engineer- End-to-End AI Training Framework

As the Senior Principal Engineer, E2E AI Training Framework for Autonomous Drivi...

Location

United States , Sunnyvale

Salary:

240000.00 - 320000.00 USD / Year

Robert Bosch Sp. z o.o.

Expiration Date

Until further notice

Requirements

Master’s degree or Ph.D in Computer Science, Robotics, Electrical Engineering, AI, or a closely related field with a focus on autonomous systems
10+ years of experience in software development and system engineering for autonomous driving or ADAS applications
Proven industry experience in releasing AI-based L2+ systems, with a strong track record of successful product deployments
Deep knowledge of E2E AI stack solutions and training algorithms, including reinforcement learning, and imitation learning, as well as motion control and optimization techniques
Deep knowledge of AI frameworks such as TensorFlow and PyTorch
Deep knowledge in model optimization and embedded deployment of E2E AI stacks to embedded automotive hardware
Deep knowledge of cloud-based scalable training pipelines, MLOps, and CICD for training AI models with large-scale fleet datasets
Proven track record of leading the end-to-end development and successful deployment of complex AI-powered systems into production environments at scale

Job Responsibility

Define and drive execution of the technical roadmap and strategy for the E2E AI machinery, including training pipelines, optimization techniques, simulation and MLOps tooling
Oversee the design, development, and testing of the E2E AI machinery and its interaction with data sources, model repositories, and development targets
Collaborate closely with other functional tech leads (e.g. data engineering, infrastructure) to define and drive the overall architecture of the AI machinery ecosystem
Guide the set-up of a development framework that enables fast evaluation and integration of emerging E2E AI solutions
Guide the transition from research prototypes to production-ready solutions, ensuring performance optimization on automotive-grade hardware and scalability
Leverage your prior industry experience in launching AI-based L2+ systems to implement best practices in system validation, testing (SIL/HIL), and continuous improvement
Mentor and lead a high-caliber team of AI scientists and engineers, fostering a culture of innovation, collaboration, and technical excellence

What we offer

health, dental, and vision plans
health savings accounts (HSA)
flexible spending accounts
401(K) retirement plan with an attractive employer match
wellness programs
life insurance
long term disability insurance
paid time off
parental leave

Fulltime

Senior / Principal Quantum Error Correction Engineer

Microsoft Quantum has assembled a talented and diverse international team to cre...

Location

Denmark , Copenhagen, Kongens Lyngby

Salary:

715600.00 - 1201100.00 DKK / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR Master's Degree in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND proven software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR Bachelor's Degree in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND demonstrated software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR equivalent experience
Formal experience in quantum error correction and quantum fault-tolerance research and development environment
Hands-on experience with modeling and analyzing circuit-level noise in quantum circuits
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check
Citizenship & Citizenship Verification for export control
Ability to apply AI to accelerate engineering while developing shipping & prototype code

Job Responsibility

Design and develop infrastructure to evaluate fault-tolerance strategies for quantum computing systems
Advance the implementation of quantum error correction codes
Empower research and experimentation aimed at building scalable, resilient quantum computers
Engage in creative problem-solving and cross-functional collaboration
Foster a culture of collaboration, creativity, and technical excellence

What we offer

Benefits may include certain compensation and other benefits
Find additional benefits and pay information at provided link

Fulltime

Senior / Principal Quantum Error Correction Engineer

Microsoft Quantum has assembled a talented and diverse international team to cre...

Location

Denmark , Copenhagen

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR Master's Degree in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND proven software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR Bachelor's Degree in Computer Science, Software Engineering, Mathematics, Physics, Physical Sciences, or related field AND demonstrated software industry experience, including developing commercial software, compilers, scientific computing applications, or multi-component systems
OR equivalent experience
Formal experience in quantum error correction and quantum fault-tolerance research and development environment
Hands-on experience with modeling and analyzing circuit-level noise in quantum circuits
Ability to meet Microsoft, customer and/or government security screening requirements
Microsoft Cloud Background Check
Citizenship & Citizenship Verification
Ability to apply AI to accelerate engineering while developing shipping & prototype code

Job Responsibility

Design and develop infrastructure to evaluate fault-tolerance strategies for quantum computing systems, working in close collaboration with a multidisciplinary team of theorists and experimentalists
Advance the implementation of quantum error correction codes, contributing to the development of both logical and physical qubit architectures
Empower research and experimentation aimed at building scalable, resilient quantum computers capable of delivering practical value
Engage in creative problem-solving and cross-functional collaboration to overcome technical challenges in quantum system design
Foster a culture of collaboration, creativity, and technical excellence

Fulltime

Senior Principal Engineering Manager

Microsoft Research (MSR) is working to transform the future of artificial intell...

Location

United States , Redmond

Salary:

163000.00 - 296400.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
5+ years of people management experience leading software engineering teams, including managing principal engineers
Experience building or operating infrastructure for large-scale distributed systems, cloud platforms, or artificial intelligence (AI)/machine learning(ML) workloads
Track record of driving execution on complex, multi-workstream infrastructure projects with clear milestones and accountability
Technical fluency in one or more of: large-scale compute clusters, GPU infrastructure, scheduling and orchestration (Kubernetes, Volcano), or High-Performance Compute (HPC) environments
Experience with GPU programming (CUDA, NCCL) and frameworks such as PyTorch
Expertise in networking (InfiniBand, NVLink), storage systems, or distributed training parallelisms
A track record of strong cross-functional partnerships, including the ability to align on strategic direction, deliver joint accountabilities, and develop relationships with staff members with widely varied expertise
Experience scaling engineering teams through significant growth phases (hiring, onboarding, and integrating new engineers into a high-performing team)
Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience

Job Responsibility

Lead, mentor, and grow the engineering team that builds MSR’s AI research infrastructure
Recruit and develop exceptional engineering talent, building a diverse team - including hiring, onboarding, career development, and performance management
Drive execution across the team by setting clear goals, tracking milestones, managing dependencies, and ensuring accountability for delivering complex infrastructure projects on time and at high quality
Lead team culture and process changes, cultivating an AI-first mentality that accelerates our progress through agentic coding, automation, and skills development
Provide technical vision and judgment on the team's architecture, strategy, and roadmap — spanning supercomputer GPU clusters, high performance networking, workload optimization, researcher tools, and agentic workflows — while empowering engineers to own deep technical details
Collaborate closely cross-discipline with engineers, program managers, and research and science teams to align priorities, resolve dependencies, and build better solutions together
Foster a team culture of operational excellence, continuous improvement, and high psychological safety where engineers are empowered to take ownership and innovate

Fulltime

Senior Principal, Machine Learning & Artificial Intelligence

Xometry is seeking a Senior Principal, Machine Learning & Artificial Intelligenc...

Location

United States , Waltham

Salary:

150000.00 - 196000.00 USD / Year

Cherry Ventures

Expiration Date

Until further notice

Requirements

Master’s or PhD in Computer Science, Machine Learning, Applied Mathematics, Electrical Engineering or related field (PhD preferred for deep generative/3D modeling emphasis)
12+ years of professional experience in machine learning, artificial intelligence, or data science roles — with several years in senior or principal capacity leading major programs
Demonstrated experience architecting and delivering large scale ML/AI solutions - end-to-end from data ingestion, feature engineering, model training, evaluation, deployment, monitoring & operations
Deep expertise in machine learning frameworks (TensorFlow, PyTorch), data engineering, model infrastructure, MLOps, cloud platforms (AWS, GCP, Azure), and scalable production systems
Experience in 3D modeling / geometry / computer vision / generative models (e.g., point-cloud processing, mesh processing, text23D, image23D, CAD/CAM integration) is highly desirable
Strong exposure to generative AI techniques (large language models, multimodal models, diffusion, GANs) and translating them into business use-cases
Excellent cross-functional collaboration skills: you can partner with product, engineering, ops, manufacturing, design, business leadership and translate technical concepts into business language
Proven ability to influence without direct authority and drive change across organizations
Strong communication and presentation skills
you can articulate technical vision, roadmap, trade-offs and outcomes to senior leadership

Job Responsibility

Serve as the technical leader of multiple large, cross-functional ML/AI solutions with significant, lasting impact across Xometry’s business
Define, and drive the 18-24-month ML/AI technical roadmap - balancing breakthrough innovation (e.g., generative 3D, foundation models, large-scale vision/3D pipelines) with reliable business value delivery (e.g., quoting accuracy, lead-time reduction, defect detection, cost optimization)
Influence partner roadmaps across engineering, product, operations, and business teams: align priorities, advise on resourcing, champion ML/AI best practices
Proactively identify and remove roadblocks for teams and projects — whether technical, operational, data-related, or resource constraints
Mentorship of individuals and technical teams
Act as a trusted SME with strong cross-functional partnerships: your insights and guidance will shape ML/AI infrastructure, data, model, infrastructure, and tooling decisions
Play a leadership role in identifying areas of opportunity — e.g., using ML/AI to unlock new revenue streams (e.g., rapid quoting for new manufacturing modalities, generative design for customers), reduce cost (e.g., automated quality inspection), or optimize efficiency (e.g., 3D-geometry classification, defect detection, generating manufacturing ready models)
Address problems adjacent to your sphere of immediate influence: proactively tackle challenges outside direct scope and champion holistic solutions
Stay ahead of industry developments in ML, AI, generative AI, 2D/3D modeling and manufacturing tech
translate insights into the improvement of internal best practices, tooling, frameworks, model governance, data pipelines, and operationalization

What we offer

401(k) match
medical, dental and vision insurance
life and disability insurance
generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave
EAP, other wellbeing resources

Fulltime

Senior Principal, Machine Learning & Artificial Intelligence

Xometry is seeking a Senior Principal, Machine Learning & Artificial Intelligenc...

Location

United States , North Bethesda

Salary:

150000.00 - 196000.00 USD / Year

Cherry Ventures

Expiration Date

Until further notice

Requirements

Master’s or PhD in Computer Science, Machine Learning, Applied Mathematics, Electrical Engineering or related field (PhD preferred for deep generative/3D modeling emphasis)
12+ years of professional experience in machine learning, artificial intelligence, or data science roles — with several years in senior or principal capacity leading major programs
Demonstrated experience architecting and delivering large scale ML/AI solutions - end-to-end from data ingestion, feature engineering, model training, evaluation, deployment, monitoring & operations
Deep expertise in machine learning frameworks (TensorFlow, PyTorch), data engineering, model infrastructure, MLOps, cloud platforms (AWS, GCP, Azure), and scalable production systems
Strong exposure to generative AI techniques (large language models, multimodal models, diffusion, GANs) and translating them into business use-cases
Excellent cross-functional collaboration skills: you can partner with product, engineering, ops, manufacturing, design, business leadership and translate technical concepts into business language
Proven ability to influence without direct authority and drive change across organizations
Strong communication and presentation skills
you can articulate technical vision, roadmap, trade-offs and outcomes to senior leadership
Track record of identifying and delivering measurable business impact via ML/AI - e.g., revenue growth, cost savings, improved efficiency

Job Responsibility

Serve as the technical leader of multiple large, cross-functional ML/AI solutions with significant, lasting impact across Xometry’s business
Define, and drive the 18-24-month ML/AI technical roadmap - balancing breakthrough innovation (e.g., generative 3D, foundation models, large-scale vision/3D pipelines) with reliable business value delivery (e.g., quoting accuracy, lead-time reduction, defect detection, cost optimization)
Influence partner roadmaps across engineering, product, operations, and business teams: align priorities, advise on resourcing, champion ML/AI best practices
Proactively identify and remove roadblocks for teams and projects — whether technical, operational, data-related, or resource constraints
Mentorship of individuals and technical teams
Act as a trusted SME with strong cross-functional partnerships: your insights and guidance will shape ML/AI infrastructure, data, model, infrastructure, and tooling decisions
Play a leadership role in identifying areas of opportunity — e.g., using ML/AI to unlock new revenue streams (e.g., rapid quoting for new manufacturing modalities, generative design for customers), reduce cost (e.g., automated quality inspection), or optimize efficiency (e.g., 3D-geometry classification, defect detection, generating manufacturing ready models)
Address problems adjacent to your sphere of immediate influence: proactively tackle challenges outside direct scope and champion holistic solutions
Stay ahead of industry developments in ML, AI, generative AI, 2D/3D modeling and manufacturing tech
translate insights into the improvement of internal best practices, tooling, frameworks, model governance, data pipelines, and operationalization

What we offer

annual bonus
401(k) match
medical, dental and vision insurance
life and disability insurance
generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave
EAP, other wellbeing resources

Fulltime

New

Principal Machine Learning Engineer - Forecasting

We are seeking a Principal Machine Learning Engineer to join the Forecasting tea...

Location

India , Hyderabad

Salary:

Not provided

Amgen

Expiration Date

Until further notice

Requirements

Degree and 12+ years of experience in machine learning engineering, software engineering, data science engineering, or a related quantitative discipline.
10+ years of professional experience building, deploying, and operating production ML, AI, data, or software systems, including significant experience as a technical lead on complex, cross-functional initiatives.
Demonstrated track record of designing or architecting new and existing systems with emphasis on reliability, scale, security, maintainability, and operational excellence.
Deep hands-on experience with the full ML engineering lifecycle, including data pipelines, feature engineering, experimentation, model training, model integration, testing, deployment, monitoring, evaluation, observability, and continuous improvement.
Strong experience deploying forecasting, probabilistic, Bayesian, predictive, NLP, deep learning, or LLM-based systems in production environments.
Experience building or integrating AI systems, including LLM-powered applications, agentic workflows, retrieval or information-retrieval systems, evaluation frameworks, and human-in-the-loop review patterns.
Strong object-oriented programming skills in Python and SQL, with experience using modern ML and software development frameworks such as scikit-learn, PyTorch, TensorFlow/JAX, Spark, Ray, MLflow, Airflow/Prefect/Dagster, FastAPI, or equivalent technologies.
Experience with cloud platforms and distributed systems, including containerization, CI/CD, infrastructure-as-code, model serving, workflow orchestration, batch and streaming data processing, and production support.
Strong software engineering fundamentals, including system design, architecture trade-off analysis, testing strategies, code reviews, source control, build and release processes, performance optimization, and maintainability.
Demonstrated ability to communicate technical strategy, system tradeoffs, and delivery risks to technical and non-technical stakeholders, including senior leaders, product/program owners, scientists, and business partners.

Job Responsibility

Define and drive the technical strategy for enterprise forecasting and AI decision systems, aligning architecture, reusable platforms, and delivery roadmaps to Amgen's planning, supply, commercial, manufacturing, operations, and patient-focused priorities.
Partner with data scientists, product and program leaders, operations, commercial, manufacturing, supply chain, finance, and other business stakeholders to translate ambiguous requirements into shipped software and measurable business outcomes.
Architect, build, and scale production ML, LLM, and agentic AI systems that combine forecasting, predictive analytics, simulation, optimization, and autonomous or semi-autonomous workflow automation.
Productionize advanced statistical, Bayesian, deep learning, and machine learning models, including training, validation, inference, serving, evaluation, lifecycle management, and governed deployment.
Lead development of AI agent components that automate complex forecasting and operational workflows across multiple systems, decision points, datasets, and user groups while preserving appropriate human-in-the-loop review and escalation patterns.
Design secure integrations across enterprise APIs, databases, analytics platforms, workflow systems, cloud services, and AI orchestration patterns to enable multi-system decision support and scalable automation.
Establish robust MLOps and AI engineering capabilities, including model versioning, CI/CD, automated retraining, performance monitoring, observability, drift detection, service-level reliability, rollback strategies, and operational runbooks.
Implement guardrails, model and agent evaluation frameworks, auditability, explainability, responsible AI controls, and human-in-the-loop operating models for production AI systems in high-impact and regulated business contexts.
Research and evaluate state-of-the-art open-source, vendor, and internal tools related to forecasting, LLMs, AI agents, MLOps, model optimization, model serving, and scalable AI infrastructure for potential application to Amgen business problems.
Provide principal-level technical mentorship, design review leadership, and engineering standard-setting across teams, promoting code quality, documentation, reproducibility, testing, security, privacy, maintainability, and operational excellence.

Fulltime

Principal Product Manager - Foundry Inferencing & Training (CoreAI - multiple roles)

Microsoft Foundry sits at the center of Microsoft’s AI strategy, powering how mo...

Location

United States , Redmond

Salary:

139900.00 - 331200.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s Degree and 8+ years of experience in product management, technical program management, software engineering, or related technical fields (or equivalent experience)
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Job Responsibility

Product Strategy & Ownership: Own product strategy and roadmap across AI model training, inference, experimentation, and platform enablement, balancing near-term delivery with long-term scale
Maintain end-to-end accountability from concept through launch, iteration, and measurable impact
Model Lifecycle & Platform Enablement: Drive initiatives across the AI model lifecycle, partnering with engineering and research to bring new capabilities from research into production
Enable internal teams and customers to access, integrate, and adopt models through high-quality platform experiences
Execution, Velocity & Operating Rigor: Lead complex, multi-quarter initiatives with high visibility, managing dependencies, risks, and tradeoffs across teams
Improve execution velocity by reducing friction in planning, experimentation, launches, and iteration cycles
Experimentation, Metrics & Continuous Improvement: Define and track metrics for efficiency, performance, reliability, and adoption, using experimentation and data to drive decisions
Identify opportunities for automation, simplification, and continuous improvement as systems scale
Cross-Functional Leadership & Communication: Act as a connective leader across engineering, data science, research, infrastructure, and go-to-market teams
Influence senior stakeholders through clear decision framing, executive-ready narratives, and data-backed recommendations

Fulltime

Select Country

Senior Principal Researcher - Cloud and AI Infrastructure

Job Description

Job Responsibility

Requirements

Looking for more opportunities?