CrawlJobs Logo

Member of Technical Staff, Reinforcement Learning Systems

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Mountain View

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

139900.00 - 274800.00 USD / Year

Job Description:

Microsoft AI is looking for a Member of Technical Staff – Reinforcement Learning Systems to help build the world’s most advanced reinforcement learning systems. We are responsible for designing, developing, and operating the large-scale reinforcement learning systems that power several use cases across the Superintelligence team – from training trustworthy and capable agents and powerful reasoning models to helpful and conversational assistants.

Job Responsibility:

  • Develop and tune the pretraining scalable software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
  • Benchmark GB200 and AMD MIxxx GPU clusters
  • Gather data and insights to develop the pretraining compute roadmap
  • Care deeply about conversational AI and its deployment
  • Actively contribute to the development of AI models that are powering our innovative products
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values

Requirements:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Excel in programming (especially parallel/concurrent and distributed), software engineering, and system design
  • Have experience in large-scale systems, preferably having built some components from scratch
  • Thrive in a highly collaborative, fast-paced environment
  • Have a high degree of craftsmanship and pay close attention to details
  • Effectively manage multiple responsibilities and can adjust to shifting priorities

Nice to have:

  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience with generative AI
  • Experience with distributed computing
  • Experience in leading technical projects and supporting architectural decisions with data
  • A background in machine learning
  • Backgrounds in mathematics, competitive programming, and related domains

Additional Information:

Job Posted:
April 01, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:
PREMIUM
More languages and countries
Unlock 29494 hidden job offers
Languages
English Čeština Deutsch Ελληνικά Español Français +15
Countries
United States United Kingdom India Canada Australia +
See plans
Plans from $2.99 / month

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of Technical Staff, Reinforcement Learning Systems

Member of Technical Staff (RL systems)

Microsoft AI is looking for a Member of Technical Staff – Reinforcement Learning...
Location
Location
Switzerland , Zürich
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Master’s Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor’s Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Experience with generative AI
  • Experience with distributed computing
  • Experience in leading technical projects and supporting architectural decisions with data
Job Responsibility
Job Responsibility
  • Develop and tune the pretraining scalable software for Nvidia GB200 72NVL CX8 and AMD MIxxx architectures
  • Benchmark GB200 and AMD MIxxx GPU clusters
  • Gather data and insights to develop the pretraining compute roadmap
  • Care deeply about conversational AI and its deployment
  • Actively contribute to the development of AI models that are powering our innovative products
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Distinguished Technologist, Deep Learning

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Oversee build of OpsRamp’s CoPilot for Autonomous Operations for the Hybrid Cloud
  • Understand latest in GenAI/ML for ITOM
  • Understand cloud-native architecture concepts and have knowledge of best practices for high availability, scalability, resilience, performance, and security requirements in the cloud
  • Act as a cross-functional product and technical expert for GenAI within engineering with close working relationships with customers, product management, support, and marketing supporting edge-to-cloud services offering
  • Provides consultation, design input, and feedback for product development and design reviews across multiple organizations and architectures
  • Help transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Creates technical content such as designs, specifications, and initial software implementations
  • Guides and mentors less-experienced staff members to set an example of software systems design and development innovation and excellence, helping to grow engineers into more senior technical roles
  • Collect product feedback from field interactions to provide input into Engineering and Product Management to influence product roadmap direction
  • Maintain a high level of knowledge of OpsRamp SaaS product and product road maps, as well as that of the competition and prospective strategic partners
What we offer
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Applied Scientist

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
  • 3+ years experience creating publications (e.g., patents, libraries, peer-reviewed academic papers).
  • Experience presenting at conferences or other events in the outside research/industry community as an invited speaker.
  • 3+ years experience conducting research as part of a research program (in academic or industry settings).
  • 1+ year(s) experience developing and deploying live production systems, as part of a product team.
  • 1+ year(s) experience developing and deploying products or systems at multiple points in the product cycle from ideation to shipping.
Job Responsibility
Job Responsibility
  • Develop and refine data pipelines and infrastructure to support AI model development for Copilot.
  • Collaborate with research teams to integrate cutting-edge AI advancements into production systems.
  • Design, train, and evaluate machine learning models, ensuring performance optimization and scalability.
  • Work closely with engineering and product teams to ensure AI-driven experiences meet quality and user experience standards.
  • Conduct rigorous data analysis and experimentation, leveraging insights to improve Copilot’s intelligence.
  • Overcome obstacles to deliver iterative improvements in AI performance and responsiveness.
  • Stay ahead of the latest innovations in deep learning, reinforcement learning, and generative AI.
Read More
Arrow Right

Member of Technical Staff, Applied Scientist - Windows Copilot

The Windows Copilot team is at the forefront of redefining how AI enhances every...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics predictive analytics, research) OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
  • Proven track record of deploying machine learning models in large-scale production environments.
  • 9+ years of experience building data pipelines, training deep learning models, and optimizing AI workflows.
  • 4+ years of experience building data pipelines, training deep learning models, and optimizing AI workflows.
Job Responsibility
Job Responsibility
  • Develop and refine data pipelines and infrastructure to support AI model development for Copilot.
  • Collaborate with research teams to integrate cutting-edge AI advancements into production systems.
  • Design, train, and evaluate machine learning models, ensuring performance optimization and scalability.
  • Work closely with engineering and product teams to ensure AI-driven experiences meet quality and user experience standards.
  • Conduct rigorous data analysis and experimentation, leveraging insights to improve Copilot’s intelligence.
  • Overcome obstacles to deliver iterative improvements in AI performance and responsiveness.
  • Stay ahead of the latest innovations in deep learning, reinforcement learning, and generative AI.
  • Embody our Culture and Values.
  • Fulltime
Read More
Arrow Right

Distinguished Technologist, Cloud Development (AI/ML)

Joining our HPE Hybrid Cloud team and working as part of our OpsRamp team is a c...
Location
Location
United States , San Jose
Salary
Salary:
164500.00 - 398500.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years of relevant experience in the industry delivering technical and business strategy at an advanced/strategist level
  • Master's, or PhD degree in Computer Science, Information Systems, Engineering, or equivalent
  • At least 4 years of hands-on expertise in defining, building, training and / or optimizing foundational deep learning models at scale in PyTorch, HF and other ML frameworks and libraries
  • Experience and/or deep understanding in various deep learning architectures like CNNs, GNNs, Transformers, Reinforcement Learning etc. is a strong advantage
  • Strong hands-on experience/understanding in pre-training, fine-tuning, distilling, aligning open-source large language models and have them complement the in-house foundational models
  • Hands-on experience developing multi-agent applications around a mixture of in-house and open-source models while leveraging latest in RAG and Prompt Engineering tooling techniques
  • Strong customer focus and obsession with improving service availability/performance and user experience/consumption using measurable SRE metrics
  • Must have a track record of working alongside other engineering teams architecting, building, and deploying mission-critical, highly distributed, large-scale SaaS applications
  • Must have strong knowledge of application failure modes, resiliency patterns, and techniques to enable robust, self-healing architecture
  • Effective technical leadership skills to influence diverse groups to move toward common goals/strategies
Job Responsibility
Job Responsibility
  • Lead strategy and innovation across OpsRamp’s Intelligent Observability portfolio
  • Champion HPE OpsRamp’s position with HPE customers and GTM partners externally and HPE internal cross-functional stakeholders
  • Drive technical strategy for emerging GenAI trends across Hybrid Observability and AIOps for cloud-scale modern applications
  • Design and introduce new products to the market
  • Provide consultation, design input, and feedback for product development and design reviews
  • Transition proof-of-concept implementations into R&D teams to accelerate new product delivery
  • Guide and mentor less-experienced staff members.
What we offer
What we offer
  • Health and wellbeing benefits
  • Career development programs
  • Diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right

Member of technical staff - Research - Model

H exists to push the boundaries of superintelligence with agentic AI. By automat...
Location
Location
France; United Kingdom , Paris; London
Salary
Salary:
Not provided
hcompany.ai Logo
H Company
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills (Python, Git)
  • Expertise in deep learning frameworks (PyTorch, JAX, TensorFlow)
  • Experience with large-scale distributed training of LLMs and VLMs
  • Hands-on experience with LLM training, alignment, and reinforcement learning
  • Knowledge of multimodal architectures and applications
  • Publications in top-tier AI conferences (e.g., NeurIPS, ICML, CVPR, ACL, ICCV)
  • Advanced degree (PhD or MSc) in a relevant field (e.g., ML, DL, NLP, CV)
  • Excellent communication and presentation skills
  • Strong collaboration and teamwork skills
  • Passion for AI and problem-solving
Job Responsibility
Job Responsibility
  • Develop and train advanced LLMs and VLMs, including multimodal architectures
  • Research and implement training methods for enhanced capabilities like instruction following and tool use
  • Design and optimize data pipelines and training systems for large-scale distributed training
  • Collaborate with cross-functional teams to integrate models into agentic AI systems
  • Evaluate model performance and communicate findings to stakeholders
  • Stay current with advancements in LLMs, VLMs, and related fields
What we offer
What we offer
  • Join the exciting journey of shaping the future of AI, and be part of the early days of one of the hottest AI startups
  • Collaborate with a fun, dynamic and multicultural team, working alongside world-class AI talent in a highly collaborative environment
  • Enjoy a competitive salary
  • Unlock opportunities for professional growth, continuous learning, and career development
  • Fulltime
Read More
Arrow Right

Bim Coordinator

As part of the development of its MENA business, and overall global growth, our ...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
cundall.com Logo
Cundall
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in a relevant Mechanical or equivalent Building services Engineering with proven track record of successfully undertaken BIM responsibilities within MEPF design delivery team with a background in modeling, data management and knowledge of best practices in BIM
  • Proven ability and leadership skills to lead, monitor and support a team of BIM Technicians and BIM Modelers as well as supporting MEPF design team members already in use of the BIM authoring solutions
  • Minimum of 7 years broad experience in design and BIM management on large scale projects. (inclusive but not limited to BIM modeling, Coordination, collaboration, data management, Quality Assurance and Control) on large scale multi-disciplinary projects within the Middle East region
  • Mandatory proficiency on Autodesk Revit and AutoCAD Civil 3D BIM authoring tools
  • Mandatory proficiency on Autodesk Navisworks Manage
  • Mandatory proficiency on BIM360 Environment
  • Excellent English communication and presentation skills
  • Ability to effectively prioritize tasks and solve problems
  • High level of creativity and forward thinking with analytical mindset
  • High Sense of Quality driven deliverables
Job Responsibility
Job Responsibility
  • Coordinating the overall BIM Implementation within the Mechanical/Electrical/Public Health /ELV (MEPF) Discipline team and support the multidisciplinary projects delivery
  • Coordinating the assigned project BIM Engineers / CAD technicians including input to Quality Assurance and Quality Control of models and drawings
  • Assisting in the Implementation of Building Information Management (BIM) systems
  • Assisting in the creation of BIM deliverables, Model Federation, Clash Detection Reports, BIM Quality Control Plans, BIM Design Review Procedures, Construction Sequencing (4D) and Cost Estimating (5D), Asset Management BIM Implementation (COBie), e-Specs implementation
  • Attending meetings with client and third parties and assisting the BIM Leads in presentation and navigation throughout the BIM Models
  • Assisting and supporting in analysis/implementation of BIM execution plans for BIM deliverables production
  • Configuring and maintaining the project common data environment to enable the effective management of engineering design deliverables
  • Assist in planning project BIM deliverables and managing their effective delivery in accordance with the project plan and BIM execution plan
  • Coordinating BIM issues with design teams to develop and reinforce standards with the goal of refining the models and BIM deliverables
  • Assist in leading in BIM implementation, standards, modelling workflow & methodologies during project design lifecycle
Read More
Arrow Right
New

Waiter/Waitress

If you're passionate about fresh food, the highest service standards and thrive ...
Location
Location
United Kingdom , Reading
Salary
Salary:
12.71 GBP / Hour
brunningandprice.co.uk Logo
Brunning & Price
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • You have a natural ability to look after people and make them happy and like us, you’re passionate about food and drink. Previous experience in hospitality as a waitress/waiter, bartender, barista or food and beverage assistant ideally in a pub, restaurant, hotel or bar would be helpful but isn’t essential. More importantly, you’re hands on with a positive attitude. You’ll bring your life experience to help keep a busy shift calm and collected whilst ensuring you look after our customers in the best way possible.
Job Responsibility
Job Responsibility
  • Look after customers in the best way possible
  • keep a busy shift calm and collected
What we offer
What we offer
  • Basic up to £12.71 per hour, plus tronc (card tips paid into your bank)
  • Paid overtime
  • Great cash tips
  • Free meals on shift (choose from the menu)
  • 30% discount for you, your friends and family across B&P and group including wagamama
  • NEST pension
  • Discounts via Perks on Tap
  • £1,000 referral bonus for introducing new Managers or Chefs
  • Stream - flexible pay to choose when to get paid
  • Weekly pay
Read More
Arrow Right