CrawlJobs Logo

Member of Technical Staff - Post Training, Reinforcement Learning

United States, San Francisco · Job Posted February 21, 2026
Apply Position
Job Link Share

Job Description

At Liquid, we’re not just building AI models—we’re redefining the architecture of intelligence itself. Spun out of MIT, our mission is to build efficient AI systems at every scale. Our Liquid Foundation Models (LFMs) operate where others can’t: on-device, at the edge, under real-time constraints. We’re not iterating on old ideas—we’re architecting what comes next. We believe great talent powers great technology. The Liquid team is a community of world-class engineers, researchers, and builders creating the next generation of AI. Whether you're helping shape model architectures, scaling our dev platforms, or enabling enterprise deployments—your work will directly shape the frontier of intelligent systems.

Job Responsibility

  • Profile, optimize, and scale RL training runs to reduce iteration time
  • Integrate new optimization techniques as they emerge from the research community
  • Design and implement tools and environments that test the boundaries of model capabilities
  • Turn proof-of-concept ideas into robust training pipelines and best-in-class models

Requirements

  • Strong Python and PyTorch proficiency, with hands-on experience optimizing training pipelines
  • Hands-on experience with reinforcement learning and the ability to translate optimization techniques from theory into practical implementations
  • Track record of integrating research ideas into robust, maintainable code
  • Experience with frameworks like DeepSpeed, FSDP, or vLLM for efficient model training and inference
  • Experience working with data pipelines, including curation, validation, and analysis to support post-training objectives
  • Contributions to open-source machine learning projects
  • M.S. or Ph.D. in Computer Science, Electrical Engineering, Mathematics, or a related field

What we offer

  • The opportunity to work directly on state-of-the-art AI systems at one of the most advanced AI companies in the world
  • A fast-paced, collaborative environment where your work has direct impact on model performance and product capability
  • The satisfaction of knowing your craftsmanship helps define the next frontier in AI

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Member of Technical Staff - Post Training, Reinforcement Learning

8 matching positions

Member of Technical Staff - Post Training, Applied

This is a rare chance to sit at the intersection of frontier foundation models a...
Location
Location
United States , San Francisco; Boston
Salary
Salary:
Not provided
liquid.ai Logo
Liquid AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with data generation and evaluation for LLM post-training
  • Experience training or fine-tuning models using SFT, preference alignment, and/or RL
  • Strong intuition for data quality and evaluation design
  • Familiarity with alignment or RL techniques beyond basic supervised fine-tuning
Job Responsibility
Job Responsibility
  • Act as the technical owner for enterprise customer post-training engagements
  • Translate customer requirements into concrete post-training specifications and workflows
  • Design and execute data generation, filtering, and quality assessment processes
  • Run supervised fine-tuning, preference alignment, and reinforcement learning workflows
  • Design task-specific evaluations, interpret results, and feed learnings back into core post-training pipelines
What we offer
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime
Read More
Arrow Right

Member of Technical Staff

The Microsoft AI Superintelligence (MAIST) Post Training team is dedicated to ad...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate OR equivalent experience
  • Significant experience in large-scale model training, data curation, and hands-on coding, ideally from leading research labs
  • Deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Ability to develop LLMs, SLMs, multimodal, and coding models using both proprietary and open-source frameworks
  • Self-driven, able to write efficient code and debug training jobs, document findings, and demonstrate a track record in these fields
  • Curious, adaptable problem-solver who thrives on continuous learning, embraces changing priorities, and is motivated by creating meaningful impact
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...
Location
Location
United States , Redmond
Salary
Salary:
84200.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree (complete or in progress) in relevant field AND 3+ months related research internship experience OR Master's Degree in relevant field OR equivalent experience
  • Software engineering skills with fluency in Python and modern data libraries
  • The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate in relevant field OR equivalent experience
  • Software engineering skills with fluency in Python and modern data libraries
  • The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff

The Microsoft AI Superintelligence (MAIST) Post Training team is dedicated to ad...
Location
Location
United States , Redmond
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in relevant field AND 1+ year(s) related research experience OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Multimodal Infrastructure

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructu...
Location
Location
United States , Mountain View
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience in multi-modal data processing: Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations
  • Experience with state-of-art model inference and serving frameworks
  • Experience with image/video/audio data processing
  • Experience with common data formats for efficient I/O
  • Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed
  • Knowledge of auto-regressive and diffusion transformer models
  • Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
  • Proven experiences in at least one of the following areas: image/video generation and editing
  • efficient architectures (e.g., MoE, window attention)
Job Responsibility
Job Responsibility
  • Design, develop and maintain large-scale multimodal data processing pipelines
  • Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
  • Design, develop and maintain large-scale multimodal model inference and serving frameworks
  • Work with research scientists and product engineers to solve infra-related problems
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Resident Technical Officer (M&E)

This is an excellent opportunity, for an experienced M&E RTO to represent Cundal...
Location
Location
Singapore , Singapore
Salary
Salary:
Not provided
cundall.com Logo
Cundall
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Educated to diploma or degree level in Mechanical/Electrical Engineering or Building Services Engineering
  • Significant experience of both contractor and consultancy liaison
  • Practical experience of site based mechanical engineering and issue management
  • Experience in carrying out work inspection, making detailed site records and producing reports
  • Capable of producing and interpreting technical design solutions, specifications and safety documentation
  • Solid understanding of building regulations and codes of practice
  • Excellent time management skills
  • Ability to solve complex problems and meet project deadlines
  • Must hold a valid BCA Registered Accredited Resident Technical Officer certification
  • Excellent communication skills
Job Responsibility
Job Responsibility
  • Represent Cundall on one of our prestigious projects
  • Responsible for the maintenance and control of daily site records, report preparation, drawings and documents
  • Ensure that all site instructions are carried out accordingly
  • Issue further instructions and clarifications on design details when necessary
  • Manage the day to day inspections of construction works
  • Ensure all M&E construction activities comply with the consultant's specifications, local codes and regulatory requirements
  • Supervise and enforce site quality control
  • Coordinate with the client/main contractor/sub-contractor
  • Attend site meetings as required
  • Fulltime
Read More
Arrow Right
New

Assistant Retail Store Manager | Jacqui E

Jacqui E at Brisbane DFO is looking for an Assistant Store Manager to support th...
Location
Location
Australia , Brisbane
Salary
Salary:
Not provided
justgroup.com.au Logo
Just Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Previous leadership experience in a retail environment and familiarity with KPIs and driving sales
  • Confidence in providing feedback to team members
  • A strong passion for delivering exceptional customer service
  • Knowledge of loss prevention and health & safety practices
  • Visual Merchandising experience is a plus
Job Responsibility
Job Responsibility
  • Support the Store Manager in achieving sales targets and delivering personalised customer experiences
  • Coach and develop the team, providing feedback to enhance performance
  • Manage daily operations, including sales briefing, stock control, visual merchandising, and store presentation
What we offer
What we offer
  • Hourly rate with penalty rates for evening, weekends and public holidays shifts
  • Up to 70% off Jacqui E products
  • KPI and sales incentives
  • Exclusive perks via the Just Us Portal, such as gym membership discounts
  • Opportunity to take part in internal development workshops and programs to further your career in retail
  • A structured 3-month Assistant Store Manager training plan
  • Access to leadership and recruitment workshops for career development
  • Flexible rosters to support a healthy work/life balance
  • Employee Assistance Program for wellbeing and mental health support
  • Fulltime
Read More
Arrow Right