CrawlJobs Logo

Member of Technical Staff - Post Training, Reinforcement Learning

liquid.ai Logo

Liquid AI

Location Icon

Location:
United States , San Francisco

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

At Liquid, we’re not just building AI models—we’re redefining the architecture of intelligence itself. Spun out of MIT, our mission is to build efficient AI systems at every scale. Our Liquid Foundation Models (LFMs) operate where others can’t: on-device, at the edge, under real-time constraints. We’re not iterating on old ideas—we’re architecting what comes next. We believe great talent powers great technology. The Liquid team is a community of world-class engineers, researchers, and builders creating the next generation of AI. Whether you're helping shape model architectures, scaling our dev platforms, or enabling enterprise deployments—your work will directly shape the frontier of intelligent systems.

Job Responsibility:

  • Profile, optimize, and scale RL training runs to reduce iteration time
  • Integrate new optimization techniques as they emerge from the research community
  • Design and implement tools and environments that test the boundaries of model capabilities
  • Turn proof-of-concept ideas into robust training pipelines and best-in-class models

Requirements:

  • Strong Python and PyTorch proficiency, with hands-on experience optimizing training pipelines
  • Hands-on experience with reinforcement learning and the ability to translate optimization techniques from theory into practical implementations
  • Track record of integrating research ideas into robust, maintainable code
  • Experience with frameworks like DeepSpeed, FSDP, or vLLM for efficient model training and inference
  • Experience working with data pipelines, including curation, validation, and analysis to support post-training objectives
  • Contributions to open-source machine learning projects
  • M.S. or Ph.D. in Computer Science, Electrical Engineering, Mathematics, or a related field
What we offer:
  • The opportunity to work directly on state-of-the-art AI systems at one of the most advanced AI companies in the world
  • A fast-paced, collaborative environment where your work has direct impact on model performance and product capability
  • The satisfaction of knowing your craftsmanship helps define the next frontier in AI

Additional Information:

Job Posted:
February 21, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Member of Technical Staff - Post Training, Reinforcement Learning

Member of Technical Staff - Post Training, Applied

This is a rare chance to sit at the intersection of frontier foundation models a...
Location
Location
United States , San Francisco; Boston
Salary
Salary:
Not provided
liquid.ai Logo
Liquid AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Hands-on experience with data generation and evaluation for LLM post-training
  • Experience training or fine-tuning models using SFT, preference alignment, and/or RL
  • Strong intuition for data quality and evaluation design
  • Familiarity with alignment or RL techniques beyond basic supervised fine-tuning
Job Responsibility
Job Responsibility
  • Act as the technical owner for enterprise customer post-training engagements
  • Translate customer requirements into concrete post-training specifications and workflows
  • Design and execute data generation, filtering, and quality assessment processes
  • Run supervised fine-tuning, preference alignment, and reinforcement learning workflows
  • Design task-specific evaluations, interpret results, and feed learnings back into core post-training pipelines
What we offer
What we offer
  • Competitive base salary with equity in a unicorn-stage company
  • We pay 100% of medical, dental, and vision premiums for employees and dependents
  • 401(k) matching up to 4% of base pay
  • Unlimited PTO plus company-wide Refill Days throughout the year
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff

The Microsoft AI Superintelligence (MAIST) Post Training team is dedicated to ad...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate OR equivalent experience
  • Significant experience in large-scale model training, data curation, and hands-on coding, ideally from leading research labs
  • Deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Ability to develop LLMs, SLMs, multimodal, and coding models using both proprietary and open-source frameworks
  • Self-driven, able to write efficient code and debug training jobs, document findings, and demonstrate a track record in these fields
  • Curious, adaptable problem-solver who thrives on continuous learning, embraces changing priorities, and is motivated by creating meaningful impact
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...
Location
Location
United States , Redmond
Salary
Salary:
84200.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree (complete or in progress) in relevant field AND 3+ months related research internship experience OR Master's Degree in relevant field OR equivalent experience
  • Software engineering skills with fluency in Python and modern data libraries
  • The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Member of Technical Staff - Post-Training

This Microsoft AI Superintelligence Post-Training team is dedicated to advancing...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate in relevant field OR equivalent experience
  • Software engineering skills with fluency in Python and modern data libraries
  • The ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff

The Microsoft AI Superintelligence (MAIST) Post Training team is dedicated to ad...
Location
Location
United States , Redmond
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in relevant field AND 1+ year(s) related research experience OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
Job Responsibility
Job Responsibility
  • Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
  • run ablation studies to measure impact and optimize data effectiveness
  • Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
  • Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
  • Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
  • identify gaps and propose improvements
  • Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
  • Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Multimodal Infrastructure

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructu...
Location
Location
United States , Mountain View
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience in multi-modal data processing: Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations
  • Experience with state-of-art model inference and serving frameworks
  • Experience with image/video/audio data processing
  • Experience with common data formats for efficient I/O
  • Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed
  • Knowledge of auto-regressive and diffusion transformer models
  • Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
  • Proven experiences in at least one of the following areas: image/video generation and editing
  • efficient architectures (e.g., MoE, window attention)
Job Responsibility
Job Responsibility
  • Design, develop and maintain large-scale multimodal data processing pipelines
  • Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
  • Design, develop and maintain large-scale multimodal model inference and serving frameworks
  • Work with research scientists and product engineers to solve infra-related problems
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right
New

Lead Reliability Engineer

This is a key leadership role, responsible for driving reliability, asset perfor...
Location
Location
United Kingdom , Hereford
Salary
Salary:
41000.00 - 44000.00 GBP / Year
avarafoods.co.uk Logo
Avara Foods
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • HND or above in Engineering (Mechanical, Electrical, or related discipline)
  • Proven experience in reliability, maintenance, or engineering leadership in an FMCG or manufacturing environment
  • Strong understanding of maintenance systems (CMMS), asset management, and performance metrics (OEE, MTBF, MTTR)
  • Demonstrable leadership, coaching, and influencing skills
  • Excellent analytical, problem-solving, and communication abilities
  • Ability to manage multiple priorities and work effectively across teams
Job Responsibility
Job Responsibility
  • Lead and manage the Reliability Team, ensuring effective delivery of asset performance, maintenance planning, and reliability projects
  • Act as lead for reliability and asset care, championing continuous improvement across site
  • Develop and sustain proactive maintenance strategies, including predictive and condition-based maintenance, to improve equipment availability and reduce unplanned downtime
  • Analyse performance and downtime data to identify and eliminate root causes of equipment failure
  • Collaborate with the wider Engineer Team to coordinate planned maintenance, improvement activities, and engineering support during production
  • Support the Engineering Reliability Manager in the development and execution of the site’s maintenance and reliability roadmap
  • Maintain the office, reliability- and outside areas to high standard, ensuring regular checks are conducted and satisfactory feedback is received from GMP/WPW audits
  • Lead cross-functional reliability reviews, ensuring effective communication between Engineering, Operations, Planning, and technical teams
  • Manage contractor and OEM support, ensuring all work complies with site safety, technical, and legislative standards
  • Ensure all rectification actions identified on service reports are followed up and completed in a timely manner
What we offer
What we offer
  • 6% Pension
  • 31 Days Holiday
  • Life Assurance
  • Private Medical Health Cover
  • Subsidised Canteen
  • Free Staff Parking
  • Wellbeing and lifestyle benefits, including discounts with major retailers and access to health resources
  • Fulltime
Read More
Arrow Right
New

Accountant

We are looking for a skilled Accountant with extensive experience in managing fi...
Location
Location
United States , Honolulu
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7 to 10 years of accounting experience, with a strong background in financial management
  • Proficiency in General Ledger systems and Great Plains Accounting software
  • Advanced skills in Excel for data analysis and reporting
  • Hands-on experience with payroll systems, including Kronos
  • Demonstrated ability to perform month-end close procedures
  • Solid understanding of accrual accounting principles and practices
  • Capability to work independently and manage multiple financial entities
  • Strong attention to detail and ability to meet deadlines effectively
Job Responsibility
Job Responsibility
  • Reconcile and update financial records for multiple entities to address existing backlogs
  • Perform month-end close processes to ensure financial accuracy and compliance
  • Manage accrual accounting and ensure proper documentation of expenses and revenues
  • Oversee payroll operations, including processing and reviewing for accuracy
  • Utilize Microsoft Great Plains Accounting and other general ledger systems to maintain financial data
  • Collaborate with team members to resolve discrepancies and streamline workflows
  • Prepare detailed financial reports and summaries for internal review
  • Implement effective processes to maintain consistency in financial operations
  • Ensure compliance with all relevant accounting standards and regulations
  • Provide expertise and guidance with minimal need for training or supervision
What we offer
What we offer
  • medical, vision, dental, life and disability insurance
  • 401(k) or deferred compensation plan (if eligible)
  • paid time off for vacation, personal needs, and sick time
  • paid holidays
  • Choice Time Off (CTO)
  • free online training
  • Fulltime
Read More
Arrow Right