
Post Training Algorithm Engineer


Randstad


Location:
China, Shanghai


Contract Type:
Not provided


Salary:

600000.00 - 1000000.00 CNY / Year

Job Responsibility:

  • Convert the current drug discovery data into data that can be used for prospective model testing (benchmarks) and model training
  • Fine-tune and post-train open-source models such as Qwen, LLaMA, DeepSeek, and others for AI for science tasks
  • Integrate expert post-trained models into the current drug discovery workflows, interfaces and tools

Requirements:

  • At least 2-3 years of experience in LLM post-training
  • Master's degree or above in computer science, AI, or a related major
  • Demonstrated experience in setting up the environment, sequence, and data for post-training of open source models
  • Experience benchmarking the performance of post-trained models
  • Experience increasing training efficiency to minimize training cost

Nice to have:

Experience in post-training at a major LLM vendor is preferred

Additional Information:

Job Posted:
February 28, 2026

Expiration:
May 10, 2026

Employment Type:
Fulltime


Similar Jobs for Post Training Algorithm Engineer

Research Engineer, Core ML

This is a research engineering role with direct production impact. You will tran...
Location
United States, San Francisco
Salary:
200000.00 - 280000.00 USD / Year
Together AI
Expiration Date
Until further notice
Requirements
  • 3+ years of experience working on ML systems, large‑scale model training, inference, or adjacent areas (or equivalent experience via research / open source)
  • Advanced degree in Computer Science, EE, or a related field, or equivalent practical experience
  • Demonstrated experience owning complex technical projects end‑to‑end
  • Strong expertise in at least one of the following: Large‑scale inference systems (e.g., SGLang, vLLM, FasterTransformer, TensorRT, custom engines, or similar), GPU performance, distributed serving
  • RL / post‑training for LLMs or large models (e.g., GRPO, RLHF/RLAIF, DPO‑like methods, reward modeling)
  • Model architecture design for Transformers or other large neural nets
  • Distributed systems / high‑performance computing for ML
  • Strong coding ability in Python
  • Experience profiling and optimizing performance across GPU, networking, and memory layers
  • Track record of impactful work in ML systems, RL, or large‑scale model training (papers, open‑source projects, or production systems)
Job Responsibility
  • Advance inference efficiency end‑to‑end
  • Design and prototype algorithms, architectures, and scheduling strategies for low‑latency, high‑throughput inference
  • Implement and maintain changes in high‑performance inference engines
  • Profile and optimize performance across GPU, networking, and memory layers
  • Unify inference with RL / post‑training
  • Design and operate RL and post‑training pipelines
  • Make RL and post‑training workloads more efficient with inference‑aware training loops
  • Co‑design algorithms and infrastructure
  • Run ablations and scale‑up experiments to understand trade‑offs
  • Own critical systems at production scale
What we offer
  • Startup equity
  • Health insurance
  • Competitive benefits
  • Fulltime

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

The Enterprise ML Research Lab works on the front lines of this AI revolution. W...
Location
United States, San Francisco; New York
Salary:
218400.00 - 273000.00 USD / Year
Scale
Expiration Date
Until further notice
Requirements
  • At least 1-3 years of LLM training experience in a production environment
  • Passionate about system optimization
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Demonstrated know-how in operating the architecture of a modern GPU cluster
  • Experience with multi-node LLM training and inference
  • Strong software engineering skills, proficient in frameworks and tools such as CUDA, PyTorch, transformers, flash attention, etc.
  • Strong written and verbal communication skills to operate in a cross functional team environment
  • PhD or Masters in Computer Science or a related field
Job Responsibility
  • Build, profile and optimize our training and inference framework
  • Post-train state of the art models, developed both internally and from the community, to define stable post-training recipes for our enterprise engagements
  • Collaborate with ML teams to accelerate their research and development, and enable them to develop the next generation of models and data curation
  • Create a next-gen agent training algorithm for multi-agent/multi-tool rollouts
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • additional benefits such as a commuter stipend
  • equity based compensation
  • Fulltime

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Build out our next-gen Agent RL training platform; build out the platform that w...
Location
United States, San Francisco; New York
Salary:
218400.00 - 273000.00 USD / Year
Scale
Expiration Date
Until further notice
Requirements
  • 5+ years of LLM training in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field
Job Responsibility
  • Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers
  • Research cutting edge algorithms to integrate directly into our training stack
  • Design solutions that enable complex multi-agent systems to directly learn from both process + outcome based rewards
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity based compensation
  • Fulltime

Research Scientist - Generative AI

As a Research Scientist in the Emergent Machine Intelligence Team at Hewlett Pac...
Location
United States, Santa Barbara
Salary:
101900.00 - 234500.00 USD / Year
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences.
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars.
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development.
  • Fulltime

Research Scientist - Generative AI

This role involves conducting high-quality research in generative AI, designing ...
Location
United States
Salary:
101900.00 - 234500.00 USD / Year
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements
  • PhD in Computer Science, Artificial Intelligence, Machine Learning, Physics, Mathematics, or other related fields
  • 3-5 years working experience with training and fine-tuning generative AI models including LLMs, diffusion models, or Energy-Based Models
  • Proven track record of research in generative models, demonstrated through publications, patents, or publicly available projects
  • Proficiency in programming languages commonly used in AI research, such as Python, and experience with AI/ML frameworks (e.g., TensorFlow, PyTorch)
  • Deep understanding of machine learning algorithms and principles, especially in the context of generative AI
  • Strong mathematical background, with excellent skills in areas such as statistics, probability, linear algebra
  • Creative and analytical thinking abilities, with a passion for solving complex problems
  • Excellent communication skills, capable of conveying complex ideas clearly and engaging with both technical and non-technical audiences
Job Responsibility
  • Conduct high-quality research in generative AI, including but not limited to designing algorithms for pre-training and post-training current autoregressive and diffusion models for multimodal data
  • Design, implement, and validate new algorithms and models for augmented LLMs, pushing the boundaries of AI capabilities
  • Developing and prototyping novel algorithms for fine-tuning, retrieval augmented generation, and in-context learning for various generative models
  • Developing algorithms for training and inference in Energy-Based Models
  • Collaborate with cross-functional teams to apply research findings to develop new products or enhance existing ones
  • Publish research papers in top-tier journals and conferences, sharing findings with the broader scientific community
  • Stay abreast of the latest AI research and trends, identifying opportunities for innovation and improvement
  • Mentor junior researchers and engineers, fostering a culture of knowledge sharing and collaboration
  • Develop prototypes and proof-of-concept implementations to demonstrate the potential of research findings
  • Engage with the academic community by attending conferences, workshops, and seminars
What we offer
  • A competitive salary and extensive social benefits
  • Diverse and dynamic work environment
  • Work-life balance and support for career development
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime

Research Engineer

As a Research Engineer at Mercor, you’ll work at the intersection of engineering...
Location
United States, San Francisco
Salary:
130000.00 - 500000.00 USD / Year
Mercor
Expiration Date
Until further notice
Requirements
  • Strong applied research background, with a focus on post-training and/or model evaluation
  • Strong coding proficiency and hands-on experience working with machine learning models
  • Strong understanding of data structures, algorithms, backend systems, and core engineering fundamentals
  • Familiarity with APIs, SQL/NoSQL databases, and cloud platforms
  • Ability to reason deeply about model behavior, experimental results, and data quality
  • Excitement to work in person in San Francisco, five days a week (with optional remote Saturdays), and thrive in a high-intensity, high-ownership environment
Job Responsibility
  • Work on post-training and RLVR pipelines to understand how datasets, rewards, and training strategies impact model performance
  • Design and run reward-shaping experiments and algorithmic improvements (e.g., GRPO, DAPO) to improve LLM tool-use, agentic behavior, and real-world reasoning
  • Quantify data usability, quality, and performance uplift on key benchmarks
  • Build and maintain data generation and augmentation pipelines that scale with training needs
  • Create and refine rubrics, evaluators, and scoring frameworks that guide training and evaluation decisions
  • Build and operate LLM evaluation systems, benchmarks, and metrics at scale
  • Collaborate closely with AI researchers, applied AI teams, and experts producing training data
  • Operate in a fast-paced, experimental research environment with rapid iteration cycles and high ownership
What we offer
  • Generous equity grant vested over 4 years
  • A $20K relocation bonus (if moving to the Bay Area)
  • A $10K housing bonus (if you live within 0.5 miles of our office)
  • A $1K monthly stipend for meals
  • Free Equinox membership
  • Health insurance
  • Fulltime

Member of Technical Staff - Multimodal

Join Microsoft AI in building one of the world’s most advanced foundation models...
Location
United States, Mountain View
Salary:
119800.00 - 234700.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Proven expertise, demonstrated through impactful publications or technical leadership on high-scale projects
  • Strong analytical skills, attention to detail, and a data-driven approach to decision-making
  • Experience with large-scale distributed systems and scalable architectures
  • Ability to thrive in fast-paced, collaborative environments and embrace innovation
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
  • Fulltime

Member of Technical Staff, AI Multimodal - MAI Superintelligence Team

At Microsoft AI, we are on a mission to train the world’s most capable AI fronti...
Location
Switzerland, Zürich
Salary:
Not provided
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Expertise in multimodal research with a strong publishing track record
  • Proven expertise in areas of interest, evidenced by an exceptional publication track record and/or significant technical leadership in high-impact projects
  • Strong analytical skills, attention to detail, and a commitment to data-driven decision-making
  • Experience with and/or in-depth understanding of large-scale distributed systems
  • Ability to work collaboratively in a fast-paced, innovative environment
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
  • Fulltime