CrawlJobs Logo

Research Scientist - Large Language Model

lumalabs.ai Logo

Luma AI

Location Icon

Location:
United States , Palo Alto

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

250000.00 - 450000.00 USD / Year

Job Description:

This is a rare opportunity to help define the future of large-scale language models. You will work across the entire lifecycle of model development — from large-scale pre-training, to targeted mid-training, to post-training alignment and capability refinement. You will operate at the frontier of scaling laws, reasoning, and alignment, directly shaping how foundation models learn, generalize, and behave in real-world deployments.

Job Responsibility:

  • Architect and scale large autoregressive language models
  • Design improved pre-training objectives to enhance reasoning, knowledge retention, and compositional generalization
  • Develop mid-training strategies such as continued pre-training, domain adaptation, curriculum learning, and synthetic data integration
  • Advance post-training techniques, including instruction tuning, preference optimization, reinforcement learning, distillation, and inference-time compute scaling
  • Study and improve long-context modeling, planning depth, and multi-step reasoning behavior
  • Curate and construct massive, high-quality text corpora for pre-training
  • Design synthetic data pipelines for reasoning, tool use, mathematics, coding, and structured problem solving
  • Develop filtering, mixture weighting, and curriculum strategies that shape emergent capabilities
  • Formulate new tasks that improve coherence, logical consistency, factual grounding, and robustness
  • Train frontier-scale language models across large GPU clusters
  • Optimize distributed training (data, tensor, pipeline parallelism), mixed precision, and memory efficiency
  • Build infrastructure for large-scale experimentation, ablations, and reproducibility
  • Improve inference efficiency and support scalable deployment
  • Define and build evaluation frameworks for language intelligence, including: Multi-step reasoning and mathematical problem solving, Coding and structured generation, Knowledge grounding and factuality, Planning and agentic behavior, Instruction following and alignment
  • Track capability development across pre-training, mid-training, and post-training
  • Close the loop between evaluation signals and data/model improvements

Requirements:

  • Strong foundation in machine learning and large language models
  • Deep understanding of autoregressive transformers and large-scale training dynamics
  • Experience with pre-training large models and/or post-training techniques such as instruction tuning, RLHF, preference optimization, or distillation
  • Hands-on experience with PyTorch and distributed training at scale
  • Comfortable operating across research and production environments

Nice to have:

  • Experience training frontier-scale language models from scratch
  • Research contributions in scaling laws, reasoning, alignment, or inference-time compute
  • Experience designing large-scale synthetic reasoning data
  • Expertise in long-context modeling or structured reasoning systems
  • Experience optimizing models for real-world deployment constraints

Additional Information:

Job Posted:
March 13, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Scientist - Large Language Model

Sr. Applied Research Scientist

We’re looking for a Sr. Applied Research Scientist to lead efforts in building l...
Location
Location
United States
Salary
Salary:
280000.00 - 380000.00 USD / Year
runwayml.com Logo
Runway
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of relevant ML engineering or research experience in language models
  • Very strong programming skills and ability to write clean and maintainable research code
  • Deep interest in building human-in-the-loop systems for creativity
  • Passion for seeing research through from initial conception to eventual application
  • Experience mentoring and teaching other researchers
  • Strong communication, collaboration, and documentation skills
Job Responsibility
Job Responsibility
  • Lead efforts in building large language models and vision language models that power Runway’s research and tools, with a focus on multimodal capabilities and reasoning
  • Fulltime
Read More
Arrow Right

Machine Learning Research Scientist

This role focuses on cutting-edge research and development in Artificial Intelli...
Location
Location
United States , Milpitas
Salary
Salary:
117500.00 - 270000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning for the dissertation
  • extensive experience in deep learning research, preferably in Large Language Models or Reinforcement Learning
  • experience developing applications with deep learning frameworks like PyTorch with a high software proficiency
  • strong programming skills in Python, data structures, and algorithms are required
  • experience with ML model optimization, GPU acceleration, heterogeneous computation, system software, and performance optimization desired
  • experience in Python Web Frameworks – Django, Flask - a plus but not required.
Job Responsibility
Job Responsibility
  • conducting research, developing solutions, and creating intellectual property in emerging fields like reinforcement learning, LLMs, digital twins, clean energy, data center optimization, and sustainability
  • developing advanced technologies for analysis, optimization, time series forecasting, uncertainty quantification, and control
  • providing thought leadership, collaborating internally and externally, and contributing to HPE’s strategy by identifying emerging technologies
  • publishing in top conferences like NeurIPS, AAAI, and ACL
  • developing patent applications
  • software development, GPU acceleration, model optimization, and real-time data streaming to create robust AI solutions for real-world use cases.
What we offer
What we offer
  • a competitive salary and extensive social benefits
  • diverse and dynamic work environment
  • work-life balance and support for career development
  • health and wellbeing programs
  • personal and professional development programs
  • diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right

Post Doctoral Machine Learning Research Scientist

The Core Machine Learning Research team within the Artificial Intelligence Resea...
Location
Location
United States , Milpitas
Salary
Salary:
47.75 - 72.00 USD / Hour
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning
  • Extensive experience in deep learning research is required, preferably with Reinforcement Learning and Large Language Models
  • Experience in developing applications with deep learning frameworks like PyTorch with a high software proficiency
Job Responsibility
Job Responsibility
  • LLM and agentic architectures with refinements to enhance trust for complex applications and workflows
  • Multi-agent and multi-objective reinforcement learning for complex physical systems
  • Generative models and Optimization for scientific domains such as inertial confinement fusion
  • Scalable, safe AI systems that push the boundary of what’s possible in applied ML
What we offer
What we offer
  • Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Specific programs catered to helping you achieve career goals
  • Inclusive working environment
  • Collaborations with top-tier research institutions, national labs, and global AI initiatives
  • Fulltime
Read More
Arrow Right

Senior Research Scientist, Intelligent Talent Acquisition - Lead Generation & Detection Services

Do you want a role with deep meaning and the ability to make a major impact? As ...
Location
Location
United Kingdom , Edinburgh
Salary
Salary:
Not provided
amazon.de Logo
Amazon Pforzheim GmbH
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's degree, or a PhD and experience in quantitative field research
  • Experience investigating the feasibility of applying scientific principles and concepts to business problems and products
  • 5+ years of experience in applied selection research, job analysis, test development, and validation
  • Foundational skills in conducting experimental research studies and data analysis
  • Proficiency in scripting for data analysis (e.g., R, Python)
Job Responsibility
Job Responsibility
  • Partner on design and development of AI-powered systems to scale job analyses enterprise-wide
  • Match potential candidates to the jobs they’ll be most successful in
  • Conduct validation research for top-of-funnel AI-based evaluation tools
  • Develop and implement novel research strategies using the latest technology
  • Build solutions while experiencing Amazon’s customer-focused culture
  • Work with diverse groups of people and inter-disciplinary cross-functional teams to solve complex business problems
Read More
Arrow Right

Research Scientist

The Research Scientist will be an integral part of Oumi's research team, focusin...
Location
Location
United States , Seattle, WA, San Mateo, CA, New York, NY
Salary
Salary:
Not provided
Oumi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Ph.D. or MSc. in computer science, machine learning, artificial intelligence, or a related field is preferred
  • Candidates with a strong publication record, or equivalent industry experience will be considered
  • Demonstrated experience in conducting original research in machine learning, with a strong publication record in top-tier conferences or journals
  • Deep understanding of machine learning and deep learning concepts, with specific knowledge of large language models (LLMs) and/or vision language models (VLMs)
  • Strong programming skills in Python and experience using deep learning frameworks (e.g. PyTorch)
  • Familiarity with open-source projects and a passion for contributing to the open-source community
  • A self-starter who can work independently and take ownership of initiatives
  • Share Oumi's values: Beneficial for all, Customer-obsessed, Radical Ownership, Exceptional Teammates, Science-grounded
Job Responsibility
Job Responsibility
  • Model Development: Conduct research on training and evaluating new Large language models (LLMs), Vision Language Models (VLMs), and other AI models. This includes exploring new architectures, training techniques, and optimization methods
  • Data Curation: Develop methodologies for curating high-quality datasets for training and evaluating LLMs. This may involve data synthesis and other novel techniques
  • Benchmark Development: Develop evaluation benchmarks to measure the performance of LLMs across various tasks and domains
  • Research and Experimentation: Design and conduct experiments to validate research hypotheses and improve model performance
  • Open Source Contribution: Contribute to the Oumi open-source platform, models and projects, and other relevant tools and libraries
  • Collaboration: Collaborate with other researchers, engineers, and the broader community to advance the field of open-source AI
  • Publication: Publish research findings in leading conferences and journals
  • Platform Evaluation: Evaluate existing models and identify areas of improvement
  • Flexibility: Work with various models, including text and multimodal models, and both open and closed models
  • Problem Solving: Focus on the research that matters by skipping the plumbing and moving straight to research, building on the work of others and contributing back
What we offer
What we offer
  • Equity in a high-growth startup
  • Comprehensive health, dental and vision insurance
  • 21 days PTO
  • Regular team offsites and events
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, Embodied Foundation Models

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently pursuing a graduate degree in Computer Science, Machine Learning, Robotics, or related technical field
  • Proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc)
  • Previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • Solid software engineering fundamentals, especially in Python
  • Previously used PyTorch or a similar library for deep learning (e.g. Tensorflow, JAX)
  • Experience with multi-node distributed training of large models
  • Interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • Previous publications in conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Job Responsibility
Job Responsibility
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead a high-impact research work and publish at a top tier conference
Read More
Arrow Right

Senior People Scientist

The Sr People Scientist is responsible for supplying to the development of an en...
Location
Location
United States , Bellevue
Salary
Salary:
127700.00 - 230300.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree Quantitative Subject area (math, statistics, economics, computer science, physics, engineering)
  • Master's/Advanced Degree Quantitative Subject area (I-O Psychology
  • Behavioral Economics
  • Applied Social Psychology w/emphases on research science and advanced statistics)
  • Doctorate Quantitative Discipline (I-O Psychology
  • Behavioral Economics
  • Applied Social Psychology w/emphases on research science and advanced statistics)
  • 7-10 years Research science or related experience
  • Proven experience with Gen AI for foundational models and LLM and demonstrating for analytics
  • 4-7 years Combination of deep technical skills and business savvy to interface and influence all levels and fields
Job Responsibility
Job Responsibility
  • Support the vision and research science roadmap in collaboration with the HR leadership team and senior leadership partners
  • Collaborate in identifying and addressing large-scale, sophisticated business problems related to employee experience, talent, and organizational capability
  • Drive the development and integration of diverse and complex data sources for advanced and sophisticated qualitative and quantitative modeling
  • Contribute to maintaining high standards in research science, including supporting the mentoring and development of team members
  • Develop and implement network analytics, AI/ML, and Deep Learning models to analyze sophisticated datasets and support innovation in people science
  • Build and run true A/B and quasi-experimental designs to assess the impact of mechanisms, programs, and various tested solutions that align to the overall T-Mobile people strategy
  • evaluate research initiatives to provide bottom line value, return on investment and improvements
  • Translate technical research findings into clear, concise, and engaging reports that support decisions and applications across the employee lifecycle
  • Collaborate with multiple teams and account teams to influence, build consensus, and drive significant T-Mobile wide changes related to applying research science proposals and recommendations, including changes to programs, engineering and system needs, and people strategy roadmaps
What we offer
What we offer
  • medical, dental and vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • up to 12 paid holidays
  • paid parental and family leave
  • family building benefits
  • back-up care
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, Embodied Foundation Models (Evaluation)

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • You are currently pursuing a graduate degree in a Computer Science, Machine Learning, Robotics, or related technical field
  • You are proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc)
  • You have previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • You have solid software engineering fundamentals, especially in Python
  • You have previously used PyTorch or a similar library for deep learning (e.g. Tensorflow, JAX)
  • Experience with multi-node distributed training of large models
  • You are interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • You have previous publications in the following conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Job Responsibility
Job Responsibility
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead a high-impact research work and publish at a top tier conference (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Read More
Arrow Right