CrawlJobs Logo

Research Engineer, GenAI

United States, San Francisco 175000.00 - 250000.00 USD / Year · Job Posted December 09, 2025
Apply Position
Job Link Share

Job Description

You will be part of Kiddom’s Data Science team, building the foundation of our search, recommendation, and insights systems. Your work will directly support teachers and students by delivering timely insights, personalized content, and intelligent assistance.

Job Responsibility

  • Architect and scale machine learning systems for search, personalization, and recommendations that power Kiddom’s teacher helper and insight engine
  • Develop evaluation-first development workflows to measure how models improve teaching efficiency, lesson planning, and student learning outcomes
  • Fine-tune machine learning models with feedback signals from teachers and students to align outputs with instructional goals and classroom needs
  • Design intelligent discovery pipelines that combine semantic retrieval, curriculum alignment, and real-time personalization
  • Build agentic assistants that help teachers plan lessons, adapt instruction, and reduce repetitive tasks
  • Collaborate closely with product managers, designers, and curriculum experts to translate high-level educational goals into scalable ML-powered systems
  • Coach and mentor junior ML engineers and data scientists, fostering technical and professional growth

Requirements

  • 5+ years of industry experience applying machine learning to solve real-world problems with large, complex datasets
  • 1–2 years in a technical leadership role
  • Proven track record designing, evaluating, and deploying ML/AI systems in production environments that drive measurable business impact, ideally in recommendation, personalization, search, or workflow optimization
  • Strong programming skills in Python
  • Fluency in data manipulation (SQL, Pandas) and common ML toolkits (scikit-learn, XGBoost, TensorFlow/PyTorch)
  • Strong analytical skills and ability to break down complex problems into measurable hypotheses and experiments
  • Excellent communication skills with a history of cross-functional collaboration with product, design, and engineering stakeholders

Nice to have

  • Deep expertise in modern deep learning frameworks and advanced LLM architectures
  • Experience building evaluation pipelines for ML/AI systems, ensuring reliable measurement of impact and quality in real-world use
  • Experience implementing and fine-tuning large language models (LLMs), including prompt engineering, embeddings, and efficient inference optimization
  • Familiarity with foundation model adaptation techniques such as PEFT, LoRA, or RLHF
  • Self-motivated innovator who thrives in fast-moving environments and is excited to explore emerging AI techniques to solve meaningful problems in education
  • Passion for applying cutting-edge AI research to improve teaching workflows and personalize student learning at scale

What we offer

  • Meaningful equity
  • Health insurance benefits: medical (various PPO/HMO/HSA plans), dental, vision, disability and life insurance
  • One Medical membership (in participating locations)
  • Flexible vacation time policy (subject to internal approval). Average use 4 weeks off per year
  • 10 paid sick days per year (pro rated depending on start date)
  • Paid holidays
  • Paid bereavement leave
  • Paid family leave after birth/adoption. Minimum of 16 paid weeks for birthing parents, 10 weeks for caretaker parents. Meant to supplement benefits offered by State
  • Commuter and FSA plans

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Engineer, GenAI

8 matching positions

Machine Learning Research Engineer, GenAI Applied ML

Lead applied ML engineering on Scale's Applied ML team, powering data infrastruc...
Location
Location
United States , San Francisco; New York
Salary
Salary:
176000.00 - 220000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD or MSc in Computer Science, Mathematics, Statistics, or related field
  • 3+ years shipping scaled production ML systems
  • Demonstrated real-world impact
  • Mastery of PyTorch, TensorFlow, JAX, or scikit-learn
  • Deep expertise in agentic LLMs and multi-agent systems
  • Strong software engineering and microservices (AWS/GCP)
  • Rapid, data-driven iteration
  • Proficiency using AI tools to accelerate work
  • Strong research depth with practical bias
  • Excellent cross-functional communication
Job Responsibility
Job Responsibility
  • Build and deploy multi-agent systems for agentic reasoning validation
  • Develop pipelines to detect errors and scale human judgment
  • Combine classical ML, LLMs, and multi-agent techniques for reliability
  • Lead research into agent failure modes and ship fixes
  • Use AI tools to speed prototyping and iteration
  • Build data-driven evaluations and deploy rapid improvements
  • Integrate systems into Scale's platform
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • Fulltime
Read More
Arrow Right

Engineer Intern, GenAI Research

Appen’s GenAI research team advances how frontier models are evaluated, improved...
Location
Location
United States
Salary
Salary:
Not provided
appen.com Logo
Appen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Current enrollment in or recent completion of a Master’s or PhD in Computer Science, AI, Machine Learning, Computer Engineering, or a closely related technical field
  • Strong experience working with large language models, including supervised fine tuning, prompt engineering, or model evaluation
  • Hands on experience building machine learning pipelines or research infrastructure
  • Experience improving model performance through retraining or hyperparameter tuning
  • Proficiency in Python and comfort working with machine learning frameworks and open source model ecosystems
  • Familiarity with cloud environments such as AWS or Azure
  • Strong technical problem solving ability, including use of LLMs as development aids for building and iteration
  • Ability to work independently with minimal hand holding
  • Strong written communication skills for summarising research and drafting technical documentation
  • Ability to collaborate effectively in a remote research environment
Job Responsibility
Job Responsibility
  • Design and implement a lightweight supervised fine tuning training pipeline using open source LLMs
  • Create new benchmarks to evaluate frontier models across defined scientific and performance criteria
  • Analyze production models to identify measurable areas for improvement
  • Improve model performance through targeted retraining and hyperparameter search
  • Deploy improved models while maintaining core model characteristics and avoiding regression
  • Build Python tooling to automate training, evaluation, benchmarking, and experimentation workflows
  • Implement structured evaluation methods, including rubric based scoring and LLM as a judge workflows
  • Document experimental design, benchmark methodology, and performance results with clarity and precision
  • Iterate rapidly in a research driven environment to increase model quality and reliability
  • Fulltime
Read More
Arrow Right

Engineer Intern, GenAI Research

Location
Location
United States
Salary
Salary:
Not provided
appen.com Logo
Appen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a Master's program
  • Currently enrolled in a PhD program
  • Recently completed a Master’s degree within the last 12 months
  • Recently completed a PhD within the last 12 months
  • Hands-on experience with Python
  • Hands-on experience with Machine learning frameworks
  • Hands-on experience with Open source large language models
  • Hands-on experience with Hyperparameter tuning workflows
  • Hands-on experience with Model evaluation frameworks
  • Hands-on experience with Cloud platforms such as AWS or Azure
  • Fulltime
Read More
Arrow Right

Machine Learning Research Engineer, Agents - Enterprise GenAI

The Enterprise ML Research Lab works on the front lines of this AI revolution. W...
Location
Location
United States , San Francisco; New York
Salary
Salary:
218400.00 - 273000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 1-3 years of building with LLMs in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field
Job Responsibility
Job Responsibility
  • Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers
  • Research cutting edge algorithms to integrate directly into our training stack
  • Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • Fulltime
Read More
Arrow Right

Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI

Join the team shaping the future of AI at Scale. The Enterprise ML Research Lab ...
Location
Location
United States , San Francisco; New York
Salary
Salary:
218400.00 - 273000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of building with LLMs in a production environment
  • Clear experiences with constructing high quality data to use to improve an LLM/Agent
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field
Job Responsibility
Job Responsibility
  • Build synthetic data pipelines to generate enterprise environments to use for RL post-training
  • Create agents to convert traces from production into actionable insights to use to improve agents
  • Contribute to our agent building product which can construct other agents using coding agents + proprietary algorithms
  • Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • Fulltime
Read More
Arrow Right

Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI

Build out our next-gen Agent RL training platform; build out the platform that w...
Location
Location
United States , San Francisco; New York
Salary
Salary:
218400.00 - 273000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of LLM training in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field
Job Responsibility
Job Responsibility
  • Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers
  • Research cutting edge algorithms to integrate directly into our training stack
  • Design solutions that enable complex multi-agent systems to directly learn from both process + outcome based rewards
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • commuter stipend
  • equity based compensation
  • Fulltime
Read More
Arrow Right

Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI

The Enterprise ML Research Lab works on the front lines of this AI revolution. W...
Location
Location
United States , San Francisco; New York
Salary
Salary:
218400.00 - 273000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 1-3 years of LLM training in a production environment
  • Passionate about system optimization
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Ability to demonstrate know-how on how to operate the architecture of the modern GPU cluster
  • Experience with multi-node LLM training and inference
  • Strong software engineering skills, proficient in frameworks and tools such as CUDA, Pytorch, transformers, flash attention, etc.
  • Strong written and verbal communication skills to operate in a cross functional team environment
  • PhD or Masters in Computer Science or a related field
Job Responsibility
Job Responsibility
  • Build, profile and optimize our training and inference framework
  • Post-train state of the art models, developed both internally and from the community, to define stable post-training recipes for our enterprise engagements
  • Collaborate with ML teams to accelerate their research and development, and enable them to develop the next generation of models and data curation
  • Create a next-gen agent training algorithm for multi-agent/multi-tool rollouts
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • additional benefits such as a commuter stipend
  • equity based compensation
  • Fulltime
Read More
Arrow Right

Genai Software Engineer Intern – Genai Model Experiment

This is an exciting opportunity to light the way as a GenAI Engineer Intern with...
Location
Location
China , Shanghai
Salary
Salary:
Not provided
signify.com Logo
Signify
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently pursuing a Master's/PhD in Computer Science, Data Science, AI, or related fields
  • Availability: Able to commit to a 4-5 days, 4-month internship starting in May - June 2026
  • Proven coding rigor: Requiring strong Python programming or vibe-coding skills with hands-on experience building or testing LLM-based applications and utilizing modern development toolchains (Claude Code, Git)
  • Research & Writing Excellence: Demonstrated ability to formalize research methodologies and write high-quality academic or technical papers are preferred
  • Analytical mindset: Experience with A/B testing frameworks, experimental design, and defining/tracking KPIs for model evaluation are preferred
  • Self-starter: Ability to set up experiments independently, troubleshoot code, and drive projects forward
  • English proficiency: Strong ability to read, write, and synthesize complex technical documents and academic papers
Job Responsibility
Job Responsibility
  • Generative Engine Optimization (GEO) & Application Development: Drive GEO initiatives to enhance brand visibility for campaigns. Your core responsibilities will include: Designing and executing rigorous A/B test planning
  • Conducting prompt optimization to maximize AI-driven search relevance
  • Creating and implementing custom AI skills and Agentic workflows
  • Tracking KPIs to measure optimization success and visibility lift
  • Developing, maintaining, and scaling features for our internal GEO applications
  • Academic Research & Publication: Collaborate with the team to conduct deep, rigorous research. You will be responsible for composing and co-authoring academic papers targeted for publication in top-tier journals, exploring advanced AI topics such as LLM Supervised Fine-Tuning (SFT), time series foundation models, and model alignment
What we offer
What we offer
  • Pleasant work environment
  • Attractive compensation
  • Career guidance
  • Learning and development
  • Employee benefits
  • Fulltime
Read More
Arrow Right