CrawlJobs Logo

Applied Research Engineer, Agents

United States, New York City 160000.00 - 300000.00 USD / Year · Job Posted December 09, 2025
Apply Position
Job Link Share

Job Description

As an Applied Research Engineer, you will be the bridge between research, industry, and application shaping the future of our core natural language processing systems. You will be responsible for enabling agentic capabilities across the Hebbia product suite. You will own experiments and POCs focused on combining the latest research findings with specific high value problems that our customers encounter each and every day. You will leverage our deep relationships with foundation model providers - partnering to beta test models, experiment with new features, and develop guidance on relative model strengths

Job Responsibility

  • Focused on LLMs, you will play a crucial role in analyzing and interpreting complex data types to derive and implement cutting edge insight generation systems
  • Iterate and explore new LLM and NLP techniques maintaining our foothold as an industry leader
  • You will utilize your expertise in statistics, programming, and machine learning to develop and deploy data-driven models and algorithms
  • Your work will contribute to solving business problems, improving processes, and enhancing the overall performance of the company
  • Collaborate with cross-functional teams to improve NLP/LLM capabilities in app
  • Stay up-to-date with the latest advancements and research in the space
  • Collaborate with software engineers to integrate agentic capabilities into existing systems or develop new applications
  • Ensure that systems are efficient, maintainable and well monitored
  • Iterate on validation and testing frameworks

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related field
  • 7+ years software development experience at a venture-backed startup or top technology firm, with a focus on applied machine learning systems
  • Strong programming skills in Python
  • Experience with NLP and text processing libraries such as NLTK, SpaCy, or Apache Tika
  • Experience with Search and Indexing technologies
  • Proficient in machine learning techniques and algorithms
  • Experience working with foundational models and corresponding APIs
  • Knowledge of statistical analysis and data scraping techniques
  • Prior experience in developing NLP models and systems
  • Excellent problem-solving and analytical skills
  • Strong communication and teamwork abilities
  • Strong capability to translate research into production software systems

Nice to have

  • Master’s degree in Computer Science, Mathematics, Machine Learning or a related field is a plus
  • Experience with prompting and building LLM applications and agents is a plus
  • Experience building agentic systems or LLM enabled products
  • Frequent user of AI products, especially during the development lifecycle (i.e. Cursor, Claude Code, etc)
  • experience building with foundation models and experience working with Attention based NLP models is a plus

What we offer

  • PTO: Unlimited
  • Insurance: Medical + Dental + Vision + 401K
  • Eats: Catered lunch daily + doordash dinner credit if you ever need to stay late
  • Parental leave policy: 3 months non-birthing parent, 4 months for birthing parent
  • Fertility benefits: $15k lifetime benefit
  • New hire equity grant: competitive equity package with unmatched upside potential

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Applied Research Engineer, Agents

8 matching positions

Applied Research - RL & Agents

Prime Intellect builds the infrastructure that frontier AI labs build internally...
Location
Location
United States , San Francisco
Salary
Salary:
Not provided
Prime Intellect
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong background in machine learning engineering, with experience in post-training, RL, or large-scale model alignment
  • Experience with agent frameworks and tooling (e.g. DSPy, LangGraph, MCP, Stagehand)
  • Familiarity with distributed training/inference frameworks (e.g., vLLM, sglang, Accelerate, Ray, Torch)
  • Track record of research contributions (publications, open-source contributions, benchmarks) in ML/RL
  • Passion for advancing the state-of-the-art in reasoning and building practical, agentic AI systems
  • Strong technical writing abilities (documentation, blogs, papers) and research taste
  • Eagerness to drive collaborations with external partners and engage with the broader open-source community
Job Responsibility
Job Responsibility
  • Advancing Agent Capabilities: Designing and iterating on next-generation AI agents that tackle real workloads—workflow automation, reasoning-intensive tasks, and decision-making at scale
  • Building Robust Infrastructure: Developing the systems and frameworks that enable these agents to operate reliably, efficiently, and at massive scale
  • Bridge Between Applications & Research: Translate ambiguous objectives into clear technical requirements that guide product and research priorities
  • Prototype in the Field: Rapidly design and deploy agents, evals, and harnesses for real-world tasks to validate solutions
  • Application-Driven Research & Infrastructure: Shape the direction and feature set for verifiers, the Environments Hub, training services, and other research platform offerings
  • Build high‑quality examples, reference implementations, and “recipes” that make it easy for others to extend the stack
  • Prototype agents and eval harnesses tailored to real-world use cases and external systems
  • Pair with technical end‑users (research teams, infra‑heavy customers, open‑source contributors) to design environments, evals, and verifiers that reflect real workloads
  • Post-training & Reinforcement Learning: Design and implement novel RL and post-training methods (RLHF, RLVR, GRPO, etc.) to align large models with domain-specific tasks
  • Build evaluations and harnesses and to measure reasoning, robustness, and agentic behavior in real-world workflows
What we offer
What we offer
  • Competitive Compensation + equity incentives
  • Flexible Work (San Francisco or hybrid-remote)
  • Visa Sponsorship & relocation support
  • Professional Development budget
  • Team Off-sites & conference attendance
  • Fulltime
Read More
Arrow Right

Machine Learning Research Engineer, GenAI Applied ML

Lead applied ML engineering on Scale's Applied ML team, powering data infrastruc...
Location
Location
United States , San Francisco; New York
Salary
Salary:
176000.00 - 220000.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD or MSc in Computer Science, Mathematics, Statistics, or related field
  • 3+ years shipping scaled production ML systems
  • Demonstrated real-world impact
  • Mastery of PyTorch, TensorFlow, JAX, or scikit-learn
  • Deep expertise in agentic LLMs and multi-agent systems
  • Strong software engineering and microservices (AWS/GCP)
  • Rapid, data-driven iteration
  • Proficiency using AI tools to accelerate work
  • Strong research depth with practical bias
  • Excellent cross-functional communication
Job Responsibility
Job Responsibility
  • Build and deploy multi-agent systems for agentic reasoning validation
  • Develop pipelines to detect errors and scale human judgment
  • Combine classical ML, LLMs, and multi-agent techniques for reliability
  • Lead research into agent failure modes and ship fixes
  • Use AI tools to speed prototyping and iteration
  • Build data-driven evaluations and deploy rapid improvements
  • Integrate systems into Scale's platform
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • Fulltime
Read More
Arrow Right

Research Engineer, Text Data Research - MSL FAIR

Meta is seeking AI research engineers to help us build the data foundation for M...
Location
Location
United States , Menlo Park
Salary
Salary:
257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 2+ years of industry research experience in LLM/NLP or related AI/ML models
  • Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
  • Practical experience with pre-training or mid-training data curation for large foundational models and experience working with organic, synthetic, agentic, or reasoning data for LLMs
  • Demonstrated data infrastructure and software background, and experience building data tooling and services
  • Published research in leading peer-reviewed conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Architect efficient and scalable data curation systems and pipelines
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
  • Lead complex technical projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right
New

Product Manager, Applied Research

Luma's mission is to build unified general intelligence that can generate, under...
Location
Location
United States , SF Bay Area
Salary
Salary:
Not provided
lumalabs.ai Logo
Luma AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Principal or Staff-level product management experience (L6 seniority or above) with a track record of bridging pure research and ambitious product development at an AI lab or applied AI company
  • You are genuinely comfortable in a research environment — you can read evaluations, understand what they mean, ask the right questions, and help researchers see the product implications of their work
  • You've shipped products that originated in research — not just optimized existing products, but taken emergent capabilities and turned them into new product surfaces
  • You can build prototypes yourself — you have the technical depth to work directly with models, write prompts, test hypotheses, and validate ideas before involving a full engineering team
  • You operate with extremely high agency and a strong self-starter mentality — research environments are inherently ambiguous and this role has no existing playbook
  • You have a general manager mindset — you can evaluate research directions not just for technical novelty but for commercial viability and customer impact
  • You bring a mix of large-company and earlier-stage experience — you understand both the pace of a research lab and the urgency of a startup building a commercial business
  • You can influence without authority — researchers don't report to you, but your input on priorities needs to carry weight because it's well-informed and well-reasoned
Job Responsibility
Job Responsibility
  • Own the product strategy for translating Luma's frontier research capabilities into shippable features across Canvas, Agents, and the model platform
  • Work directly within the research team to shape priorities, ensuring model development and evaluation are informed by real enterprise needs in marketing, advertising, and entertainment
  • Establish and operate the feedback loops between research, product, go-to-market, and forward-deployed teams — so that market signals flow into research prioritization and research breakthroughs flow into product planning
  • Interpret evaluation results and research findings to identify which emerging capabilities have the highest product and commercial leverage — and make bets accordingly
  • Build prototypes yourself to validate product ideas before committing engineering resources — using the models directly to test hypotheses about what's possible and what customers will value
  • Define the product implications of research advances in multimodal understanding, generation, tool use, and agent capabilities — translating "the model can now do X" into "customers can now solve Y"
  • Partner with the Agent PM and Enterprise PM to ensure research priorities align with the capabilities that enterprise products and customer deployments actually need
  • Drive the cadence and process for research-to-product handoffs — defining what "ready to ship" means, managing the transition from research prototype to production feature, and owning the quality bar
  • Fulltime
Read More
Arrow Right

Research Engineer (Technical Leadership), FAIR Data - Meta Superintelligence Labs

Meta is seeking Research Engineers to help us build the data foundation for Meta...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 4+ years of industry research experience with pre/mid/post-training data curation for large language or large media models
  • 4+ years of formal technical lead experience
  • Experience leading major technical initiatives with cross-functional impact and influencing strategy across multiple teams
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Architect efficient and scalable data curation systems and pipelines
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image perception or generation, OCR, agentic data, synthetic data, multilingual data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
  • Lead complex technical projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Senior Software Engineer Applied Gen AI Engineering

At Citi, we are pioneering the future of enterprise operations through innovativ...
Location
Location
Canada , Mississauga
Salary
Salary:
120800.00 - 170800.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional software engineering experience, demonstrating a strong track record of designing, building, and delivering scalable enterprise-grade solutions in commercial production environments, not just proofs-of-concept
  • Expert-level proficiency in Python is a must-have, with a deep understanding of its ecosystem for AI/ML development, data engineering, and backend services
  • Extensive hands-on experience with Generative AI concepts, Large Language Models (LLMs), transformer architectures, RAG, and advanced agentic frameworks (e.g., LangChain, LangGraph, Google ADK)
  • Deep comfort and practical experience with containers and orchestration technologies, specifically OpenShift
  • Demonstrated ability to architect, develop, and deploy highly performant, large-scale AI/ML systems into production environments
  • Strong understanding of modern software development principles, clean code practices, data structures, algorithms, and distributed systems
  • Proficiency with Relational (preferably, PostgreSQL) and Vector (preferably, pgvector) databases
Job Responsibility
Job Responsibility
  • Architect & Build Production Systems
  • Pioneer Automation with Agents
  • Master Containerized Deployments
  • Drive Technical Direction & Ownership
  • Champion Engineering Excellence
  • Innovate & Research
  • Mentor & Collaborate
  • Iterate & Deliver
  • Ensure Responsible AI
What we offer
What we offer
  • Unprecedented Impact & Visibility
  • Cutting-Edge Technology
  • Growth & Development
  • Collaborative Environment
  • Flexible Work Environment
  • Global Scale
  • Fulltime
Read More
Arrow Right

Applied Research - Forward-Deployed

Prime Intellect builds the infrastructure that frontier AI labs build internally...
Location
Location
United States , San Francisco
Salary
Salary:
150000.00 - 300000.00 USD / Year
Prime Intellect
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Deep hands-on experience building, evaluating, or deploying LLM-based agents in the past 1–2 years
  • Strong intuition for evaluation design
  • Working understanding of RL and post-training concepts (GRPO, RLHF, reward modeling, SFT)
  • Strong Python skills and comfort with the modern AI stack (Hugging Face, inference engines, agent frameworks)
  • Experience in a customer-facing or consulting-adjacent technical role, or as a technical founder
  • Excellent written and verbal communication
  • High agency and comfort with ambiguity
Job Responsibility
Job Responsibility
  • Embed directly with strategic customers to understand their agent architectures, failure modes, and product goals
  • Design and build custom RL environments, evaluation harnesses, and verifiers that capture what 'good' looks like for each customer's domain
  • Architect agent scaffolding — tool use, multi-step reasoning, memory, sandbox execution — tailored to customer workflows
  • Configure and launch training runs on Lab, iterating on reward functions, rollout strategies, and evaluation criteria
  • Serve as the technical lead for engagements end-to-end: from discovery through deployed, improved models
  • Identify repeatable patterns from customer engagements and codify them into reference implementations, templates, and documentation
  • Serve as the voice of the customer internally, shaping the roadmap for Lab, verifiers, the Environments Hub, and training infrastructure
  • Build high-quality examples and 'recipes' that make it easy for new customers and open-source contributors to extend the stack
  • Contribute to technical content (blog posts, tutorials, case studies) that demonstrates real-world platform usage
  • Develop novel evaluation methodologies for agentic behavior — multi-step reasoning, tool use correctness, recovery from failure, long-horizon task completion
What we offer
What we offer
  • Cash Compensation Range of $150-300k + equity incentives
  • Flexible Work (San Francisco or hybrid-remote)
  • Visa Sponsorship & relocation support
  • Professional Development budget
  • Team Off-sites & conference attendance
  • Fulltime
Read More
Arrow Right

Research Engineer

As a Research Engineer at Mercor, you’ll work at the intersection of engineering...
Location
Location
United States , San Francisco
Salary
Salary:
130000.00 USD / Year
mercor.com Logo
Mercor
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong applied research background, with a focus on post-training and/or model evaluation
  • Strong coding proficiency and hands-on experience working with machine learning models
  • Strong understanding of data structures, algorithms, backend systems, and core engineering fundamentals
  • Familiarity with APIs, SQL/NoSQL databases, and cloud platforms
  • Ability to reason deeply about model behavior, experimental results, and data quality
  • Excitement to work in person in San Francisco, five days a week (with optional remote Saturdays), and thrive in a high-intensity, high-ownership environment
Job Responsibility
Job Responsibility
  • Work on post-training and RLVR pipelines to understand how datasets, rewards, and training strategies impact model performance
  • Design and run reward-shaping experiments and algorithmic improvements (e.g., GRPO, DAPO) to improve LLM tool-use, agentic behavior, and real-world reasoning
  • Quantify data usability, quality, and performance uplift on key benchmarks
  • Build and maintain data generation and augmentation pipelines that scale with training needs
  • Create and refine rubrics, evaluators, and scoring frameworks that guide training and evaluation decisions
  • Build and operate LLM evaluation systems, benchmarks, and metrics at scale
  • Collaborate closely with AI researchers, applied AI teams, and experts producing training data
  • Operate in a fast-paced, experimental research environment with rapid iteration cycles and high ownership
What we offer
What we offer
  • Generous equity grant vested over 4 years
  • A $20K relocation bonus (if moving to the Bay Area)
  • A $10K housing bonus (if you live within 0.5 miles of our office)
  • A $1K monthly stipend for meals
  • Free Equinox membership
  • Health insurance
  • Fulltime
Read More
Arrow Right