Data Scientist - Applied AI Research

United States, Westlake · Job Posted April 21, 2026

Job Description

The Data Scientist - Applied AI Research will ideate, design, and develop NLP solutions across our products. You will partner with other teams on technical matters and work closely with our engineers. The ideal candidate is comfortable reading academic papers and designing experiments to assess the utility of new ideas.

Job Responsibility

  • Work closely with Agile team members to bring ML solutions into the product
  • Benchmark and optimize the performance of existing ML solutions (e.g., model footprint or latency)
  • Deliver reports on a sprint cadence
  • Peer review code and reports written by teammates
  • Bring good ideas during brainstorming sessions

Requirements

  • 1+ years of Data Science experience, specializing in NLP
  • 1+ years of generative AI experience, including LLMs, Agents, MCPs, etc.
  • 1+ years of experience working in an Agile environment
  • 1+ years of experience with AWS products in a Linux environment
  • Experience developing and optimizing solutions from transformer-based, fastText-based, or ensemble-based models
  • Experience with PyTorch
  • Understanding of text representation techniques and classification algorithms
  • Deep understanding of experiment design and documentation
  • Statistical acumen and experience applying statistical concepts to data science experiments
  • Deep knowledge of machine learning algorithms, with the ability to choose the optimal algorithm for a given problem
  • Ability to write robust and testable code in Python
  • Ability to read and write Bash shell scripts, and comfort on the Linux command line
  • Master's degree or higher in a quantitative subject: Mathematics, Physics, Computer Science, Computational Linguistics, or similar

Similar Jobs for Data Scientist - Applied AI Research

8 matching positions

AI Research Scientist, Media Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
United States, Menlo Park
Salary
154000.00 - 217000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 1+ year of industry research experience in LLM/LMM, computer vision, or related AI/ML models
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to quality in data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

AI Research Scientist, Text Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
United States, Menlo Park
Salary
154000.00 - 217000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 1+ year of industry research experience in LLM/NLP or related AI/ML models
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with pre-training or mid-training data curation for large foundational models and experience working with organic, synthetic, agentic, or reasoning data for LLMs
  • Published research in leading peer-reviewed conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Architect efficient and scalable data curation systems and pipelines
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

AI Research Scientist, Media Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
United States, Bellevue
Salary
122000.00 - 181000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • PhD in Computer Science or a related technical field, plus 1+ years of industry research experience in LLM/LMM, computer vision, or related AI/ML models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
  • Experience owning and/or driving complex technical projects from end-to-end
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to innovation in data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

AI Research Scientist, Text Data Research - MSL FAIR

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
United States, Menlo Park
Salary
184000.00 - 257000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 2+ years of industry research experience in LLM/NLP or related AI/ML models
  • Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
  • Practical experience with pre-training or mid-training data curation for large foundational models and experience working with organic, synthetic, agentic, or reasoning data for LLMs
  • Published research in leading peer-reviewed conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Architect efficient and scalable data curation systems and pipelines
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

AI Research Scientist, Media Data Research

Meta is seeking AI research scientists to help us build the data foundation for ...
Location
United States, Menlo Park
Salary
184000.00 - 257000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 2+ years of industry research experience in LLM/NLP, computer vision, or related AI/ML models
  • Experience as a formal technical lead, leading major technical initiatives with cross-functional impact, and/or influencing strategy across multiple teams
  • Practical experience with multimodal pre-training or mid-training data curation for large media perception or generation models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, data scaling laws, or data mixing
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

AI Research Scientist (Technical Leadership), Data Research - MSL FAIR

Meta is seeking research scientists to help us build the data foundation for Met...
Location
United States, Menlo Park
Salary
219000.00 - 301000.00 USD / Year
Meta
Expiration Date
Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science or a related technical field
  • 4+ years of industry research experience in NLP or CV
  • 4+ years of experience as a formal technical lead
  • Experience leading major technical initiatives with cross-functional impact and influencing strategy across multiple teams
  • Practical experience with multimodal pre-training or mid-training data curation for large language models, media perception, or media generation models
  • Published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV) and/or demonstrated significant industry influence in the field of AI
Job Responsibility
  • Collaborate with cross-functional teams to develop Meta’s next foundational models
  • Advance our understanding of data research, such as how to overcome data walls and how best to create synthetic data
  • Architect efficient and scalable data curation systems and pipelines
  • Fundamentally improve our data velocity across workflows and projects by contributing to the advancement of data tooling
  • Execute on high priority projects in pre-training, mid-training, or post-training data curation
  • Apply specialized expertise in video/image generation, video/image perception, OCR, agentic data, synthetic data, reasoning data, web parser, coding data, data scaling laws, or datamix optimization
  • Lead complex technical projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

Senior Data Scientist - Applied AI

Working at Uber means solving hard problems in a high-stakes, fast-moving enviro...
Location
Brazil, Sao Paulo
Salary
Not provided
Uber
Expiration Date
Until further notice
Requirements
  • 5+ years of experience (or Ph.D. equivalent) in an Applied Science, Machine Learning, or Data Science role
  • Specialized domain expertise in Ranking, Recommender Systems (RecSys), or Search
  • Proven experience in training and deploying Deep Learning models at scale within a production environment
  • Proficiency in Python and SQL with experience handling large-scale datasets using Spark, Hive, or PySpark
  • Solid understanding of statistical methods, experimental design, and A/B testing
  • BSc., M.S., or Ph.D. in Computer Science, Machine Learning, Statistics, Economics, or a related quantitative field
Job Responsibility
  • Design and implement ML models and objective functions that unify competing business interests like organic relevance and sponsored content into a single value space
  • Act as the science lead for foundational machine learning initiatives, unblocking technical debt and optimizing feature engineering for high-scale, real-time systems
  • Navigate the ambiguity of user behavior by designing sophisticated experiments and causal inference frameworks that go beyond standard A/B testing
  • Collaborate across disciplines (Product, Engineering, and Data Science) to translate high-level business goals into theoretically sound and performant technical roadmaps
  • Research and apply advancements in Deep Learning, Reinforcement Learning, and GenAI to solve complex, high-impact problems without a clear starting point
  • Own your models end-to-end, from the first scientific hypothesis to debugging production issues in real-time, low-latency environments
Employment type: Full-time

Principal Applied Data Scientist - AI for Good Lab

The AI for Good Lab is hiring a Principal Applied Data Scientist to join our tea...
Location
United States, Redmond
Salary
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience (e.g., statistics, predictive analytics, research)
  • OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience (e.g., statistics, predictive analytics, research) OR equivalent experience
  • Deep foundation in AI, machine learning, statistics, or related quantitative methods applied to real-world problems
  • Experience working end-to-end with data, from sourcing and exploration through modeling, interpretation, and communication
  • Proficiency in at least one scientific programming language (Python, R or equivalent languages) and experience with SQL or similar query languages
  • Excellent written and verbal communication skills, with demonstrated experience communicating complex ideas clearly and persuasively to non-technical audiences
  • Proven ability to influence outcomes and lead work in cross-functional, matrixed environments
Job Responsibility
  • Lead and develop applied AI solutions (LLMs, Agents, Computer Vision) and data science solutions by identifying and gathering data, shaping problem formulations, applying AI, machine learning, and statistical methods, and generating insight with real-world impact
  • Use AI creatively as a research and solution-building tool, combining quantitative methods, experimentation, and domain knowledge to surface patterns, test ideas, and inform decisions
  • Rapidly prototype and validate approaches using modeling, statistics, and experimentation; select methods under real-world constraints (cost/latency, safety, privacy, maintainability)
  • Design and build reliable, maintainable, end-to-end systems spanning data pipelines, model lifecycle, evaluation/telemetry, deployment, and operations
  • Advance the AI for Good Lab research agenda by authoring technical papers and presentations, published both internally and externally
  • Work in close partnership with other researchers and research organizations, as well as policy, industry, and nonprofit stakeholders, to co-create solutions
  • Present findings with clear and compelling narratives, using impactful visualizations and storytelling to articulate insights that drive understanding and action
  • Lead through influence by shaping technical direction and standards (model evaluation, responsible AI, safety/privacy, and monitoring), aligning collaborators, navigating tradeoffs, and sustaining momentum across teams and institutions
Employment type: Full-time