CrawlJobs Logo

AI Research Scientist, Evaluations - Meta Superintelligence Lab

meta.com Logo

Meta

Location Icon

Location:
United States , Menlo Park

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

184000.00 - 257000.00 USD / Year

Job Description:

Meta is seeking Research Scientists to join the Evaluations team within Meta Superintelligence Labs (MSL). Evaluations are the core of AI progress at MSL, determining what capabilities get built, which features get prioritized, and how fast our models improve. As a Research Scientist, you will provide the technical capabilities to measure and understand the capabilities of our frontier AI systems. You'll work in tandem with world-class researchers to envision, develop, and validate novel evaluations that shape the future of AI capability measurement. This is a technical research role requiring good scientific judgment, creativity, and the ability to drive ambitious research agendas with independence. The evaluations you develop will directly influence research direction and major model lines within MSL, making scientific validity, methodological rigor, and clear communication important. You will collaborate closely with technical leadership to ensure evaluations capture the most important capabilities, translating organizational priorities into measurable benchmarks, and translating evaluation insights back into research direction. We are looking for exceptional research talent – researchers who have shaped the field of machine learning, and are ready to do so again at the frontier of AI. If you are passionate about defining how we measure AI progress and want to shape the scientific foundations of frontier AI development, we encourage you to apply for this exciting opportunity at the core of MSL.

Job Responsibility:

  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Develop and implement evaluation environments, including environments for novel model capabilities and modalities
  • Collaborate with external data vendors to source and prepare high-quality evaluation datasets
  • Execute on the technical vision of research scientists designing new benchmarks and evaluations
  • Build robust, reusable evaluation pipelines that scale across multiple model lines and product areas
  • Contribute to evaluation tooling that measures the quality and reliability of evaluation suites

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Machine Learning, or a related technical field
  • 3+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing and completing medium to large technical features independently, without guidance
  • Proven success in software engineering practices including version control, testing, and code review practices
  • Ability to work independently and adapt to rapidly changing priorities

Nice to have:

  • Publications at peer-reviewed venues (NeurIPS, ICML, ICLR, ACL, EMNLP, or similar) related to language model evaluation, benchmarking, or deep learning
  • Hands-on experience with language model post-training and deep learning systems, or building reinforcement learning environments
  • Experience implementing or developing evaluation benchmarks for large language models and multimodal models (e.g., vision-language, audio, video)
  • Experience working with large-scale distributed systems and data pipelines
  • Familiarity with language model evaluation frameworks and metrics
  • Track record of open-source contributions to ML evaluation tools or benchmarks
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI Research Scientist, Evaluations - Meta Superintelligence Lab

AI Research Scientist - Voice AI Team, Meta Superintelligence Labs

Meta is seeking AI Research Scientists to join the Realtime AI Voice team in Met...
Location
Location
United States , Menlo Park, CA +2 locations
Salary
Salary:
184000.00 - 257000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Mathematics, or similar quantitative field
  • 2+ years of post-PhD experience in an academic, industry, or government laboratory setting, with primary responsibilities focused on AI research
  • Proven track record of publications at peer-reviewed AI & speech conferences (e.g. NeurIPS, ICML, ICLR, ICASSP)
  • Experience in training, fine-tuning, and/or experimenting with foundation models beyond black-box use
  • Familiarity with one or more deep learning frameworks (e.g., pytorch, tensorflow)
  • Experience communicating complex research to public audiences of peers
Job Responsibility
Job Responsibility
  • Lead, collaborate, and execute on research that pushes forward the state of the art in speech and large language model research
  • Directly contribute to experiments, including designing experimental details, develop reusable code, running evaluations, and organizing results
  • Help identify long-term research goals as well as intermediate milestones
  • Work cross-functionally to translate research breakthroughs into scalable, production-ready solutions for Meta's conversational AI / product experiences
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

AI Research Scientist, Personalization, Meta SuperIntelligence Labs

Meta is seeking AI research scientists to help us build the solutions for Person...
Location
Location
United States , Menlo Park
Salary
Salary:
154000.00 - 217000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Phd in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience in Generative AI models and building LLM technologies particularly post training
  • Experience solving complex problems and comparing alternative solutions, tradeoffs, and different perspectives to determine a path forward. Proven experience of proactively identifying, scoping and implementing innovative research solutions
  • Programming experience in Python and hands-on experience with frameworks like Pytorch, Spark
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Collaborate with cross-functional teams to develop and improve personalization in Meta’s frontier foundation models
  • Directly contribute to experiments, including designing experimental details, authoring reusable code, running evaluations, and organizing results
  • Prioritize research that can be applied to Meta's product development
  • Lead complex research projects end-to-end
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Research Engineering Manager, Evaluations, Meta Superintelligence Labs

Meta is seeking a Research Engineering Manager to lead the Evaluations team with...
Location
Location
United States , Menlo Park
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects from conception to deployment
  • Demonstrated experience balancing hands-on technical work with people management and strategic planning
  • Clear communication and experience influencing cross-functional stakeholders
Job Responsibility
Job Responsibility
  • Build, mentor, and grow a team of research engineers and scientists focused on evaluation infrastructure and benchmarking
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a culture of engineering excellence, research rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Oversee the development and implementation of evaluation environments, including environments for novel model capabilities and modalities
  • Establish partnerships with external data vendors to source and prepare high-quality evaluation datasets
  • Influence the technical roadmap for evaluation infrastructure in collaboration with MSL Infra team
  • Translate the technical vision of research scientists into actionable engineering plans and execution strategies
  • Partner with research scientists, product teams, and other engineering teams to align evaluation priorities with organizational goals
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right

Data Scientist, Evaluations - Meta Superintelligence Labs

Meta is seeking a Data Scientist to join the Evaluations team within Meta Superi...
Location
Location
United States , Menlo Park
Salary
Salary:
177000.00 - 247000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Bachelor's degree in Mathematics, Statistics, a relevant technical field, or equivalent practical experience
  • A minimum of 6 years of work experience in analytics (minimum of 4 years with a Ph.D.)
  • Experience with data querying languages (e.g. SQL), scripting languages (e.g. Python), and/or statistical/mathematical software (e.g. R)
Job Responsibility
Job Responsibility
  • Scientific Design & Validity: Lead the design of evaluation stimuli and benchmarks, ensuring they have minimal bias and high construct validity for frontier LLM capabilities
  • Experimental Methodology: Design and execute effective sampling strategies and experimental frameworks to measure model performance and errors accurately
  • Deep-Dive Analysis: Perform rigorous data and model error analyses to provide deep insights into model behavior, quality gaps, and failure modes
  • Collaborative Research: Partner closely with Research Scientists and Engineers to translate organizational priorities into measurable, scientifically sound benchmarks
  • External Impact: Drive the publication of novel evaluation research and the open-sourcing of benchmarks to influence the broader AI research community
  • Strategic Influence: Use data-driven insights to influence research directions and major model development lines within MSL
What we offer
What we offer
  • bonus
  • equity
  • benefits
  • Fulltime
Read More
Arrow Right
New

Controller

We are looking for an experienced Controller to lead core accounting and finance...
Location
Location
United States , South Salt Lake
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Demonstrated experience in a Controller or senior accounting leadership role with responsibility for full-cycle financial operations
  • Strong background in financial statement preparation and ownership of the month-end close process
  • Knowledge of accounts payable, accounts receivable, collections, and treasury management practices
  • Experience working with external auditors and supporting audit readiness with accurate documentation
  • Familiarity with merger and acquisition activity, including financial review or due diligence support
  • Proven ability to lead staff effectively, manage priorities, and coordinate work across multiple stakeholders
  • High level of accuracy and organization in document management, recordkeeping, and supporting financial files
Job Responsibility
Job Responsibility
  • Direct day-to-day accounting functions across payables, receivables, and collection activities to support timely and accurate financial operations
  • Lead the monthly close cycle from planning through final review, ensuring balances are reconciled and reporting deadlines are met
  • Prepare and review financial statements and related schedules, delivering clear and reliable information for business decision-making
  • Partner with external auditors by coordinating documentation, responding to requests, and supporting annual audit activities
  • Oversee treasury-related tasks, including cash management and monitoring financial liquidity across the business
  • Contribute financial leadership to merger and acquisition efforts through analysis, due diligence support, and integration planning
  • Supervise a small accounting team with three direct reports while providing guidance to additional cross-functional contributors
  • Maintain organized financial records and documentation processes, including compiling and managing supporting files for reporting and compliance needs
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
  • Fulltime
Read More
Arrow Right
New

Beauty & Wellness Merchandiser

Beauty & Wellness Merchandiser. Do you enjoy travelling and working as part of a...
Location
Location
United Kingdom , Stoke-on-Trent
Salary
Salary:
13.01 - 13.88 GBP / Hour
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Motivated individuals who take pride in their work
  • Previous retail or merchandising experience, particularly within a beauty environment, is preferred
  • Full UK manual driving licence is a plus
Job Responsibility
Job Responsibility
  • Merchandising and replenishing stock using planograms within set timescales
  • Profiling shelving, installing equipment, and implementing displays to a high standard
  • Maintaining cleanliness of equipment and general housekeeping duties
  • Manual handling and lifting (up to 25kg- male 16kg - female may be required)
  • Working efficiently in a fast-paced environment
  • Installing POS (Point of Sale) materials
  • Cosmetic merchandising
  • Basic IT tasks such as downloading and printing planograms (preferred)
What we offer
What we offer
  • Carpooling options for non-drivers
  • Driver incentive scheme
  • Pre-paid shared hotel accommodation with breakfast
  • Uniform provided
  • Supportive team environment
  • Development opportunities through our training programme
  • Holiday pay accrued and paid when not working
  • Refer-a-friend scheme – earn a £200 bonus (T&Cs apply)
  • Parttime
Read More
Arrow Right
New

Fractional Controller

Robert Half Management Resources is seeking an experienced Fractional Controller...
Location
Location
United States , Beverly Hills
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Prior experience serving as a Controller or senior finance leader within commercial real estate ownership, investment, or property operations strongly preferred.
  • Strong command of financial reporting, management reporting, and annual close processes in a multi-entity environment.
  • Solid knowledge of US GAAP and its application to property accounting and real estate financial operations.
  • Demonstrated ability to analyze property-level economics, including leasing structures, operating costs, and cash flow performance.
  • Experience with account reconciliation, audit preparation, and coordinating documentation for external accountants or advisors.
  • Familiarity with commercial property management accounting, including CAM reconciliation practices.
  • Ability to work independently, communicate clearly with non-financial stakeholders, and operate effectively in a high-trust setting.
  • Consultative and adaptable approach suited to a fractional leadership assignment.
Job Responsibility
Job Responsibility
  • Lead financial oversight for a diversified commercial multi-state real estate portfolio, ensuring accurate reporting across individual properties and related entities.
  • Examine monthly statements, including income statements, balance sheets, and cash flow reports, to confirm consistency and reliability of results.
  • Advise ownership on asset performance by highlighting trends in leasing activity, operating expenses, recoveries, and cash generation.
  • Work closely with bookkeeping and operations team members to improve coordination, reinforce internal controls, and maintain accountability.
  • Evaluate current accounting procedures and introduce practical enhancements that fit a streamlined, owner-directed environment.
  • Partner with external tax professionals and other advisors to support organized reporting, smooth information sharing, and audit-ready documentation.
  • Provide financial perspective on portfolio strategy, including capital allocation considerations and performance patterns across properties.
  • Support account reconciliations, annual close activities, and preparation of audit support materials such as PBC schedules.
  • Monitor commercial property management accounting details, including CAM reconciliations and related reporting accuracy.
What we offer
What we offer
  • medical
  • vision
  • dental
  • life and disability insurance
  • 401(k) plan
  • Parttime
Read More
Arrow Right
New

Mot Tester

Here at Williams, our MOT Testers take pride in accuracy, safety and ensuring ve...
Location
Location
United Kingdom , Rochdale
Salary
Salary:
29000.00 - 32000.00 GBP / Year
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A full, valid UK driving licence
  • A valid DVSA MOT Tester licence
  • Previous experience as an MOT Tester in a busy workshop
  • Strong attention to detail and knowledge of MOT testing standards
  • Ability to work efficiently and accurately under pressure
  • A professional, safety-focused and customer-oriented approach
Job Responsibility
Job Responsibility
  • Carrying out MOT tests in accordance with DVSA regulations
  • Completing work within times set by the Workshop Controller
  • Accurately recording work using the company's time recording systems
  • Identifying and reporting any additional work required on vehicles
  • Using appropriate diagnostic tools and testing equipment
  • Maintaining a safe, clean and organised working environment
  • Supporting the workshop team to deliver efficient and high-quality service
What we offer
What we offer
  • Monthly performance bonus, with on-target earnings of £3,000 per year
  • Career development plan, including full manufacturer training and certification
  • Structured working hours
  • Ongoing investment in digital aftersales systems
  • Prestige staff car options and staff discounts across the Group
  • Great facilities and a professional, supportive team environment
  • Generous annual leave-30 days paid holiday (including Bank Holidays), increasing to 35 days with length of service
  • £1,000 employee referral bonus
  • Medicash healthcare scheme
  • Employee Assistance Programme (EAP)
  • Fulltime
Read More
Arrow Right