AI Research Scientist, Evaluations - Meta Superintelligence Lab

United States, Menlo Park · 184000.00 - 257000.00 USD / Year · Job Posted February 20, 2026

Job Description

Meta is seeking Research Scientists to join the Evaluations team within Meta Superintelligence Labs (MSL). Evaluations are the core of AI progress at MSL, determining what capabilities get built, which features get prioritized, and how fast our models improve. As a Research Scientist, you will provide the technical expertise to measure and understand the capabilities of our frontier AI systems. You'll work in tandem with world-class researchers to envision, develop, and validate novel evaluations that shape the future of AI capability measurement.

This is a technical research role requiring good scientific judgment, creativity, and the ability to drive ambitious research agendas with independence. The evaluations you develop will directly influence research direction and major model lines within MSL, making scientific validity, methodological rigor, and clear communication important. You will collaborate closely with technical leadership to ensure evaluations capture the most important capabilities, translating organizational priorities into measurable benchmarks and evaluation insights back into research direction.

We are looking for exceptional research talent: researchers who have shaped the field of machine learning and are ready to do so again at the frontier of AI. If you are passionate about defining how we measure AI progress and want to shape the scientific foundations of frontier AI development, we encourage you to apply for this exciting opportunity at the core of MSL.

Job Responsibility

  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Develop and implement evaluation environments, including environments for novel model capabilities and modalities
  • Collaborate with external data vendors to source and prepare high-quality evaluation datasets
  • Execute on the technical vision of research scientists designing new benchmarks and evaluations
  • Build robust, reusable evaluation pipelines that scale across multiple model lines and product areas
  • Contribute to evaluation tooling that measures the quality and reliability of evaluation suites

Requirements

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Machine Learning, or a related technical field
  • 3+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Experience identifying, designing and completing medium to large technical features independently, without guidance
  • Proven success with software engineering practices, including version control, testing, and code review
  • Ability to work independently and adapt to rapidly changing priorities

Nice to have

  • Publications at peer-reviewed venues (NeurIPS, ICML, ICLR, ACL, EMNLP, or similar) related to language model evaluation, benchmarking, or deep learning
  • Hands-on experience with language model post-training and deep learning systems, or building reinforcement learning environments
  • Experience implementing or developing evaluation benchmarks for large language models and multimodal models (e.g., vision-language, audio, video)
  • Experience working with large-scale distributed systems and data pipelines
  • Familiarity with language model evaluation frameworks and metrics
  • Track record of open-source contributions to ML evaluation tools or benchmarks

What we offer

  • bonus
  • equity
  • benefits

Similar Jobs for AI Research Scientist, Evaluations - Meta Superintelligence Lab

8 matching positions

AI Research Scientist - Voice AI Team, Meta Superintelligence Labs

Meta is seeking AI Research Scientists to join the Realtime AI Voice team in Met...
Location: United States, Menlo Park, CA +2 locations
Salary: 184000.00 - 257000.00 USD / Year
Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD degree in Computer Science, Mathematics, or similar quantitative field
  • 2+ years of post-PhD experience in an academic, industry, or government laboratory setting, with primary responsibilities focused on AI research
  • Proven track record of publications at peer-reviewed AI & speech conferences (e.g. NeurIPS, ICML, ICLR, ICASSP)
  • Experience in training, fine-tuning, and/or experimenting with foundation models beyond black-box use
  • Familiarity with one or more deep learning frameworks (e.g., PyTorch, TensorFlow)
  • Experience communicating complex research to public audiences of peers
Job Responsibility
  • Lead, collaborate, and execute on research that pushes forward the state of the art in speech and large language model research
  • Directly contribute to experiments, including designing experimental details, developing reusable code, running evaluations, and organizing results
  • Help identify long-term research goals as well as intermediate milestones
  • Work cross-functionally to translate research breakthroughs into scalable, production-ready solutions for Meta's conversational AI / product experiences
What we offer
  • bonus
  • equity
  • benefits

AI Research Scientist, Personalization, Meta SuperIntelligence Labs

Meta is seeking AI research scientists to help us build the solutions for Person...
Location: United States, Menlo Park
Salary: 154000.00 - 217000.00 USD / Year
Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience in Generative AI models and building LLM technologies, particularly post-training
  • Experience solving complex problems and comparing alternative solutions, tradeoffs, and different perspectives to determine a path forward. Proven experience of proactively identifying, scoping and implementing innovative research solutions
  • Programming experience in Python and hands-on experience with frameworks like PyTorch and Spark
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
  • Collaborate with cross-functional teams to develop and improve personalization in Meta’s frontier foundation models
  • Directly contribute to experiments, including designing experimental details, authoring reusable code, running evaluations, and organizing results
  • Prioritize research that can be applied to Meta's product development
  • Lead complex research projects end-to-end
What we offer
  • bonus
  • equity
  • benefits

Data Scientist, Evaluations - Meta Superintelligence Labs

Meta is seeking a Data Scientist to join the Evaluations team within Meta Superi...
Location: United States, Menlo Park
Salary: 177000.00 - 247000.00 USD / Year
Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Bachelor's degree in Mathematics, Statistics, a relevant technical field, or equivalent practical experience
  • A minimum of 6 years of work experience in analytics (minimum of 4 years with a Ph.D.)
  • Experience with data querying languages (e.g. SQL), scripting languages (e.g. Python), and/or statistical/mathematical software (e.g. R)
Job Responsibility
  • Scientific Design & Validity: Lead the design of evaluation stimuli and benchmarks, ensuring they have minimal bias and high construct validity for frontier LLM capabilities
  • Experimental Methodology: Design and execute effective sampling strategies and experimental frameworks to measure model performance and errors accurately
  • Deep-Dive Analysis: Perform rigorous data and model error analyses to provide deep insights into model behavior, quality gaps, and failure modes
  • Collaborative Research: Partner closely with Research Scientists and Engineers to translate organizational priorities into measurable, scientifically sound benchmarks
  • External Impact: Drive the publication of novel evaluation research and the open-sourcing of benchmarks to influence the broader AI research community
  • Strategic Influence: Use data-driven insights to influence research directions and major model development lines within MSL
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

Research Engineering Manager, Evaluations, Meta Superintelligence Labs

Meta is seeking a Research Engineering Manager to lead the Evaluations team with...
Location: United States, Menlo Park
Salary: 219000.00 - 301000.00 USD / Year
Meta
Expiration Date: Until further notice
Requirements
  • Bachelor's or Master's degree in Computer Science, Machine Learning, or a related technical field
  • 4+ years of experience in machine learning engineering, machine learning research, or a related technical role
  • 3+ years of experience managing or leading technical teams, including hiring, mentoring, and performance management
  • Proficiency in Python and experience with ML frameworks such as PyTorch
  • Proven track record of leading medium to large-scale technical projects from conception to deployment
  • Demonstrated experience balancing hands-on technical work with people management and strategic planning
  • Clear communication and experience influencing cross-functional stakeholders
Job Responsibility
  • Build, mentor, and grow a team of research engineers and scientists focused on evaluation infrastructure and benchmarking
  • Conduct performance reviews, career development conversations, and provide technical mentorship to team members
  • Foster a culture of engineering excellence, research rigor, and rapid iteration within the team
  • Partner with recruiting to hire world-class research engineering talent
  • Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
  • Oversee the development and implementation of evaluation environments, including environments for novel model capabilities and modalities
  • Establish partnerships with external data vendors to source and prepare high-quality evaluation datasets
  • Influence the technical roadmap for evaluation infrastructure in collaboration with MSL Infra team
  • Translate the technical vision of research scientists into actionable engineering plans and execution strategies
  • Partner with research scientists, product teams, and other engineering teams to align evaluation priorities with organizational goals
What we offer
  • bonus
  • equity
  • benefits
Employment type: Full-time

Embedded Software Engineer (Chinese Speaking)

Analyze, design, develop, and maintain complex embedded software components base...
Location: Vietnam, Ho Chi Minh City
Salary: Not provided
Robert Bosch Sp. z o.o.
Expiration Date: Until further notice
Requirements
  • Bachelor's degree or higher in Computer Science, Software Engineering, Electrical Engineering, Electronics, Telecommunications, Control and Automation Engineering, Mechatronics, or a related field
  • 3+ years of proven professional experience in embedded software development
  • Hands-on experience with microcontroller architecture, device drivers, and real-time operating systems (RTOS)
  • Strong programming skills in C/C++, with experience in scripting languages like Python or others such as C#
  • Solid foundation in Control Systems, Automation, Embedded Systems, and familiarity with automotive communication protocols (e.g., CAN, LIN, Ethernet)
  • Experience in areas such as Automotive Ethernet, Base Software (BSW), Bootloader, COM Stack, Cyber Security, Device Drivers, Diagnostics, and Real-Time Operating Systems
  • Proficient in English communication
  • Advanced Chinese Proficiency: Must be able to communicate fluently in technical and business contexts and read/understand technical documents in Chinese (HSK5 or above)
  • Results-driven with a quality-focused, structured, and disciplined engineering approach
  • Possesses a safety-critical mindset and an architecture-first approach
Job Responsibility
  • Analyze, design, develop, and maintain complex embedded software components based on business and technical requirements
  • Perform software requirement engineering, including analyzing, validating, and maintaining customer requirements
  • Perform software integration activities, including configuring and merging software modules into a unified build
  • Create and execute unit, component, and integration test cases to verify software functionality and ensure compliance with quality standards
  • Apply established software development processes and coding standards to produce reliable and maintainable code for embedded systems
  • Utilize debugging and analysis tools to investigate, troubleshoot, and resolve complex software defects and performance issues
  • Mentor junior engineers by providing guidance on technical tasks, coding practices, and problem-solving techniques
  • Contribute to technical reviews and team knowledge-sharing sessions
  • Ensure compliance with applicable industry standards, regulatory requirements, company policies, and quality frameworks applicable to the role and assigned projects
What we offer
  • Work at one of the Best Places to Work in Vietnam and one of the Top 30 Most Innovative Companies in the world
  • Join a dynamic and fast-growing global company (English-speaking environment)
  • 13th-month salary bonus + attractive performance bonus (you'll love it!) + annual performance appraisal
  • 100% of monthly salary and mandatory social insurance during the 2-month probation
  • Onsite opportunities: short-term and long-term assignments
  • 15+ days of annual leave + 1 day of birthday leave
  • Premium health insurance for the employee and 2 family members
  • Flexible working time
  • Lunch and parking allowance
  • Various training in trending technologies, foreign languages (English/Chinese/Japanese), and soft skills
Employment type: Full-time

Project purchasing engineer (exporting team)_EM

Location: China, Changsha
Salary: Not provided
Robert Bosch Sp. z o.o.
Expiration Date: Until further notice
Requirements
  • Bachelor's degree, preferably in a technical field
  • Fluent spoken and written English (additional language skills are a plus, preferably German or Spanish)
  • 3-5 years of working experience; automotive experience is preferred
  • Skills in efficient project management
  • Good communication skills, an open mind, and a team-player attitude
  • Flexible working hours required (communication with other regions across different time zones)
  • Frequent domestic and overseas business trips required, depending on task needs
Job Responsibility
  • Project purchasing management for overseas suppliers and overseas BOSCH plants
  • Responsible for cross-region project purchasing management
  • Responsible for meeting project cost, schedule, and quality targets
  • Lead technical discussions together with suppliers, engineers, and PMQ
  • Responsible for planning and realizing RPP (cost-saving) projects
  • Responsible for ECR (change management) in purchasing
Employment type: Full-time

Internship – Automotive Hardware Penetration Testing

The Bosch Group is a leading global supplier of technology and services. It empl...
Location: Vietnam, Ho Chi Minh City
Salary: Not provided
Robert Bosch Sp. z o.o.
Expiration Date: Until further notice
Requirements
  • Embedded Systems Knowledge: Familiarity with microcontrollers (preferably RH850 or similar), memory maps, and boot processes
  • Programming: Proficiency in C/C++, basic Python; experience with embedded firmware development is a plus
  • Hardware Debugging: Exposure to tools like oscilloscopes, logic analyzers, JTAG/SWD debuggers
  • Security Concepts: Basic understanding of cybersecurity principles, threat modeling, or cryptography
  • 3rd- or 4th-year student in Electrical Engineering, Computer Engineering, Computer Science, Mechatronics, or related fields
  • Available to commit to a full-time internship for 6 months, working Monday to Friday
  • Able to communicate effectively in English, both written and verbal
  • Curiosity & Problem-Solving: Strong interest in automotive security and willingness to explore new attack vectors
  • Teamwork: Ability to collaborate in a team setting, especially during the innovation project phase
What we offer
  • Monthly Internship allowance + Meal & Parking allowance
  • 1 day of paid leave per month
  • Trade Union benefits, team-building activities, and company trips
  • Opportunity to work on global projects at a fast-developing company and to be part of an innovation team contributing ideas to the hi-tech world
  • Engage in our diverse training programs, which help strengthen both your personal and professional skills
Employment type: Full-time

Senior Field Service Parts Planner

Anduril’s Planning team is seeking a world-class Senior Field Service Parts Plan...
Location: United States, Costa Mesa
Salary: 129000.00 - 171000.00 USD / Year
a16z
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in a technical field (e.g. manufacturing, engineering, analytics, computer science, etc.) or business field (e.g. finance, economics, supply chain management, business administration, marketing, etc.)
  • 6+ years of experience in supply chain planning, inventory management, or MRO planning within a fast-paced manufacturing, aerospace & defense, or technical environment
  • Demonstrated ability to solve complex operational challenges with creative solutions in a fast-paced, resource-limited environment, with a focus on speed and accuracy
  • Excellent communication, collaboration, and interpersonal skills to work effectively with cross-functional teams
  • Proven ability to be proactive, take substantial responsibility, and manage multiple priorities effectively
  • Experience with ERP systems such as Oracle and NetSuite, and CRM systems like Salesforce
  • Ability to travel up to 10% of the time
  • Ability to relocate, if not already local, to be onsite in Costa Mesa, CA
Job Responsibility
  • Plan, manage, and optimize Field Service & MRO inventory, including spare parts, consumables, and critical components, ensuring their availability when and where maintenance needs them
  • Collaborate closely with Field Service, Maintenance, Operations, and Reliability Engineering teams to understand and anticipate demand for both preventative/scheduled and corrective maintenance activities
  • Develop and maintain comprehensive critical spares lists, especially for hazard zone deployments, establishing appropriate stocking strategies for highly variable and mission-critical items
  • Design and implement effective inventory control strategies such as min/max levels, safety stock calculations, and reorder points, accounting for intermittent and event-driven demand patterns
  • Monitor Field Service & MRO part usage, analyze failure rates, and collaborate with Reliability Engineering to drive continuous improvement in material planning and asset reliability
  • Proactively identify and expedite at-risk materials or troubleshoot potential supply chain disruptions to prevent maintenance delays and protect asset uptime
  • Foster strong cross-functional coordination with Maintenance, Operations, Deployment, and Procurement teams to ensure seamless material flow, improve visibility into demand, and align on Field Service & MRO strategies
  • Ensure Field Service & MRO planning strategies and inventory management practices align with and fulfill O&S (Operations & Sustainment) contract requirements
  • Continuously seek opportunities to right-size inventory levels, reducing excess while mitigating stockout risks for critical components
What we offer
  • Highly competitive equity grants
  • top-tier benefits for full-time employees (available at little to no cost to employees)
Employment type: Full-time