CrawlJobs Logo

Research Scientist Intern, Language & Multimodal Foundations

meta.com Logo

Meta

Location Icon

Location:
United States , Menlo Park

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

7650.00 - 12134.00 USD / Month

Job Description:

Meta is seeking Research Interns to join the Multimedia Perception team in the Applied Research org of Meta Superintelligence Labs. These teams are committed to advancing the field of Artificial Intelligence by making fundamental advances in technologies to help interact with and understand our world. Our interns have an opportunity to make core algorithmic advances and apply their ideas at an unprecedented scale. Our teams are looking for candidates with expertise in the Natural Language Processing & Multimodality domains, such as: natural language understanding and generation, audio and understanding, multi-modal language models, dialogue, cross-lingual and cross-domain transfer learning, and computer vision. Our team at Meta AI offers twelve (12) to twenty-four (24) weeks long internships and we have various start dates throughout the year.

Job Responsibility:

  • Perform research that enables learning the semantics of data (specifically text, audio, images, video, and other modalities)
  • Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
  • Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
  • Draft and polish research publications
  • Present research outcomes to internal and/or external audiences
  • Contribute research that can be applied to Meta product development

Requirements:

  • Currently has or is in the process of obtaining a PhD degree in the field of Natural Language Processing, Speech Processing, Computer Vision, Machine Learning, Artificial Intelligence, or equivalent
  • Research and/or work experience in Natural Language Processing, Speech Processing, Computer Vision, Machine Learning, Deep Learning, or equivalent
  • Experience in Python, C++, or other related languages
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Nice to have:

  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as publications at leading workshops or conferences in Speech & Language (ACL, EMNLP, NAACL, TACL, ICASSP, Interspeech, SLT, ASRU, TASLP)
  • Experience advancing AI techniques in Natural Language Processing, including contributions to open source libraries and frameworks in Speech & Language
  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as publications at leading workshops or conferences in Computer Vision (CVPR, ECCV, ICCV) or Machine Learning (NeurIPS, ICML, ICLR)
  • Experience advancing AI techniques in Computer Vision and/or Machine Learning
  • Experience manipulating and analyzing complex, large scale, high-dimensionality data from varying sources
  • Experience in utilizing theoretical and empirical research to solve problems
  • Experience working and communicating cross functionally in a team environment
  • Intent to return to degree-program after the completion of the internship/co-op

Additional Information:

Job Posted:
February 16, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Scientist Intern, Language & Multimodal Foundations

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location
Location
Canada
Salary
Salary:
55.00 USD / Hour
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
Job Responsibility
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location
Location
United States , Seattle
Salary
Salary:
49.00 - 75.00 USD / Hour
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Degree completion date cannot be earlier than September 2026 - June 2027
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
  • Exhibit a solid grasp of algorithms and data structures
  • Demonstrate proficiency in Python programming and ability to write clean, efficient, and well-documented code
  • Experience working with large-scale datasets, including data preprocessing, augmentation, and scaling techniques
  • Has expertise in managing data using Python libraries such as NumPy, Pandas, Matplotlib, in addition to leveraging models from Hugging Face and has practical knowledge of applied machine learning and deep learning frameworks, like PyTorch
  • Demonstrated exposure to natural language processing (NLP) and Computer Vision (CV)
  • Familiarity with state-of-the-art research in machine learning and AI, as evidenced by relevant coursework, publications, or projects
Job Responsibility
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

Research Scientist Intern, Real-Time Multimodal AI

Reality Labs is building the future of connection through world-class AR/VR hard...
Location
Location
United States , Burlingame
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Science, Machine Learning, Electrical Engineering, or a related field
  • 2+ years of research experience in one or more of the following areas: multimodal learning, vision-language models, large language models, or foundation model fine-tuning
  • Hands-on experience fine-tuning large foundation models (e.g., LLaVA, InternVL, Qwen-VL, LLaMA, or similar)
  • Strong programming skills in Python
  • Experience with deep learning frameworks such as PyTorch
  • Excellent communication skills and ability to work independently
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Research and develop novel approaches for fine-tuning large multimodal foundation models (vision-language, audio-visual) for real-time applications
  • Design and implement efficient inference pipelines for deploying fine-tuned models in real-time communication scenarios
  • Explore agentic architectures that leverage fine-tuned models as tools within larger AI systems
  • Collaborate with cross-functional teams to integrate models into prototype experiences
  • Document and present research progress with the goal of publishing findings at top-tier ML/CV conferences
  • Contribute to building working prototypes that demonstrate the capabilities of fine-tuned multimodal models
Read More
Arrow Right

Research Scientist Intern, AI Research - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
  • Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
  • Research and/or work experience in Machine Learning or Deep Learning with applications to perception
  • Experience in Python, C++, or other related languages
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
  • Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
  • Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
  • Contribute to ongoing research projects and impactful technology releases
  • Draft and polish research publications
  • Present research outcomes to internal and/or external audiences
Read More
Arrow Right

Research Scientist Intern, AI Research - CoreML - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
  • Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
  • Research and/or work experience in Machine Learning or Deep Learning with applications to perception
  • Experience in Python, C++, or other related languages
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
  • Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
  • Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
  • Contribute to ongoing research projects and impactful technology releases
  • Draft and polish research publications
  • Present research outcomes to internal and/or external audiences
Read More
Arrow Right

Research Scientist Intern, Embodied Foundation Models

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently pursuing a graduate degree in Computer Science, Machine Learning, Robotics, or related technical field
  • Proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc)
  • Previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • Solid software engineering fundamentals, especially in Python
  • Previously used PyTorch or a similar library for deep learning (e.g. Tensorflow, JAX)
  • Experience with multi-node distributed training of large models
  • Interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • Previous publications in conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Job Responsibility
Job Responsibility
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead a high-impact research work and publish at a top tier conference
Read More
Arrow Right

Research Scientist Intern, Embodied Foundation Models (Evaluation)

Our team is seeking a talented Applied Scientist Intern to join us for 3-6 month...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • You are currently pursuing a graduate degree in a Computer Science, Machine Learning, Robotics, or related technical field
  • You are proficient in at least one backend/systems programming language (e.g. Python, Ruby, Java, etc)
  • You have previous experience in vision-language models, large language models, natural language processing, especially around reasoning
  • You have solid software engineering fundamentals, especially in Python
  • You have previously used PyTorch or a similar library for deep learning (e.g. Tensorflow, JAX)
  • Experience with multi-node distributed training of large models
  • You are interested in using large-scale multimodal (vision, language, etc.) datasets to improve embodied AI
  • You have previous publications in the following conferences (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Job Responsibility
Job Responsibility
  • Work on foundation models for embodied AI, including large-scale pretraining, post-training, leveraging language, or improving reasoning capabilities
  • Train models on large-scale multimodal (vision, language, etc.) data efficiently in a multi-node distributed system, and evaluate their performance on open (and closed) datasets/benchmarks
  • Lead a high-impact research work and publish at a top tier conference (e.g., CVPR, ICCV, CoRL, NeurIPS, CoLM, RSS, ICRA, among others)
Read More
Arrow Right

Senior Research Engineer

As a Senior Research Engineer at Microsoft, you will advance Microsoft’s mission...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience deploying Fine Tuned LLMs or multimodal models in live production environments
  • Experience shipping and maintaining production AI systems
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Bringing State-of-the-Art Research to Products
  • Design and implement AI systems using foundation models, prompt engineering, retrieval-augmented generation, multi-agent architectures, and classic ML
  • Fine-tune large language models on domain-specific data and evaluate via offline and online methods such as A/B testing, telemetry, and shadow deployments
  • Build and harden prototypes into production-ready services using robust software engineering and MLOps practices
  • Drive original research and thought leadership (whitepapers, internal notes, patents)
  • convert insights into shipped capabilities
  • Research Translation: Continuously review emerging work
  • identify high-potential methods and adapt them to Microsoft problem spaces
  • End-to-End System Development
  • ML Design & Architecture: Own end-to-end pipeline from data prep, training, evaluation, deployment, and feedback loops
  • Fulltime
Read More
Arrow Right