CrawlJobs Logo

Research Intern - Multimodal Language Models

United States, Redmond 6710.00 - 13270.00 USD / Month · Job Posted April 23, 2026
Apply Position
Job Link Share

Job Description

We are seeking a Research Intern to explore innovative approaches for building efficient multimodal language models. The role will focus on techniques such as model compression, quantization, and model optimization for efficient deployment on resource-constrained platforms. You will work on training strategies to enhance performance and scalability across vision-language tasks.

Job Responsibility

  • Prototype implementations
  • designing experiments
  • analyzing results
  • contributing to research that pushes the boundaries of efficiency in AI systems

Requirements

  • Accepted or currently enrolled in a PhD program in Computer Science or related STEM field
  • Foundation in machine learning and deep learning, with expertise in areas such as multimodal language models, transformer architecture, efficient model design, compression, and quantization
  • Proficiency in modern deep learning frameworks (e.g., PyTorch, DeepSpeed) for scalable model development and optimization
  • Proven ability to define and execute original research agendas, demonstrating creativity and technical rigor
  • Motivation to publish in top-tier academic venues, showcasing impactful contributions to the research community.

Nice to have

Familiarity with multimodal architectures and low-bit quantization

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Research Intern - Multimodal Language Models

8 matching positions

Research scientist intern, language and multimodal research

Meta is seeking Research Interns to join our Meta Superintelligence Lab in the p...
Location
Location
United States , Bellevue
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Generative AI, or relevant technical field
  • Experience with Python, C++, C, Java or other related languages
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Develop novel state-of-the-art generative AI algorithms and corresponding systems, leveraging various deep learning techniques
  • Based on the project, help analyze and improve efficiency, scalability, and stability of corresponding deployed algorithms
  • Perform research to advance the science and technology of intelligent machines
  • Perform research that enables learning the semantics of and training generative models of data (images, video, 3D, text, audio, and other modalities)
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Disseminate research results
  • When applicable, contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Intern - Memory & Orchestration in Large Language Models

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Machine Learning, Artificial Intelligence, or a related STEM field
  • Experience with Python ecosystem
  • Experience with LLMs and prompt engineering
  • Experience with agentic systems and workflows
Job Responsibility
Job Responsibility
  • Conducting hands-on research into systems for memory and orchestration of LLMs and multimodal models
  • Investigating new embedding techniques, including graph embeddings and methods that measure changes over time
  • Developing advanced retrieval augmented generation systems to enhance LLM capabilities
  • Collaborating with interdisciplinary teams of researchers and engineers on challenging and impactful projects
  • Presenting research findings and participating in research discussions
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, AI Research - Multimodal Pretraining

Meta is seeking Research Scientist Interns in the multimodal pretraining team in...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Machine Learning, Computer Vision, Artificial Intelligence, or relevant technical field
  • Past projects/publications in the general domain of neural scaling laws, model architectures, image/text modeling, vision-language modeling
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in PyTorch, Triton, or other related programming languages
  • Experience building systems based on machine learning and/or deep learning methods
Job Responsibility
Job Responsibility
  • Perform research to advance the frontiers of multimodal (images, video, text, audio, and other modalities) pretraining, to develop the next generation of multimodal architectures
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Scientist Intern, AI Research - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
  • Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
  • Research and/or work experience in Machine Learning or Deep Learning with applications to perception
  • Experience in Python, C++, or other related languages
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
  • Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
  • Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
  • Contribute to ongoing research projects and impactful technology releases
  • Draft and polish research publications
  • Present research outcomes to internal and/or external audiences
Read More
Arrow Right

Research Scientist Intern, AI Research - CoreML - World Models

Meta is seeking Research Interns to join the SAM team in the Multimedia Percepti...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in Computer Vision, Machine Learning, Artificial Intelligence, or relevant technical field
  • Research and/or work experience in Generative Modeling and Computer Vision. In particular: video generation, 3D/4D reconstruction, video and image understanding, vision-language foundation models, representation learning, and related areas
  • Research and/or work experience in Machine Learning or Deep Learning with applications to perception
  • Experience in Python, C++, or other related languages
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of generative AI
  • Perform research that enables learning to predict and condition on multimodal data (video, 3D structures, primarily images, text, and other modalities like audio)
  • Brainstorm with research mentors, review literature and existing solutions of a challenging real-world research problem
  • Develop novel solutions, implement prototypes, and perform extensive experiments to test the proposed solutions in meaningful benchmarks and metrics, analyze the results and verify the conclusions
  • Contribute to ongoing research projects and impactful technology releases
  • Draft and polish research publications
  • Present research outcomes to internal and/or external audiences
Read More
Arrow Right

Research Scientist Intern, Multimodal AI (PhD)

The Meta Reality Labs Research Team brings together a world-class team of resear...
Location
Location
United States , Redmond
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Artificial Intelligence, Generative AI, Transformer Models, Machine Learning, Signal Processing or Computer vision
  • 3+ years experience with Python, Matlab, or similar
  • 3+ years experience with machine learning software platforms such as PyTorch, TensorFlow, etc
  • Experience building novel audio computational models and LLM
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Design, implement, and maintain comprehensive evaluation protocols for large language models, including both automated and human-in-the-loop assessments
  • Develop and curate high-quality datasets and benchmarks to measure model performance, safety, fairness, and robustness across a variety of tasks and modalities
  • Analyze model outputs to identify strengths, weaknesses, and failure modes, and provide actionable insights to research and engineering teams
  • Design and implementation of novel algorithms to solve audio research problems
  • Collaboration with teams building Meta’s language AI products
  • Collaborate with researchers, engineers, and cross-functional partners to define evaluation goals, communicate findings, and drive improvements in model quality
  • Develop tools and infrastructure to streamline and scale evaluation processes, including dashboards, annotation platforms, and reporting systems
  • Stay up-to-date with the latest research in audio LLM evaluation, benchmarking, and responsible AI, and incorporate best practices into Meta’s workflows
  • Disseminate evaluation results through internal reports, presentations, and, when appropriate, external publications
What we offer
What we offer
  • benefits
  • Fulltime
Read More
Arrow Right

Research Scientist Intern, Multimodal Contextual AI (PhD)

At Reality Labs, our team brings novel experiences to life on Meta’s AR devices....
Location
Location
United States , Redmond
Salary
Salary:
7313.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD in Computer Science, Electrical Engineering, or a related field
  • Programming and simulation experience with languages such as C/C++ and Python
  • Experience with computer architecture and HW/SW co-design and co-optimization
  • Must obtain work authorization in the country of employment at the time of hire, and maintain on-going work authorization during employment
Job Responsibility
Job Responsibility
  • Build and characterize experimental HW+SW systems on AR devices and device prototypes
  • Develop embedded firmware and software in RTOS and mobile operating systems, e.g. AOSP
  • Collaborate with other researchers and engineers across various disciplines
What we offer
What we offer
  • Benefits
  • Fulltime
Read More
Arrow Right

Research Intern - Foundation Models and Agentic Systems

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Redmond
Salary
Salary:
5610.00 - 11010.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a MS or PhD program in Computer Science or a related STEM field
  • Submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
  • Demonstrated ability to develop original research agendas
  • Ability to collaborate effectively with other researchers and product development teams
  • Proficient interpersonal skills, cross-group, and cross-culture collaboration
  • Ability to think unconventionally to derive creative and innovative solutions
Job Responsibility
Job Responsibility
  • Develop, improve, and explore the capabilities of LLMs and Multimodal AI models
  • Contribute to efforts on the advancement of Generative AI and Large Language Model Technologies
  • Collaborate with other Research Interns and researchers
  • Present findings
  • Contribute to the vibrant life of the community
What we offer
What we offer
  • Certain roles may be eligible for benefits and other compensation
Read More
Arrow Right