CrawlJobs Logo

Research Scientist Intern, PyTorch Framework Performance

meta.com Logo

Meta

Location Icon

Location:
United States , Bellevue

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

7650.00 - 12134.00 USD / Month

Job Description:

Our team’s mission is to make PyTorch models high-performing, deterministic and stable, via a robust foundational framework that supports the latest hardware, without sacrificing the flexibility and ease of use of PyTorch. We are seeking a PhD Research Intern to work on next-generation Mixture-of-Experts (MoE) systems for PyTorch, focused on substantially improving end-to-end training and inference throughput on modern accelerators (e.g., NVIDIA Hopper and beyond). This internship will explore novel combinations of communication-aware distributed training and kernel- and IO-aware execution optimizations (inspired bySonicMoE and related works) to unlock new performance regimes for large-scale sparse models. The project spans systems research, GPU kernel optimization, and framework optimization, with opportunities for open-source contributions and publication.

Job Responsibility:

  • Design and evaluate communication-aware, kernel-aware, and quantization-aware MoE execution strategies, combining ideas such as expert placement, routing, batching, scheduling, and precision selection
  • Develop and optimize GPU kernels and runtime components for MoE workloads, including fused kernels, grouped GEMMs, memory-efficient forward and backward passes
  • Explore quantization techniques (e.g., MXFP8, FP8) in the context of MoE, balancing accuracy, performance, and hardware efficiency
  • Build performance models and benchmarks to analyze compute, memory, communication, and quantization overheads across different sparsity regimes
  • Run experiments on single-node and multi-node GPU systems
  • Collaborate with the open-source community to gather feedback and iterate on the project
  • Contribute to PyTorch (Core, Compile, Distributed) within the scope of the project
  • Improve PyTorch performance in general

Requirements:

  • Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science or a related STEM field
  • Deep knowledge of transformer architectures, including attention, feed-forward layers, and Mixture-of-Experts (MoE) models
  • Strong background in ML systems research, with domain knowledge in MoE efficiency, such as routing, expert parallelism, communication overheads, and kernel-level optimizations
  • Hands-on experience writing GPU kernels using CUDA and/or cuteDSL
  • Working knowledge of quantization techniques and their impact on performance and accuracy
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment

Nice to have:

  • Experience working on other ML compiler stack, especially on PT2 stack
  • Familiarity with distributed training and inference, such as data parallelism and collective communication
  • Ability to independently design experiments, analyze complex performance tradeoffs, and clearly communicate technical findings in writing and presentations
  • Intent to return to degree program after the completion of the internship/co-op
  • Proven track record of achieving significant results as demonstrated by grants, fellowships, patents, as well as first-authored publications at leading workshops or conferences such as NeurIPS, MLSys, ASPLOS, PLDI, CGO, PACT, ICML, or similar
  • Experience working and communicating cross functionally in a team environment

Additional Information:

Job Posted:
February 04, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Research Scientist Intern, PyTorch Framework Performance

PhD AI Research Intern

Join our cutting-edge Machine Learning Research team at Atlassian as a PhD Resea...
Location
Location
United States , Seattle
Salary
Salary:
49.00 - 75.00 USD / Hour
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Completed Bachelors degree in Computer Science or a related field
  • Currently pursuing a PhD in Computer Science or a related field at any stage of your doctoral studies
  • Degree completion date cannot be earlier than September 2026 - June 2027
  • Strong foundation in AI/ML, LLMs, modeling and/or optimization techniques
  • Exhibit a solid grasp of algorithms and data structures
  • Demonstrate proficiency in Python programming and ability to write clean, efficient, and well-documented code
  • Experience working with large-scale datasets, including data preprocessing, augmentation, and scaling techniques
  • Has expertise in managing data using Python libraries such as NumPy, Pandas, Matplotlib, in addition to leveraging models from Hugging Face and has practical knowledge of applied machine learning and deep learning frameworks, like PyTorch
  • Demonstrated exposure to natural language processing (NLP) and Computer Vision (CV)
  • Familiarity with state-of-the-art research in machine learning and AI, as evidenced by relevant coursework, publications, or projects
Job Responsibility
Job Responsibility
  • Collaborate cross-functionally with Research Scientists and Machine Learning Engineers to design, implement, and evaluate experiments that advance the performance, efficiency, and scalability of modern ML and LLM systems for our AI products
  • Curate, preprocess, and manage large-scale datasets for training and evaluation, ensuring data quality, diversity, and reproducibility across experiments
  • Conduct continued training, fine-tuning, and alignment of large language models for specialized applications such as conversational AI, summarization, generative search, and multimodal agents
  • Evaluate cutting-edge ML algorithms through rigorous experimentation and provide detailed analyses highlighting performance insights, failure modes, and opportunities for improvement
  • Contribute to publications and presentations at internal workshops or top-tier academic venues, helping to drive innovation in Enterprise AI and large-scale ML systems
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

Machine Learning Research Scientist

This role focuses on cutting-edge research and development in Artificial Intelli...
Location
Location
United States , Milpitas
Salary
Salary:
117500.00 - 270000.00 USD / Year
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD in Computer Science, Electrical Engineering, or related fields focusing on Machine Learning for the dissertation
  • extensive experience in deep learning research, preferably in Large Language Models or Reinforcement Learning
  • experience developing applications with deep learning frameworks like PyTorch with a high software proficiency
  • strong programming skills in Python, data structures, and algorithms are required
  • experience with ML model optimization, GPU acceleration, heterogeneous computation, system software, and performance optimization desired
  • experience in Python Web Frameworks – Django, Flask - a plus but not required.
Job Responsibility
Job Responsibility
  • conducting research, developing solutions, and creating intellectual property in emerging fields like reinforcement learning, LLMs, digital twins, clean energy, data center optimization, and sustainability
  • developing advanced technologies for analysis, optimization, time series forecasting, uncertainty quantification, and control
  • providing thought leadership, collaborating internally and externally, and contributing to HPE’s strategy by identifying emerging technologies
  • publishing in top conferences like NeurIPS, AAAI, and ACL
  • developing patent applications
  • software development, GPU acceleration, model optimization, and real-time data streaming to create robust AI solutions for real-world use cases.
What we offer
What we offer
  • a competitive salary and extensive social benefits
  • diverse and dynamic work environment
  • work-life balance and support for career development
  • health and wellbeing programs
  • personal and professional development programs
  • diversity, inclusion, and belonging initiatives.
  • Fulltime
Read More
Arrow Right
New

Research Scientist Intern, PyTorch Compiler

Our team makes PyTorch run faster and more resource-efficient without sacrificin...
Location
Location
United States , Menlo Park
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a PhD degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience in ML compiler, Distributed Training, ML systems, or similar
  • Proficient in Python or Cuda programming
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Develop new techniques in TorchDynamo, TorchInductor, PyTorch core, PyTorch Distributed
  • Explore the intersection of PyTorch compiler and PyTorch Distributed
  • Optimize Generative AI models across the stack (pre-training, fine-tuning, and inference)
  • Improve general PyTorch performance
  • Conduct cutting-edge research on ML compiler and ML distributed technologies
  • Collaborate with users of PyTorch to enable new use cases for the framework both inside and outside Meta
Read More
Arrow Right

Research Scientist Intern, Smart Glasses in Wearables AI

The Wearables AI team at Meta works to advance the field of artificial intellige...
Location
Location
United States , Sunnyvale
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Multi-modal Systems, Computer Vision, Natural Language Processing, Speech Recognition, Audio Processing, Conversational AI, or other relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience with Python, C++, C, Java or other related languages
  • Experience building systems with deep learning frameworks such as Pytorch or Tensorflow
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of intelligent machines
  • Develop novel and accurate NLP algorithms and systems, leveraging Deep Learning and Machine Learning on big data resources
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Scientist Intern, Voice Modeling Team

We are looking for Research Scientist Interns to join the Meta AI Speech team in...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Artificial Intelligence, Natural Language Processing, Speech Recognition, Sentiment Analysis, or relevant technical field
  • Must obtain work authorization in country of employment at the time of hire and maintain ongoing work authorization during employment
  • Experience in C/C++ and Python
  • Experience with deep learning frameworks such as Pytorch or Tensorflow
  • Research and/or work experience in machine learning, deep learning, and/or speech technology
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of intelligent machines
  • Develop novel and accurate speech algorithms and systems, leveraging Deep Learning and Machine Learning on big data resources
  • Analyze and improve efficiency, scalability, and stability of various deployed systems
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Scientist Intern PhD, Applied Research

Meta is seeking Research Interns to join our Products and Applied Research team....
Location
Location
United States , Bellevue
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has or is in the process of obtaining a Ph.D. degree in Computer Science, Computer Vision, Audio Processing, Artificial Intelligence, Generative AI, or relevant technical field
  • Must obtain work authorization in the country of employment at the time of hire and maintain ongoing work authorization during employment
  • Research experience in machine learning, deep learning, computer vision and/or natural language processing
  • Experience with Python, C++, C, Java or other related languages
  • Experience with deep learning frameworks such as Pytorch or Tensorflow
Job Responsibility
Job Responsibility
  • Develop novel state-of-the-art generative AI algorithms and corresponding systems, leveraging various deep learning techniques
  • Help analyze and improve safety and robustness of corresponding deployed algorithms based on the project
  • Perform research to advance the science and technology of intelligent machines
  • Collaborate with researchers and cross-functional partners including communicating research plans, progress, and results
  • Disseminate research results
  • Publish research results and contribute to research that can be applied to Meta product development
Read More
Arrow Right

Research Scientist Intern, AI Research - Speech & Audio

Meta AI is currently seeking Research Scientist interns. Our team creates spoken...
Location
Location
United States , Bellevue
Salary
Salary:
7650.00 - 12134.00 USD / Month
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently has, or is in the process of obtaining a PhD degree in the field of Computer Science, Artificial Intelligence, Natural Language Processing, or related field
  • Research and/or work experience in machine learning, deep learning, and speech technology
  • Experience with training deep neural networks for key speech tasks such as speech recognition, speech translation, speech synthesis, speaker diarization, sentiment analysis, acoustic event recognition, wake word, scene understanding, etc
  • Experience in Python
  • Experience in deep learning frameworks (PyTorch, Tensorflow, etc)
  • Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment
Job Responsibility
Job Responsibility
  • Perform research to advance the science and technology of intelligent machines
  • Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources
  • Contribute research that can be applied to Meta product development
  • Analyze and improve efficiency, scalability, and stability of various deployed systems
  • Collaborate with team members from prototyping to production
Read More
Arrow Right

Principal Applied Researcher AI/NLP

At PointClickCare our mission is simple: to help providers deliver exceptional c...
Location
Location
United States
Salary
Salary:
195800.00 - 217500.00 USD / Year
pointclickcare.com Logo
PointClickCare
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD or comparable level of experience in Computer Science, Math, Physics, Engineering or a related field
  • 4-10+ year industry experience building solutions in commercial SaaS, including at least 4 years working in applications of NLP, Search or AI/ML technologies for healthcare
  • Strong interest in applying AI/ML/NLP to healthcare related problems and data
  • Expert-level practical, hands-on experience developing and applying a wide range of techniques in Natural Language Processing, including fine tuning of LLMs and other Transformer models, plus one or more additional AI/ML or Search related areas of expertise to solve real-world problems at scale
  • Demonstrated ability to lead and perform research and experimentation to select appropriate approaches, algorithms, evaluation methods, and frameworks, as well as tasks such as feature selection, language modeling, evaluation and fine tuning or training models, applying standard approaches or developing new tools or workflows as needed to meet project requirements
  • Significant experience building and deploying AI/machine learning and NLP models for large-scale SaaS products, including familiarity with industry standard software development concepts such as scaling issues, version control, CI/CD pipelines, and security
  • Solid understanding and experience with transformer models and multiple kinds of NLP and ML models and approaches including logistic regression, random forest, ensemble methods, SVM, KNN, reinforcement learning, and other ML techniques
  • Proficiency in Python and Java required. Proficiency in JavaScript or TypeScript and modern UI frameworks for building prototype or tool front ends desired
  • Proficiency doing data engineering for ML and NLP applications, including exposure to database systems and proficiency with SQL
  • Proficiency building models from big data using modern packages, models and data analysis stacks such as NumPy, SciPy, Pandas, Scikit-learn, PyTorch, Keras, LightGBM, fastText, NLTK, and spaCy. Proficiency fine tuning Hugging Face Transformers required
Job Responsibility
Job Responsibility
  • You will be applying NLP including GenAI and other AI/ML techniques to develop model systems and solutions, collaborating across functions to scale and integrate advanced solutions into successful end user experiences in large-scale cloud based SaaS production environments for healthcare
  • You will be working with product leaders, clinical informaticists, data scientists, UI/UX researchers and designers, other AI and machine learning and domain experts, engineering teams and others, including work with customers and users who are healthcare professionals
  • Design, build and evaluate solutions that may involve structured or unstructured data including speech or natural language for healthcare use cases, delivering capabilities such as summarization, predictive models, recommenders, semantic search, extraction, classification or other NLP, AI or machine learning based techniques
  • You will be performing research and experimentation to select appropriate approaches, algorithms, evaluation methods and frameworks and doing the R&D to deliver model systems
  • You will perform, oversee and assist in data collection, data cleaning, data analysis, algorithm selection or design, prompt tuning, parameter fine tuning, training, development and evaluation of systems that deliver responsible AI solutions at scale, using existing or developing new tools or workflows as needed
  • As a principal applied researcher, you will bring deep technical expertise and also provide mentorship on advanced AI, NLP, data science, statistical and machine learning methods and technologies, helping the organization develop new capabilities for innovative solutions
  • You will have substantial independence and responsibility from day one
What we offer
What we offer
  • Benefits starting from Day 1
  • Retirement Plan Matching
  • Flexible Paid Time Off
  • Wellness Support Programs and Resources
  • Parental & Caregiver Leaves
  • Fertility & Adoption Support
  • Continuous Development Support Program
  • Employee Assistance Program
  • Allyship and Inclusion Communities
  • Employee Recognition … and more
  • Fulltime
Read More
Arrow Right