Member of Technical Staff, Multimodal Infrastructure

United States, Mountain View · 139900.00 - 274800.00 USD / Year · Job Posted March 25, 2026

Job Description

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructure to help build the next wave of capabilities of our personalized AI assistant, Copilot. We’re looking for someone who will bring an abundance of positive energy, empathy, and kindness to the team every day, in addition to being highly effective. The right candidate enjoys building world-class consumer experiences and products in a fast-paced environment.

You will actively contribute to the development of the AI models that power our innovative products. You will wear multiple hats and work on engineering, research, and everything in between. Your contributions will span model architecture, data curation, training and inference infrastructure, evaluation protocols, alignment and reinforcement learning from human feedback (RLHF), and many other exciting topics at the cutting edge of AI.

Microsoft AI is building foundational models to develop novel, responsible, and efficient artificial general intelligence. These foundational models require large compute capacity, and as a Member of Technical Staff, Multimodal Infrastructure you will be responsible for building large-scale infrastructure to support the full cycle of multimodal generative model development in our innovative products. You will work closely with research scientists and product engineers on multimodal data processing, model training, inference, and serving tasks. As a contributing member of the core group of engineers, you will also bring best practices that drive architectural changes and influence the roadmap of relevant software and hardware components. Your work will directly impact the business goals of a wide range of users and facilitate the next wave of growth and innovation in AI.

Our newly formed organization, Microsoft AI, is dedicated to advancing Copilot and other consumer AI products and research. The team is responsible for Copilot, Bing, Edge, and generative AI research. Come be a part of the team shaping the future of personal computing.

Job Responsibility

  • Design, develop and maintain large-scale multimodal data processing pipelines
  • Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
  • Design, develop and maintain large-scale multimodal model inference and serving frameworks
  • Work with research scientists and product engineers to solve infra-related problems
  • Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
  • Enjoy working in a fast-paced, design-driven, product development cycle
  • Embody our Culture and Values

Requirements

  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience in multi-modal data processing: Strong proficiency in distributed data processing infrastructure (resource utilization management, fault tolerance, Ray and Spark) and CPU/GPU batch processing optimizations
  • Experience with state-of-the-art model inference and serving frameworks
  • Experience with image/video/audio data processing
  • Experience with common data formats for efficient I/O
  • Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron, and DeepSpeed
  • Knowledge of auto-regressive and diffusion transformer models
  • Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
  • Proven experience in at least one of the following areas: image/video generation and editing; efficient architectures (e.g., MoE, window attention); efficient model design; or reinforcement learning training methods (e.g., RLHF, DPO, GRPO)
  • Experience in multi-modal inference and serving: Strong proficiency in serving frameworks such as vLLM, TensorRT-LLM, SGLang, xDiT, and Cache-DiT
  • Knowledge of distillation techniques such as Progressive Distillation, DMD, and Self Forcing
  • Knowledge of quantization and compression techniques like AWQ, GPTQ, and FP8 for multi-modal pipelines
  • Experience in distributed inference scaling across multi-node clusters using Ray Serve and Triton
  • Experience in leading technical projects and supporting architectural decisions with data

Nice to have

  • Bachelor's Degree in Computer Science or related technical field AND 10+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience

Similar Jobs for Member of Technical Staff, Multimodal Infrastructure

8 matching positions

Member of Technical Staff - Multimodal

Join Microsoft AI in building one of the world’s most advanced foundation models...
Location: United States, Mountain View
Salary: 119800.00 - 234700.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in AI, Computer Science, Data Science, Statistics, Physics, Engineering, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Proven expertise, demonstrated through impactful publications or technical leadership on high-scale projects
  • Strong analytical skills, attention to detail, and a data-driven approach to decision-making
  • Experience with large-scale distributed systems and scalable architectures
  • Ability to thrive in fast-paced, collaborative environments and embrace innovation
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
Employment Type: Full-time

Senior Member of Technical Staff, Multimodal AI

At Cohere, we believe in the power of multimodal AI to revolutionise the way we ...
Salary: Not provided
Company: Cohere
Expiration Date: Until further notice
Requirements
  • Exceptional software engineering skills with a proven track record of building robust and scalable systems
  • Strong command of Python and well-versed in popular deep learning frameworks like JAX, PyTorch, and TensorFlow, with an understanding of their multimodal capabilities
  • Knowledge of distributed training strategies, especially for large-scale multimodal models
  • Familiarity with autoregressive models, particularly their application in multimodal tasks such as image or video captioning, speech-to-text generation
Job Responsibility
  • Design and develop cutting-edge multimodal AI systems, integrating various modalities such as text, speech, and vision
  • Conduct research and experiments on our advanced compute infrastructure, exploring novel ideas in multimodal representation learning, transfer learning, and more
  • Collaborate closely with our world-class teams, learning from and contributing to their expertise in the field
What we offer
  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
Employment Type: Full-time

Member of Technical Staff, AI Multimodal - MAI Superintelligence Team

At Microsoft AI, we are on a mission to train the world’s most capable AI fronti...
Location: Switzerland, Zürich
Salary: Not provided
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Expertise in multimodal research with a strong publishing track record
  • Proven expertise in areas of interest, evidenced by an exceptional publication track record and/or significant technical leadership in high-impact projects
  • Strong analytical skills, attention to detail, and a commitment to data-driven decision-making
  • Experience with and/or in-depth understanding of large-scale distributed systems
  • Ability to work collaboratively in a fast-paced, innovative environment
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
Employment Type: Full-time

Member of Technical Staff, AI Multimodal

At Microsoft AI, we are on a mission to train the world’s most capable AI fronti...
Location: United Kingdom, London
Salary: Not provided
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
  • OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work
  • OR equivalent experience
  • Expertise in multimodal research with a strong publishing track record
  • Proven expertise in areas of interest, evidenced by an exceptional publication track record and/or significant technical leadership in high-impact projects
  • Strong analytical skills, attention to detail, and a commitment to data-driven decision-making
  • Experience with and/or in-depth understanding of large-scale distributed systems
  • Ability to work collaboratively in a fast-paced, innovative environment
Job Responsibility
  • Develop algorithms, design model architectures, conduct experiments, champion measurement and evaluation, innovate datasets and data pipelines
  • Improve training and deployment efficiency, paying careful attention to detail, persevering, and learning from everyone’s attempts whether successful or not
  • Follow a rigorous data-driven approach grounded in meticulous ablation studies and scientific analysis
  • Innovate and iterate over ideas, prototypes, and product
  • Collaborate closely with teams on infrastructure, data engineering, pre-training, post-training, and product feedback
  • Advance the AI frontier responsibly
  • Embody our culture and values
Employment Type: Full-time

Member of Technical Staff, Data Infrastructure

We're looking for a Data Engineer to build and scale the data infrastructure tha...
Salary: 240000.00 - 290000.00 USD / Year
Company: Runway
Expiration Date: Until further notice
Requirements
  • 4+ years of industry experience in data engineering
  • Strong knowledge of Python
  • Experience with data quality, deduplication, and cleaning at scale
  • Comfortable working with cloud storage (S3) and managing large datasets
  • Experience building and maintaining ETL/CDC pipelines at scale
  • Strong SQL skills and experience with multiple database systems (Postgres, columnar databases like ClickHouse/Redshift)
  • Humility and open-mindedness
Job Responsibility
  • Build and own pipelines for the creation, curation, and processing of large-scale multimodal datasets, including vector database (LanceDB) management and query optimization for ML metadata
  • Build and own ETL and CDC streams from Postgres and ClickHouse to analytics warehouses
  • Build standardized data transformation layers using dbt to replace ad-hoc SQL queries and create maintainable data models for business analytics
  • Manage production databases (Postgres, ClickHouse) and optimize for performance and reliability

Member of Technical Staff, AI Training Infrastructure

As a Training Infrastructure Engineer, you'll design, build, and optimize the in...
Location: United States, San Mateo
Salary: 175000.00 - 220000.00 USD / Year
Company: Fireworks AI
Expiration Date: Until further notice
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience
  • 3+ years of experience with distributed systems and ML infrastructure
  • Experience with PyTorch
  • Proficiency in cloud platforms (AWS, GCP, Azure)
  • Experience with containerization and orchestration (Docker, Kubernetes)
  • Knowledge of distributed training techniques (data parallelism, model parallelism, FSDP)
Job Responsibility
  • Design and implement scalable infrastructure for large-scale model training workloads
  • Develop and maintain distributed training pipelines for LLMs and multimodal models
  • Optimize training performance across multiple GPUs, nodes, and data centers
  • Implement monitoring, logging, and debugging tools for training operations
  • Architect and maintain data storage solutions for large-scale training datasets
  • Automate infrastructure provisioning, scaling, and orchestration for model training
  • Collaborate with researchers to implement and optimize training methodologies
  • Analyze and improve efficiency, scalability, and cost-effectiveness of training systems
  • Troubleshoot complex performance issues in distributed training environments
What we offer
  • Meaningful equity in a fast-growing startup
  • Comprehensive benefits package
Employment Type: Full-time

Member of Technical Staff, Document Understanding

We are seeking exceptional AI engineers to join our core document understanding ...
Location: United States, San Francisco
Salary: Not provided
Company: LlamaIndex
Expiration Date: Until further notice
Requirements
  • 3-7 years of experience in machine learning engineering or applied research
  • Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
  • Hands-on experience training, fine-tuning, or deploying ML models in production
  • Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
  • Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
  • Ability to read and implement from research papers and technical specifications
  • Track record of executing with high intensity in fast-paced environments
  • Strong technical communication skills and comfort with open-source collaboration
Job Responsibility
  • Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
  • Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
  • Design and implement production ML systems that handle complex, real-world documents at scale
  • Stay current with latest advances in vision-language models, document AI, and multimodal learning
  • Collaborate with engineering teams to integrate ML innovations into production APIs
  • Contribute to both our open-source frameworks and enterprise offerings
  • Drive technical decisions while balancing research exploration with product delivery
What we offer
  • Competitive base salary and equity compensation
  • Comprehensive medical/dental/vision coverage for you and your family
  • Unlimited paid time off policy
  • Daily catered lunch and snacks in the San Francisco office
  • Budget for conferences, research materials, and professional development
  • Access to cutting-edge compute resources and research tools
Employment Type: Full-time

Member of Technical Staff

The Microsoft AI Super Intelligence Post-Training team is dedicated to advancing...
Location: India, Bangalore
Salary: Not provided
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
  • 5+ years of professional experience, including 2+ years with Python and ML frameworks such as PyTorch or TensorFlow
  • Hands-on experience with training or fine-tuning LLMs or multimodal models
  • Familiarity with production ML systems and concepts like model serving, caching, batching, and monitoring
  • Understanding of distributed systems and cloud-based infrastructure
Job Responsibility
  • Implement large-scale model training, especially with LLMs, SLMs, multimodal, or code-specific models
  • Develop robust evaluation frameworks to assess model performance, conduct systematic benchmarking, and address identified weaknesses while ensuring compliance with customer standards
  • Write efficient, production-quality code and debug complex distributed systems
  • Build and maintain internal tools to streamline training and evaluation workflows and automate repetitive tasks within secure development environments
Employment Type: Full-time