Member of Technical Staff

Member of Technical Staff, Multimodal Infrastructure

Microsoft AI is looking for a Member of Technical Staff, Multimodal Infrastructu...

Location

United States , Mountain View

Salary:

139900.00 - 274800.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
Experience in multi-modal data processing: Strong proficiency in distributed data processing infra (resource utilization management, fault tolerance, ray & spark) and CPU/GPU batch processing optimizations
Experience with state-of-art model inference and serving frameworks
Experience with image/video/audio data processing
Experience with common data formats for efficient I/O
Experience in multi-modal pretraining and post-training: Strong proficiency in deep learning frameworks such as PyTorch, Megatron and Deepspeed
Knowledge of auto-regressive and diffusion transformer models
Experience with distributed training techniques such as data parallelism, model parallelism, and pipeline parallelism
Proven experiences in at least one of the following areas: image/video generation and editing
efficient architectures (e.g., MoE, window attention)

Job Responsibility

Design, develop and maintain large-scale multimodal data processing pipelines
Design, develop and maintain large-scale multimodal model pretraining and post-training frameworks
Design, develop and maintain large-scale multimodal model inference and serving frameworks
Work with research scientists and product engineers to solve infra-related problems
Find a path to get things done despite roadblocks to get your work into the hands of users quickly and iteratively
Enjoy working in a fast-paced, design-driven, product development cycle
Embody our Culture and Values

Fulltime

Senior Member of Technical Staff, Multimodal AI

At Cohere, we believe in the power of multimodal AI to revolutionise the way we ...

Location

Salary:

Not provided

Cohere

Expiration Date

Until further notice

Requirements

Exceptional software engineering skills with a proven track record of building robust and scalable systems
Strong command of Python and well-versed in popular deep learning frameworks like JAX, PyTorch, and TensorFlow, with an understanding of their multimodal capabilities
Knowledge of distributed training strategies, especially for large-scale multimodal models
Familiarity with autoregressive models, particularly their application in multimodal tasks such as image or video captioning, speech-to-text generation

Job Responsibility

Design and develop cutting-edge multimodal AI systems, integrating various modalities such as text, speech, and vision
Conduct research and experiments on our advanced compute infrastructure, exploring novel ideas in multimodal representation learning, transfer learning, and more
Collaborate closely with our world-class teams, learning from and contributing to their expertise in the field

What we offer

An open and inclusive culture and work environment
Work closely with a team on the cutting edge of AI research
Weekly lunch stipend, in-office lunches & snacks
Full health and dental benefits, including a separate budget to take care of your mental health
100% Parental Leave top-up for up to 6 months
Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
6 weeks of vacation (30 working days!)

Fulltime

Member of Technical Staff, Microsoft Robotics (Human-Robot-AI Interaction)

We are hiring a Member of Technical Staff, Microsoft Robotics (Human-Robot-AI In...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Industrial Design, Product Design, Human Computer Interaction, User Experience, Interaction Design, or related field AND 5+ years experience working in product or service design
OR Master's Degree in Industrial Design, Product Design, Human Computer Interaction, User Experience, Interaction Design, or related field AND 4+ years experience working in product or service design
OR equivalent experience (e.g., demonstrated experience working in product or service design or using design thinking to solve problems)
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Job Responsibility

Design end-to-end human-robot interaction (HRI) and human-AI teaming experiences, including natural language task instruction interfaces, shared autonomy control panels, intent and status communication displays, and trust calibration mechanisms
Create wireframes, journey maps, user jobs and scenarios, task flows, and personas for diverse users who interact with robots, ranging from factory floor operators and warehouse associates to lab researchers and first responders
Develop interaction models for multimodal communication between humans and robots, including voice, gesture, gaze, spatial input (AR/MR), touchscreen and future multi-sensory paradigms, ensuring seamless transitions between modalities
Produce visual designs from concept to delivery for robot status communication (e.g., intent signaling, confidence indicators, uncertainty visualization), user-facing explainability surfaces, and collaborative task-handoff interfaces
Build and validate interactive prototypes, including mixed-modality and immersive experiences, that demonstrate human-robot-agent collaboration concepts to stakeholders and customers
Collaborate with program managers, software engineers, robotics researchers, and AI safety experts to define and iterate on HRI product features, ensuring safety, transparency, and appropriate trust calibration are embedded from the outset
Contribute to and lead development of a robotics-specific design system for HRI patterns (e.g., robot communication modalities, autonomy level indicators, handoff protocols, error recovery flows) aligned with Microsoft's Fluent design language
Communicate a compelling, convincing product story about human-robot-agent collaboration to leadership and cross-disciplinary audiences, adjusting narrative depending on the audience
Leverage ideation methodologies to lead design solutions for complex HRI products involving many constraints (e.g., real-time responsiveness, safety-critical contexts, diverse user expertise levels, variable robot morphologies)
Take ownership of interaction design sub-systems from concept to delivery, working closely with hardware engineers to ensure feasibility of physical interaction affordances and advocate for user experience benefits during trade-off discussions

What we offer

Benefits and other compensation
Certain roles may be eligible for benefits and other compensation

Fulltime

Member of Technical Staff, Document Understanding

We are seeking exceptional AI engineers to join our core document understanding ...

Location

United States , San Francisco

Salary:

Not provided

LlamaIndex

Expiration Date

Until further notice

Requirements

3-7 years of experience in machine learning engineering or applied research
Strong software engineering fundamentals with production Python experience (modern tooling: uv, ruff, mypy, Pydantic)
Hands-on experience training, fine-tuning, or deploying ML models in production
Deep understanding of modern ML techniques, particularly in computer vision, NLP, or multimodal learning
Experience with at least one of: data pipeline development, model training/fine-tuning, or ML infrastructure
Ability to read and implement from research papers and technical specifications
Track record of executing with high intensity in fast-paced environments
Strong technical communication skills and comfort with open-source collaboration

Job Responsibility

Develop, train, and optimize machine learning models for document structure understanding, table extraction, layout analysis, and multimodal content processing
Build robust data pipelines, evaluation frameworks, and experimentation infrastructure
Design and implement production ML systems that handle complex, real-world documents at scale
Stay current with latest advances in vision-language models, document AI, and multimodal learning
Collaborate with engineering teams to integrate ML innovations into production APIs
Contribute to both our open-source frameworks and enterprise offerings
Drive technical decisions while balancing research exploration with product delivery

What we offer

Competitive base salary and equity compensation
Comprehensive medical/dental/vision coverage for you and your family
Unlimited paid time off policy
Daily catered lunch and snacks in the San Francisco office
Budget for conferences, research materials, and professional development
Access to cutting-edge compute resources and research tools

Fulltime

The Microsoft AI Super Intelligence Post-Training team is dedicated to advancing...

Location

India , Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s or master’s degree in computer science, Engineering, or a related field, or equivalent practical experience
5+ years of professional experience, including 2+ years with Python and ML frameworks such as PyTorch or TensorFlow
Hands-on experience with training or fine-tuning LLMs or multimodal models
Familiarity with production ML systems and concepts like model serving, caching, batching, and monitoring
Understanding of distributed systems and cloud-based infrastructure

Job Responsibility

Implement large-scale model training, especially with LLMs, SLMs, multimodal, or code-specific models
Develop robust evaluation frameworks to assess model performance, conduct systematic benchmarking, and address identified weaknesses while ensuring compliance with customer standards
Write efficient, production-quality code and debug complex distributed systems
Build and maintain internal tools to streamline training and evaluation workflows and automate repetitive tasks within secure development environments

Fulltime

Member of Technical Staff - Pre Training - MAI Superintelligence Team

Help deliver one of the best foundational models in the world at Microsoft AI. A...

Location

United States , Mountain View

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Machine Learning, Mathematics, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience
Demonstrated experience in large-scale AI
Passionate about conversational AI and its deployment
Demonstrated written and verbal communication skills with the ability to work closely with cross-functional teams, including product managers, designers, and other engineers
Passion for learning new technologies and staying up to date with industry trends, best practices, and emerging technologies in AI
Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team

Job Responsibility

Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations
Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack
Collaborate closely with teams on infrastructure, data, post-training, and multimodality
Embody our culture and values

Fulltime

Member of Technical Staff, AI Data - MAI Superintelligence Team

Help build the world’s most advanced multimodal dataset at Microsoft AI. We are ...

Location

United Kingdom , London

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modelling or data engineering work
OR equivalent experience
Expertise in large scale data engineering ideally applied to AI
Expertise in Spark, Kubernetes or similar

Job Responsibility

Design and develop data pipelines that ingest enormous amounts of multi-modal training data (text, audio, images, video)
Build and maintain cutting-edge infrastructure that can store and process the petabytes of data needed to power models
Partner with the pretraining and post-training teams to improve our data recipe by rigorous and careful experimentation
Collaborate with the product team and other engineers and researchers across Microsoft AI to identify gaps in the current generation of models
Embody our culture and values

Fulltime

Member of Technical Staff

The Microsoft AI Superintelligence (MAIST) Post Training team is dedicated to ad...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Doctorate OR equivalent experience
Significant experience in large-scale model training, data curation, and hands-on coding, ideally from leading research labs
Deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
Ability to develop LLMs, SLMs, multimodal, and coding models using both proprietary and open-source frameworks
Self-driven, able to write efficient code and debug training jobs, document findings, and demonstrate a track record in these fields
Curious, adaptable problem-solver who thrives on continuous learning, embraces changing priorities, and is motivated by creating meaningful impact

Job Responsibility

Design & Evaluate Datasets – Build high-quality datasets and benchmarks for training AI models
run ablation studies to measure impact and optimize data effectiveness
Advance Model Training – Apply deep expertise in pre-training, post-training, and reinforcement learning (RL) for both language and multimodal models
Develop Data Infrastructure – Create and maintain scalable pipelines for ingestion, preprocessing, filtering, and annotation of large, complex datasets
Data Quality & Analysis – Assess real-world multimodal datasets (text, image, video, audio, code) for quality, diversity, and relevance
identify gaps and propose improvements
Tooling & Workflows – Build lightweight tools for dataset auditing, visualization, and versioning to streamline experimentation
Research & Innovation – Collaborate with cross-functional teams to push research and product boundaries, delivering models that make a real-world impact
Embody our Culture and Values

Fulltime

Select Country

Member of Technical Staff - Multimodal

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?

Member of Technical Staff - Multimodal

Member of Technical Staff, Multimodal Infrastructure

Senior Member of Technical Staff, Multimodal AI

Member of Technical Staff, Microsoft Robotics (Human-Robot-AI Interaction)

Member of Technical Staff, Document Understanding

Member of Technical Staff

Member of Technical Staff - Pre Training - MAI Superintelligence Team

Member of Technical Staff, AI Data - MAI Superintelligence Team

Member of Technical Staff

Our AI answers in your language