AI Research Lead - Multimodal & Video

Tether App

Location:

Contract Type:
Not provided

Salary:

Not provided

Job Description:

Join Tether and Shape the Future of Digital Finance. At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to seamlessly integrate reserve-backed tokens across blockchains. By harnessing the power of blockchain technology, Tether enables you to store, send, and receive digital tokens instantly, securely, and globally, all at a fraction of the cost. Transparency is the bedrock of everything we do, ensuring trust in every transaction.

Job Responsibility:

  • Lead the research, design, and development of state-of-the-art image, video, and 3D generation models, including multimodal foundation models
  • Lead high-impact, specialized projects focused on innovative text, image, audio, and video applications
  • Define and drive the technical roadmap for multimodal AI initiatives, aligning research goals with business and product objectives
  • Provide technical leadership and mentorship to teams of AI researchers and engineers, fostering innovation and skill development
  • Oversee the end-to-end lifecycle of multimodal model development, from dataset curation and model training to deployment and performance evaluation
  • Lead large-scale multi-node GPU model training, ensuring scalability, efficiency, and reproducibility of experiments
  • Collaborate closely with cross-functional teams, including product, design, and engineering, to integrate AI solutions into production systems
  • Drive applied research initiatives in image/video/3D generation, editing, animation, and other related domains
  • Monitor advancements in AI research and multimodal technologies, and incorporate novel techniques to improve model capabilities and performance
  • Contribute to the AI research community, including publications, open-source contributions, and participation in conferences
  • Establish best practices and standards for coding, model evaluation, and experimentation within the team
  • Lead and manage complex projects, ensuring timely delivery, quality outcomes, and alignment with strategic objectives
  • Communicate technical insights and updates effectively to executive leadership, stakeholders, and external collaborators
  • Promote a culture of collaboration, innovation, and excellence, maintaining high team morale and accountability

Requirements:

  • PhD, MS or equivalent experience
  • Hands-on experience building image/video/3D generation and multimodal foundation models from scratch
  • 5+ years of experience managing or leading 10+ research & engineering teams
  • Excellent communication and interpersonal skills
  • Excellent understanding of an AI-based product lifecycle
  • Hands-on experience building end-to-end multimodal foundation models on multi-node clusters with thousands of GPUs
  • Proficiency in modern deep learning and diffusion frameworks & libraries

Nice to have:

  • Demonstrated expertise in computer vision, video generation foundation models, and/or multimodal research, especially in building such models from scratch
  • Strong history of delivering innovation in the space of multimodal & video
  • Ability to develop a long-term vision and execute strategies at scale while maintaining a grasp of technical details for better decision-making
  • Experience with VP-level presentations and reporting
  • Publications at leading AI conferences such as CVPR, ICCV, ECCV, ICML, ICLR, NeurIPS, etc.

Additional Information:

Job Posted:
December 12, 2025

Work Type:
Remote work

Similar Jobs for AI Research Lead - Multimodal & Video

AI Research Lead - Multimodal & Video

Join Tether and Shape the Future of Digital Finance. At Tether, we’re not just b...
Salary: Not provided
Tether App
Expiration Date: Until further notice
Requirements:
  • PhD, MS or equivalent experience
  • Hands-on experience building image/video/3D generation and multimodal foundation models from scratch
  • 5+ years of experience managing or leading 10+ research & engineering teams
  • Excellent communication and interpersonal skills
  • Excellent understanding of an AI-based product lifecycle
  • Hands-on experience building end-to-end multimodal foundation models on multi-node clusters with thousands of GPUs
  • Proficiency in modern deep learning and diffusion frameworks & libraries
Job Responsibility:
  • Lead the research, design, and development of state-of-the-art image, video, and 3D generation models, including multimodal foundation models
  • Lead high-impact, specialized projects focused on innovative text, image, audio, and video applications
  • Define and drive the technical roadmap for multimodal AI initiatives, aligning research goals with business and product objectives
  • Provide technical leadership and mentorship to teams of AI researchers and engineers, fostering innovation and skill development
  • Oversee the end-to-end lifecycle of multimodal model development, from dataset curation and model training to deployment and performance evaluation
  • Lead large-scale multi-node GPU model training, ensuring scalability, efficiency, and reproducibility of experiments
  • Collaborate closely with cross-functional teams, including product, design, and engineering, to integrate AI solutions into production systems
  • Drive applied research initiatives in image/video/3D generation, editing, animation, and other related domains
  • Monitor advancements in AI research and multimodal technologies, and incorporate novel techniques to improve model capabilities and performance
  • Contribute to the AI research community, including publications, open-source contributions, and participation in conferences

Senior Computer Vision and Machine Learning Research Scientist

We are seeking a skilled and innovative Senior Research Scientist to join one of...
Location: United States, Seattle
Salary: 159750.00 - 234300.00 USD / Year
Axon
Expiration Date: Until further notice
Requirements:
  • PhD and 3+ years of experience in computer science or a related field, with a focus on computer vision, machine learning, artificial intelligence, or related technical fields
  • Proven track record of research in machine learning, computer vision or related fields
  • Experience driving the ML development lifecycle and leveraging state-of-the-art research to deliver high-quality models at scale
  • Strong computer vision fundamentals such as image processing, feature extraction, object detection, semantic segmentation, video analysis, or action recognition
  • Excellent problem-solving skills, analytical thinking, and the ability to work independently as well as collaboratively in a team environment
  • Proficiency in Python and frameworks such as PyTorch, TensorFlow or Keras
  • Strong communication skills and the ability to effectively present complex technical concepts to both technical and non-technical audiences
Job Responsibility:
  • Collaborate with other scientists, engineers, and product managers to build proofs of concept that shape the Axon of tomorrow
  • Lead end-to-end research efforts in advanced computer vision, machine learning and gen-AI techniques for cloud and devices from multimodal data sources, including scene understanding, action recognition and anomaly detection
  • Design and implement responsible, privacy-preserving, efficient and scalable models for inference and analysis of visual data
  • Develop performance and quality metrics for CVML models and systems, and validate their effectiveness in real-world settings
  • Optimize algorithms for performance, memory footprint, and energy efficiency to meet the requirements of resource-constrained devices
  • Stay up-to-date with the latest research and advances in CVML and translate relevant findings into shipping Axon products
  • Contribute to academic publications, technical documentation, and patent disclosures to share insights and findings with the broader community
  • Coach and mentor junior scientists
What we offer:
  • Competitive salary and 401k with employer match
  • Discretionary paid time off
  • Paid parental leave for all
  • Medical, Dental, Vision plans
  • Fitness Programs
  • Emotional & Mental Wellness support
  • Learning & Development programs
  • Snacks in our offices
  • Full-time


Tech Lead Manager - Behaviour Learning for Embodied AI

The Science organisation at Wayve advances foundational research in embodied AI ...
Location: United Kingdom, London
Salary: Not provided
Wayve
Expiration Date: Until further notice
Requirements:
  • Years of experience in applied ML/AI roles with strong hands-on contributions
  • Demonstrated track record of impactful technical work in one or more of: multimodal learning, reinforcement learning, generative models, latent action modelling, optimisation, or planning
  • Experience building large-scale ML infrastructure and working with high-dimensional temporal data (e.g., video, multi-sensor inputs)
  • Deep understanding of the end-to-end lifecycle of ML research and deployment
  • Strong Python and PyTorch engineering fundamentals, with experience developing research-grade, production-oriented tools
  • Proven ability to shape technical strategy and lead architectural design for ML systems
  • Publications at top-tier ML conferences such as NeurIPS, ICML, CoRL or ICLR
  • Clear and thoughtful communicator, capable of influencing technical direction and mentoring others without formal reporting lines
Job Responsibility:
  • Architect the future – Design and evolve models for efficient, robust, and adaptable autonomy, setting a high technical bar for quality and innovation
  • Accelerate research impact – Partner with team members to test, scale, and productionise research ideas - from architecture design to data strategy. Provide technical guidance and feedback on research design, implementation, and evaluation. Implement scalable, high-throughput training pipelines for models with temporal context and develop and evaluate novel data sampling strategies to accelerate training and generalisation
  • Get hands-on when it matters – Lead from the front by contributing directly to key system components, codebases, and experiments, especially during high-leverage moments. Contribute directly as an IC on core research and development tasks (~60-70% of time)
  • Disrupt thoughtfully – Challenge assumptions, ask sharp questions, and champion bold ideas that push us beyond incremental gains and toward breakthrough advances
  • Make things happen – Lead a high-performing, cross-functional team of applied scientists and ML engineers working across ML, RL, representation learning, and planning, among other areas. Work closely with the team manager to drive quarterly planning and execution of research-engineering initiatives, enabling rapid iteration and delivery in high-ambiguity environments. Translate ambiguity into action and ensure technical progress tracks with our mission
  • Champion change – Lead through ambiguity. Balance structure and adaptability to help your team navigate evolving priorities, novel research, and complex organisational change