Multimodal Algorithm Engineer (Model Optimization) Job at AMD (Shanghai)

Research Scientist / Engineer – Foundation Model: Core Research

This is a rare and foundational opportunity to define the future of multimodal A...

Location

United States , Palo Alto

Salary:

250000.00 - 450000.00 USD / Year

Luma AI

Expiration Date

Until further notice

Requirements

A Bachelor's, Master's, or PhD degree in Computer Science, Machine Learning, Physics, or Mathematics is essential
A 'first-principles' intuition for scaling
Fluent in the language of frontier AI
Proven ability to design and rigorously analyze experiments and to articulate complex technical concepts effectively
Practical experience with distributed or high-performance computing environments, particularly managing and optimizing training runs on large-scale GPU clusters

Job Responsibility

Unified Modeling & Efficiency Drive the core research that powers all of Luma's products — co-designing multimodal representations, advancing core algorithms for long-context training, and establishing rigorous scaling laws to predict performance across compute budgets
Alignment & Evaluation Close the gap between training loss and user experience. Develop proxy tasks and automated metrics that serve as the compass for research decisions — ensuring our models optimize for what actually matters to users, not just benchmarks
Research Infrastructure Build the engine for high-velocity research. Maintain production-research parity, ensure reproducibility, and design systems for rapid experimentation — so that novel ideas go from hypothesis to validated result as fast as possible

Fulltime

LLM Algorithm Tech Lead – Applied Large Language Model Systems

Plaud is building the next generation intelligence infrastructure and interfaces...

Location

United States , San Francisco

Salary:

230000.00 - 300000.00 USD / Year

Plaud

Expiration Date

Until further notice

Requirements

5–10 years of experience in LLM/NLP/AI
Strong prompt engineering and reasoning design skills
Proven ability to deliver LLM-powered features into production
Experience with RAG and knowledge-enhanced reasoning
Strong architectural thinking and system design skills
Strong communication and leadership capabilities
Knowledge of memory systems or personalization engines
Experience building eval frameworks or safety systems
Experience leading technical teams
Experience in foundational model algorithm design or efficiency optimization

Job Responsibility

Intelligence Architecture Development: Design structured reasoning pipelines, planning flows, chain-of-thought workflows
Build capability primitives such as memory, personalization, proactive insights
Develop modular and reusable intelligence components
Applied LLM Features & Production Integration: Lead the design and deployment of LLM-based product functionality
Ensure output reliability, consistency, safety, and user-centric alignment
Apply prompting, constraints, and reasoning structures to reduce hallucination
Retrieval-Augmented Generation (RAG): Build and optimize multi-hop, multi-source retrieval pipelines
Implement chunking, indexing, reranking, and retrieval evaluation
Ensure RAG improves factuality and reduces error rates
Model Strategy & Inference Optimization: Select appropriate model families based on capability and cost constraints

What we offer

Competitive Compensation: $230K-$300K base salary+performance bonus+Equity
Comprehensive Benefits: Top-tier healthcare for employees and dependents, including dental and vision, and a generous employer subsidy
Retirement Planning: 401(k) plan for full time employees with company matching
Paid Time Off: Unlimited PTO, plus 13 paid holidays
New Parent Leave: 12 weeks of paid time off to spend time with your new family, regardless of gender
Hybrid Office: Minimum of 3x in office per week
Gear: New hires are equipped with their choice of new top-of-the-line laptops and workstation setups
Perks: Best office equipment. Annual offsites. Free office drinks and snacks

Fulltime

Research Engineer Robotics (Systems)

Reality Labs Research (RL-R) brings together a diverse and highly interdisciplin...

Location

United States , Redmond

Salary:

183997.00 - 257000.00 USD / Year

Data Scientist

As part of our Client’s high-performing AI Innovation team, you’ll help design, ...

Location

United States , New York

Salary:

200000.00 - 225000.00 USD / Year

Solomon Page

Expiration Date

Until further notice

Requirements

Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience)
2+ years of experience as a Data Scientist, Machine Learning Engineer, or applied AI practitioner, with a strong foundation in computer science, algorithms, and software development
Advanced programming skills in Python, with experience building production-grade systems beyond research or experimentation
Solid understanding of machine learning and applied AI concepts, with experience taking solutions from prototype to production
Hands-on experience designing, building, and deploying LLM-driven or GenAI applications, including familiarity with vector databases, embeddings pipelines, or semantic search systems
Practical experience with cloud-based deployments and infrastructure tools (e.g., AWS, Docker, GitHub) and an understanding of modern DevOps practices, containerization, orchestration, caching strategies, and cost-aware design
Strong problem-solving skills and systems thinking, with the ability to balance trade-offs across model quality, scalability, inference latency, cost, and operational complexity
Ability to interpret and implement research ideas and algorithms, actively contributing to research and development initiatives while translating them into production solutions
Excellent communication and collaboration skills, with experience working closely with product managers, engineers, and domain experts to deliver actionable technical solutions
Passion for learning and staying current with the rapidly evolving AI/ML landscape, including emerging best practices for GenAI applications

Job Responsibility

Apply strong problem-solving and critical thinking skills to break down complex, ambiguous requirements into clear, implementable technical components and system designs
Design, build, and maintain AI-powered and data-driven systems with a focus on modern language and multimodal models, including LLM-driven applications, RAG pipelines, and agentic workflows
Evaluate and productionize commercial and open-source LLMs, choosing appropriate models, tools, and techniques for each use case
Develop multi-step agentic workflows that incorporate tools, external data sources, memory, and control logic
Manage the orchestration of production LLM workflows and agentic systems, ensuring reliability and efficiency through prompt routing, state management, retries, fallbacks, and error handling
Design, test, and iteratively refine prompts and system instructions using prompt engineering and tuning techniques to improve model reliability, accuracy, and task performance
Maintain production-grade code and services with automated monitoring and performance tracking, using metrics and alerts to guide continuous improvements in models, prompts, and pipelines
Apply systems thinking to design and optimize AI and LLM systems, balancing quality, scalability, latency, cost, and operational complexity, while implementing efficiency improvements using model selection, prompt design, batching, caching, and retrieval strategies
Define and implement evaluation and observability frameworks for AI systems, including automated testing, task-specific benchmarks, regression testing for prompts, human-in-the-loop validation, and performance monitoring
Build and integrate AI models into backend systems and APIs to support both real-time and batch inference, ensuring solutions are production-ready, scalable, and efficient

Fulltime

Applied Scientist, Silicon and Systems Group Edge AI

Amazon Devices is an inventive research and development company that designs and...

Location

United Kingdom , Cambridge

Salary:

Not provided

Amazon Pforzheim GmbH

Expiration Date

Until further notice

Requirements

PhD, or a Master's degree and experience in CS, CE, ML or related field
Experience in patents or publications at top-tier peer-reviewed conferences or journals
Experience programming in Java, C++, Python or related language
Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing
Experience in building machine learning models for business application

Job Responsibility

Collaborate with cross-functional engineers and scientists to advance the state of the art in multimodal model evaluations for devices, including audio, images, and videos
Invent and validate reliability for novel automated evaluation methods for perception tasks, such as fine-tuned LLM-as-judge
Develop and extend our evaluation framework(s) to support expanding capabilities for multimodal language models
Analyze large offline and online datasets to understand model gaps, develop methods to interpret model failures, and collaborate with training teams to enhance model capabilities for product use cases
Work closely with other scientists, compiler engineers, data collection, and product teams to advance evaluation methods

Senior GenAI Engineer

NTT DATA strives to hire exceptional, innovative and passionate individuals who ...

Location

India , Bangalore

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

Bachelor's/Master's Degree or equivalent
5+ years in ML engineering, around 1+ years of hands-on with LLMs/GenAI and agentic frameworks
Experience in shipping production AI systems on at least one hyperscaler (Azure/AWS/GCP)
Experience delivering end-to-end GenAI based solutions
Strong Python experience to build multiple AI-ML/ GenAI Solutions
Experience working on Agent orchestration with leading frameworks like LangGraph, LangChain, Semantic Kernel, CrewAI, AutoGen
Strong experience working on SQL Query, Vector DB like Pinecone, Qdrant, Fiaas
Experience working on hybrid search and re-rankers
Experience on evaluation & observability LangSmith/ human-in-the-loop workflows
Strong experience in using any one of the leading Hyperscaler services from Azure/AWS/GCP

Job Responsibility

Build GenAI/agentic systems (chat copilots, workflow/graph agents, tool use, memory)
Implement chunking, hybrid search, vector stores, re-ranking, feedback loops, and continuous data quality/eval
Select/integrate/finetune LLMs & multimodal models
Apply prompt-engineering techniques to specific use cases and types
Experience working on solutions based on LLM, NLP, DL (Deep Learning), ML (Machine Learning), object detection / classification etc
Should have good understanding of DevOps
Should have good understanding of LLM evaluation
Should have deployed min of 2 models in production (MLOps)
Should have understanding of guardrails, policy filters, PII redaction, runtime monitors and agent observability
Unit testing of GenAI Solutions built and documentation of results

Applied Scientist II

Prime Video is a first-stop entertainment destination offering customers a vast ...

Location

United States , Sunnyvale; Seattle

Salary:

142800.00 - 222200.00 USD / Year

Amazon

Expiration Date

Until further notice

Requirements

3+ years of building models for business application experience
PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
Experience programming in Java, C++, Python or related language
Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing

Job Responsibility

Develop foundation models for content understanding using state-of-the-art deep learning and multimodal learning techniques to analyze video and text
Build time sequence foundation models to understand and predict customer behavior patterns and viewing trajectories
Work closely with engineers and product managers to design, implement and launch solutions end-to-end across various Prime Video experiences
Design and conduct offline and online (A/B) experiments to evaluate proposed solutions based on in-depth data analyses
Effectively communicate technical and non-technical ideas with teammates and stakeholders
Stay up-to-date with advancements and the latest modeling techniques in foundation models, multimodal learning, and time series analysis
Publish your research findings in top conferences and journals

What we offer

health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage)
401(k) matching
paid time off
parental leave
sign-on payments
restricted stock units (RSUs)

Fulltime

Mid Level GenAI Engineer

The Mid Level GenAI Engineer role at NTT DATA involves developing and implementi...

Location

India , Pune

Salary:

Not provided

NTT DATA

Expiration Date

Until further notice

Requirements

5+ years in ML engineering, around 1+ years of hands-on with LLMs/GenAI and agentic frameworks
Experience in shipping production AI systems on at least one hyperscaler (Azure/AWS/GCP)
Experience delivering end-to-end GenAI based solutions
Strong Python experience to build multiple AI-ML/ GenAI Solutions
Experience working on Agent orchestration with leading frameworks like LangGraph, LangChain, Semantic Kernel, CrewAI, AutoGen
Strong experience working on SQL Query, Vector DB like Pinecone, Qdrant, Fiaas
Experience working on hybrid search and re-rankers
Experience on evaluation & observability LangSmith/ human-in-the-loop workflows
Strong experience in using any one of the leading Hyperscaler services from Azure/AWS/GCP
Experience working on NLP, CV, Deep Learning Algorithms

Job Responsibility

Build GenAI/agentic systems (chat copilots, workflow/graph agents, tool use, memory)
Implement chunking, hybrid search, vector stores, re-ranking, feedback loops, and continuous data quality/eval
Select/integrate/finetune LLMs & multimodal models
Apply prompt-engineering techniques to specific use cases and types
Experience working on solutions based on LLM, NLP, DL (Deep Learning), ML (Machine Learning), object detection / classification etc
Should have good understanding of DevOps
Should have good understanding of LLM evaluation
Should have deployed min of 2 models in production (MLOps)
Should have understanding of guardrails, policy filters, PII redaction, runtime monitors and agent observability
Unit testing of GenAI Solutions built and documentation of results

Fulltime

Select Country

Multimodal Algorithm Engineer (Model Optimization)

Job Description

Job Responsibility

Requirements

Nice to have

What we offer

Looking for more opportunities?