CrawlJobs Logo

AI Model Training Development Engineer

amd.com Logo

AMD

Location Icon

Location:
China , Beijing

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are looking for Machine Learning Engineer to join our Models and Applications team. If the challenge of distributed training of large model on large number of GPUs excites you and you are passionate about improving training efficiency and enjoy innovating and coming up with new ideas, then this role is for you. You will be part of world class team focus on addressing the challenge of training generative AI.

Job Responsibility:

  • Train large model to convergence on AMD GPUs
  • Improve the end-to-end training pipeline performance
  • Optimize the distributed training pipeline and algorithm to scale out
  • Contribute your changes to open source
  • Up to date with latest training algorithms
  • Influence the direction of AMD AI platform
  • Cross team collaborate with various group and stakeholder

Requirements:

  • Experience in ML frameworks such as PyTorch, JAX or Tensorflow
  • Experience with distributed training and distributed training framework such as DeepSpeed
  • Experience with LLM or Vision, especially large model is a plus
  • Excellent python programing skills, including debugging, profiling, and perf analysis
  • Experience with ML pipeline
  • Strong communication and problem-solving skills
  • A master’s degree in computer science, artificial intelligence, machine learning, or a related field

Nice to have:

Experience with LLM or Vision, especially large model is a plus

Additional Information:

Job Posted:
January 05, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI Model Training Development Engineer

AI Research Engineer, VLA Models

As a Research Engineer on the Vision-Language Action (VLA) team, you will be res...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 300000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills in Python and familiarity with build systems like Bazel
  • Experience using deep learning frameworks such as PyTorch
  • Proficiency in simulation environments like Isaac Sim or MuJoCo
  • Deep understanding of generalization in autonomous systems
  • Experience designing and validating evaluation metrics in real or simulated environments
  • Ability to work cross-functionally with controls, QA, and data teams to operationalize models
Job Responsibility
Job Responsibility
  • Take end-to-end ownership of autonomous capability development: data review, model design, deployment, and fleet performance monitoring
  • Train NEO to perform whole-body manipulation and navigation tasks in unfamiliar environments
  • Design robust evaluation metrics to support scalable model pre-training
  • Experiment with cutting-edge vision-language and generative model techniques to predict robot actions
  • Collaborate with controls, QA, and data teams to deploy reinforcement learning policies to the production fleet
What we offer
What we offer
  • Equity
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right

AI Engineer

Join Inference Group, a team of passionate innovators devoted to empowering busi...
Location
Location
Salary
Salary:
Not provided
inferencegroup.com Logo
Inference Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Skilled in AI engineering
  • Experience in creating, programming, and training complex networks of algorithms
  • Experience in evaluating and comparing algorithm performance based on large, real-world data sets
  • Experience in designing and implementing Machine Learning algorithms
  • Experience in undertaking data mining exercises from various sources
  • Experience in oversight of ongoing AI/ML engineering projects
  • Experience in creating intelligent AI models utilizing deep learning, neural networks, and ML algorithms
  • Well-versed in programming, software engineering, and data science fields
  • Passionate and curious about ongoing development within the AI/ML space
  • Previous experience delivering engineering projects focused on realising the potential of AI/ML within a consultancy space
Job Responsibility
Job Responsibility
  • Broaden engineering capabilities
  • Create, program, and train complex networks of algorithms to grow AI programmes
  • Evaluate and compare algorithm performance based on large, real-world data sets
  • Design and implement Machine Learning algorithms
  • Undertake various data mining exercises from various sources to gain valuable insights to accelerate existing algorithms and future models
  • Provide oversight of ongoing AI/ML engineering projects
  • Play a crucial role in supporting clients' objectives by providing important technical insights into ongoing digital transformation and delivery
  • Create intelligent AI models which utilise deep learning, neural networks and ML algorithms to gain key business insights and create opportunities for informed decision making and future planning
What we offer
What we offer
  • Opportunity to work alongside leading experts in data, AI, and technology
  • Access to the latest technology
  • Training and certifications
  • Commitment to ongoing learning and development
Read More
Arrow Right

AI Engineer

AI Engineer position at Inetum, a European leader in digital services, focusing ...
Location
Location
Romania , Bucharest
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 3 to 5 years of experience in AI solutions deployment in enterprise environments
  • Understanding of how LLMs (GPT-4, Gemini, Claude, Llama, Mistral AI) work, their capabilities, and limitations
  • Familiarity with model architecture, tokenization, context windows, and prompt formatting
  • Crafting effective prompts for various tasks (text generation, summarization, Q&A, code generation, images and sound manipulations)
  • Techniques for prompt chaining, few-shot and zero-shot learning, and multi-turn conversations
  • Knowledge of prompt templates, system instructions, and role-based prompting
  • Understanding the concept of AI agents: autonomous entities that perceive, reason, and act to achieve goals
  • Familiarity with multi-agent systems, agent orchestration, and agentic workflows (e.g., using frameworks like Lang Chain, Crew AI, Auto Gen)
  • Ability to design, prompt, and coordinate groups of AI agents for collaborative or competitive tasks
  • Knowledge of agent communication, delegation, and task decomposition
Job Responsibility
Job Responsibility
  • Develop, refine, and optimize prompts for LLMs (GPT-4, Gemini, Claude, Llama, Mistral) to support a variety of tasks such as text generation, summarization, Q&A, and code generation
  • Design and implement prompt strategies for multi-turn conversations, prompt chaining, and role-based instructions
  • Build and coordinate groups of AI agents (multi-agent systems) for collaborative or competitive tasks using frameworks such as Lang Chain, Crew AI, or Auto Gen
  • Upgrade existing prompts while releases and solutions set evolve
  • Evaluate and improve the effectiveness of prompts and agent workflows through iterative testing, A/B experimentation, and performance analysis
  • Collaborate with cross-functional teams (developers, data scientists, product managers) to integrate prompt engineering and agentic workflows into enterprise solutions
  • Ensure compliance with data privacy, security, and regulatory standards (GDPR, NIS2) in all prompt and agent designs
  • Document prompt strategies, agent architectures, and best practices for internal knowledge sharing and training
  • Ensure provided solutions are ready for production, documented and monitored ensuring consistent delivery and quality
What we offer
What we offer
  • Full access to foreign language learning platform
  • Personalized access to tech learning platforms
  • Tailored workshops and trainings to sustain your growth
  • Medical Insurance
  • Meal tickets
  • Monthly budget to allocate on flexible benefit platform
  • Access to 7 Card services
  • Wellbeing activities and gatherings
  • Fulltime
Read More
Arrow Right

AI Engineer

AI Engineer position at Inetum, a European leader in digital services, focusing ...
Location
Location
Romania , Bucharest
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 3 to 5 years of experience in AI solutions deployment in enterprise environments
  • Understanding of how LLMs (GPT-4, Gemini, Claude, Llama, Mistral AI) work, their capabilities, and limitations
  • Familiarity with model architecture, tokenization, context windows, and prompt formatting
  • Crafting effective prompts for various tasks (text generation, summarization, Q&A, code generation, images and sound manipulations)
  • Techniques for prompt chaining, few-shot and zero-shot learning, and multi-turn conversations
  • Knowledge of prompt templates, system instructions, and role-based prompting
  • Understanding the concept of AI agents: autonomous entities that perceive, reason, and act to achieve goals
  • Familiarity with multi-agent systems, agent orchestration, and agentic workflows (e.g., using frameworks like Lang Chain, Crew AI, Auto Gen)
  • Ability to design, prompt, and coordinate groups of AI agents for collaborative or competitive tasks
  • Knowledge of agent communication, delegation, and task decomposition
Job Responsibility
Job Responsibility
  • Develop, refine, and optimize prompts for LLMs (GPT-4, Gemini, Claude, Llama, Mistral) to support a variety of tasks such as text generation, summarization, Q&A, and code generation
  • Design and implement prompt strategies for multi-turn conversations, prompt chaining, and role-based instructions
  • Build and coordinate groups of AI agents (multi-agent systems) for collaborative or competitive tasks using frameworks such as Lang Chain, Crew AI, or Auto Gen
  • Upgrade existing prompts while releases and solutions set evolve
  • Evaluate and improve the effectiveness of prompts and agent workflows through iterative testing, A/B experimentation, and performance analysis
  • Collaborate with cross-functional teams (developers, data scientists, product managers) to integrate prompt engineering and agentic workflows into enterprise solutions
  • Ensure compliance with data privacy, security, and regulatory standards (GDPR, NIS2) in all prompt and agent designs
  • Document prompt strategies, agent architectures, and best practices for internal knowledge sharing and training
  • Ensure provided solutions are ready for production, documented and monitored ensuring consistent delivery and quality
What we offer
What we offer
  • Full access to foreign language learning platform
  • Personalized access to tech learning platforms
  • Tailored workshops and trainings to sustain your growth
  • Medical Insurance
  • Meal tickets
  • Monthly budget to allocate on flexible benefit platform
  • Access to 7 Card services
  • Wellbeing activities and gatherings
  • Fulltime
Read More
Arrow Right

Model Optimization Engineer

We are looking for a hands‑on Engineer to design, implement, and optimize AI mod...
Location
Location
China , Beijing
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong software engineering in Python and C/C++
  • Practical experience with PyTorch/JAX and building/extending deep learning frameworks
  • Hands‑on CUDA and/or ROCm development
  • experience writing or optimizing GPU kernels
  • Experience with Triton (kernel development/optimization) is highly desired
  • Proven experience with model optimization techniques, especially low‑bitwidth quantization and other compression methods
  • Familiarity with GenAI inference engines and optimizations (e.g., vLLM, SGLang, xDiT, continuous batching, speculative decoding)
  • Skilled at profiling and performance debugging across stack layers (operator → model → framework → hardware)
Job Responsibility
Job Responsibility
  • Design, implement, and optimize inference and training pipelines for AMD GPUs/accelerators at the framework, model, and operator levels
  • Lead research and development of model optimization algorithms: low‑bitwidth quantization, pruning/sparsity, compression, efficient attention mechanisms, and lightweight architectures
  • Implement and tune CUDA/ROCm/Triton kernels for critical operators
  • profile and eliminate performance bottlenecks
  • Integrate and optimize models for PyTorch/JAX and common distributed training/inference stacks (Torchtitan, Megatron, DeepSpeed, HF Transformers, etc.)
  • Reduce latency and increase throughput for large‑model inference (e.g., batching strategies, caching, speculative decoding)
  • Contribute to and/or maintain open‑source inference/training tools, ensuring production readiness and community adoption
  • Provide technical support and guidance to customers and internal teams to achieve target accuracy and performance on AMD platforms
Read More
Arrow Right

Senior Platform Engineer, AI Evaluation

We’re looking for an AI Platform Engineer to evolve and extend our internal eval...
Location
Location
United States , Mountain View
Salary
Salary:
137871.00 - 172339.00 USD / Year
khanacademy.org Logo
Khan Academy
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field
  • 5 years of Software Engineering experience with 2+ of those years working on the evaluation of generative AI systems
  • Strong programming skills in Go, Python, SQL, and at least one data pipeline framework (e.g., Airflow, Dagster, Prefect)
  • Familiarity with the architecture of large language models and their industry-standard APIs
Job Responsibility
Job Responsibility
  • Evolve and extend our internal evaluation framework for assessing the quality of our AI-driven experiences
  • Work closely with ML data engineers and platform developers to help internal teams adopt an eval-driven development process incorporating offline benchmark tests and online experiments
  • Gather internal requirements, getting buy-in for changes, and then developing documentation and training materials
What we offer
What we offer
  • Competitive salaries
  • Ample paid time off as needed
  • 8 pre-scheduled Wellness Days in 2026
  • Remote-first culture
  • Generous parental leave
  • 401(k) + 4% matching
  • Comprehensive insurance, including medical, dental, vision, and life
  • Fulltime
Read More
Arrow Right

Sr. Software Development Engineer

We are looking for a Senior Software Engineer who will bring creativity and expe...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-10 years of experience building distributed systems, working with databases, and implementing production-ready high quality features
  • Proficiency in building, integrating with, and supporting APIs and web services using a variety of languages, data formats, and data transformations
  • Demonstrated ability to effectively collaborate with cross-functional teams, including designers, product managers, and other developers, to develop and deliver high-quality applications
  • An entrepreneurial spirit: you’re agile, creative, resourceful, and tenacious as you solve problems and achieve team and company goals
  • Comfortable with modern open source technologies and tools
  • Experience developing software products, scalable internet software, and applications using a range of software models including object-oriented and functional design patterns
  • B.S./M.S. in Computer Science or equivalent industry experience
Job Responsibility
Job Responsibility
  • Partner with UX, Product Management, Data Science, and other teams to create software that customers love
  • Develop clean, reusable, supportable, and well-tested RESTful APIs and web services, including Highspot’s external API
  • Optimize and perform enhancements to large-scale data services built on top of MongoDB, Postgres, Redis, and other technologies
  • Integrate Highspot with external APIs, including third-party Customer Relationship Management (CRM) systems, Content Management Systems (CMS), and other partner applications
  • Collaborate with the Data Science team to integrate advanced machine learning models into the application to deliver cutting edge AI features and help solve complex business problems for customers
  • Build scalable methodologies, tools, and techniques accompanied by excellent technical documentation
  • Stay abreast of new technologies and practices to further enhance team capabilities and your own skill
  • Act as a mentor and source for direction, training, and guidance for more junior engineers
  • Fulltime
Read More
Arrow Right

Research Engineer, World Models

You will build large multi-modal generative “world models” that predict future s...
Location
Location
United States , Palo Alto
Salary
Salary:
180000.00 - 300000.00 USD / Year
1x.tech Logo
1X Technologies
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong experience with Python (and related tooling such as Bazel)
  • Proficiency with PyTorch or equivalent deep learning frameworks
  • Familiarity with simulation platforms (e.g., Isaac Sim, MuJoCo)
  • Experience building or working with multi-modal generative models combining video, audio, text, and action prediction
  • Ability to design and optimize large-scale data pipelines and loaders for training
  • Understanding of scaling laws and metrics for foundation models
Job Responsibility
Job Responsibility
  • Full-stack engineering: data engineering, model architecture design, and delivering polished products
  • Develop high-throughput data loaders for large multi-modal datasets
  • Implement tokenizers and transformers tailored for web-scale robot data
  • Translate improvements in world model architectures into improvements in robot autonomy
  • Predict how real-world robot performance scales with pre-training metrics (e.g., log loss)
What we offer
What we offer
  • Health, dental, and vision insurance
  • 401(k) with company match
  • Paid time off and holidays
  • Fulltime
Read More
Arrow Right