CrawlJobs Logo

AI Framework Engineer

China, Shanghai · Job Posted May 13, 2026
Apply Position
Job Link Share

Job Description

As a core member of the team, you will play a pivotal role in optimizing and developing deep learning frameworks for AMD GPUs. Your experience will be critical in enhancing GPU kernels, deep learning models, and training/inference performance across multi-GPU and multi-node systems. You will engage with both internal GPU library teams and open-source maintainers to ensure seamless integration of optimizations, utilizing cutting-edge compiler technologies and advanced engineering principles to drive continuous improvement.

Job Responsibility

  • Build and optimize end to end distributed inference (e.g, P/D disaggregation and Large-EP) and RL solutions on mainstream frameworks like vLLM and SGlang
  • Collaborate with internal GPU library teams to analyze and improve training and inference performance on AMD GPUs
  • Engage with framework maintainers to ensure code changes are aligned with requirements and integrated upstream
  • Optimize deep learning performance on both scale-up (multi-GPU) and scale-out (multi-node) systems
  • Leverage advanced compiler technologies to improve deep learning performance
  • Enhance the full pipeline, including integrating graph compilers
  • Apply sound engineering principles to ensure robust, maintainable solutions

Requirements

  • Bachelor's and/or Master's in Computer Science, Computer Engineering, Electrical Engineering, or related fields
  • 3+ years of professional experience in technical software development, with a focus on GPU optimization, performance engineering, and framework development

Nice to have

Text to Video or Image to Video experience

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

AI Framework Engineer

8 matching positions

Senior Backend Engineer - AI Framework

As a Senior Backend Engineer on Gong’s AI Framework team, you will build the fou...
Location
Location
Israel , Tel Aviv
Salary
Salary:
Not provided
gong.io Logo
Gong
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of backend engineering experience, with strong system design and platform-building expertise
  • Strong analytical and problem-solving skills, with the ability to debug and resolve complex technical issues efficiently
  • Hands-on experience with agentic systems and frameworks such as LangChain, LangSmith, ADK, or equivalent agent orchestration platforms
  • Strong understanding of AI evaluation methodologies, including agent evaluations, prompt evaluation, regression testing, and quality monitoring
  • High proficiency in Python for building production-grade AI frameworks and services
  • Familiarity with Java and experience integrating backend platforms or tooling into Java-based systems
  • Experience building observability, monitoring, or platform tooling for distributed systems
  • Strong analytical skills and the ability to reason about complex, evolving AI-driven systems
  • Experience with cloud platforms and scalable microservices architectures
  • Excellent communication skills and a strong platform mindset, with experience enabling multiple teams
Job Responsibility
Job Responsibility
  • Agentic Framework Architecture: Designing and building Gong’s internal agentic framework, leveraging and integrating industry-standard tools such as LangChain, LangSmith, ADK, and similar ecosystems
  • Evaluation and Quality Systems: Building evaluation frameworks and workflows for AI agents, including offline and online evaluations, quality metrics, regression detection, and experimentation infrastructure
  • Observability, Monitoring, and Guardrails: Providing the organization with robust observability capabilities for AI agents, including tracing, logging, monitoring, cost tracking, and safety guardrails to ensure reliable and responsible usage
  • Developer Enablement Platforms: Creating APIs, SDKs, and abstractions that enable product teams to easily build, test, and operate agents while adhering to platform standards
  • Cross-Language Integrations: Designing integrations and tooling across Python and Java to enable seamless adoption of the AI framework within Gong’s broader backend ecosystem
What we offer
What we offer
  • flexibility
  • autonomy
  • positive work relationships
  • effective work habits
Read More
Arrow Right

Digital Twin AI Framework Engineer

This is a career-defining opportunity to play a crucial role in a hyper-scale AI...
Location
Location
United States , Salt Lake City
Salary
Salary:
Not provided
passivelogic.com Logo
PassiveLogic
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • MS/PhD in CS/CE/EE/Mathematics, information science, computer science, computational linguistics, computational physiology, human comfort, physics, or equivalent experience
  • Demonstrated experience in developing mappings and transformation processes to take data from a relational paradigm into a knowledge graph environment
  • Enthusiasm for human-centered API design and object model design
  • Demonstrated expertise in multi-disciplinary system modeling
  • Strong theoretical background in systems theory and graph theory
  • Strong programming and software architecture skills (C++, Swift, Rust, etc)
  • Exceptional communication skills
  • Organized and strategic
  • Collaborative mindset
  • Adaptability
Job Responsibility
Job Responsibility
  • Develop and evolve the Quantum digital twin object model, ontology, and API
  • Develop the Quantum Knowability Model — evolve the formal proofs of digital twin definitions, systems, events, and compositional stage by which the formalisms can be known
  • Support the development of the digital twin AI inferencing framework and the accelerator compute team
  • Advocate for human-centered UX/UI at every stage of ontology development
  • Apply systems theory and design processes throughout the Quantum evolution process, always collecting feedback and improving the language
  • Collaborate and coordinate with Quantum working groups including: formal methods, AI, compute, physics, geometry, compiler, and semantic teams, as well as with other Quantum partners to address their feedback and incorporate their needs
What we offer
What we offer
  • Competitive compensation
  • Generous equity share package
  • Medical, dental and vision coverage
  • Disability and life Insurance options
  • Flex PTO
  • Team-building events
  • Free catered lunch in the office Monday — Friday
  • Free ski pass
  • Free National Park pass
  • Fulltime
Read More
Arrow Right

AI Model, Framework, and GPU Engineer

We are looking for an experienced Machine Learning Software Engineer who will be...
Location
Location
Germany , Munich
Salary
Salary:
Not provided
amd.com Logo
AMD
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong technical and analytical skills in C/C++/Python AI development in Windows and Linux environment
  • Some knowledge on GPU programming and compiler
  • Capable problem solver
  • Technical leader to define goals and scope and drive development effort
  • Good communication skills
  • Enthusiastic about AI technologies
  • Strongly motivated to enable customers with best feature-rich efficient solutions
  • Strong cross-platform software development experience and deep programming skills in C/C++ and Python
  • Excellent problem-solving and effective communication skills
  • Development experience on CONV, GEMM, and/or non-linear operators
Job Responsibility
Job Responsibility
  • Develop and deliver innovative AI software solutions to AMD customers and users
  • Enable and optimize software stack for standard frameworks like ONNX and PyTorch, as well as new popular Open-Source AI software
  • Bring up new SOTA AI models, analyze and improve their performance
  • Participate and drive end-2-end AI software development from feature scoping, implementation, integration and verification, to customer enablement
Read More
Arrow Right

Senior Principal Engineer- End-to-End AI Training Framework

As the Senior Principal Engineer, E2E AI Training Framework for Autonomous Drivi...
Location
Location
United States , Sunnyvale
Salary
Salary:
240000.00 - 320000.00 USD / Year
https://www.bosch.pl/ Logo
Robert Bosch Sp. z o.o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s degree or Ph.D in Computer Science, Robotics, Electrical Engineering, AI, or a closely related field with a focus on autonomous systems
  • 10+ years of experience in software development and system engineering for autonomous driving or ADAS applications
  • Proven industry experience in releasing AI-based L2+ systems, with a strong track record of successful product deployments
  • Deep knowledge of E2E AI stack solutions and training algorithms, including reinforcement learning, and imitation learning, as well as motion control and optimization techniques
  • Deep knowledge of AI frameworks such as TensorFlow and PyTorch
  • Deep knowledge in model optimization and embedded deployment of E2E AI stacks to embedded automotive hardware
  • Deep knowledge of cloud-based scalable training pipelines, MLOps, and CICD for training AI models with large-scale fleet datasets
  • Proven track record of leading the end-to-end development and successful deployment of complex AI-powered systems into production environments at scale
Job Responsibility
Job Responsibility
  • Define and drive execution of the technical roadmap and strategy for the E2E AI machinery, including training pipelines, optimization techniques, simulation and MLOps tooling
  • Oversee the design, development, and testing of the E2E AI machinery and its interaction with data sources, model repositories, and development targets
  • Collaborate closely with other functional tech leads (e.g. data engineering, infrastructure) to define and drive the overall architecture of the AI machinery ecosystem
  • Guide the set-up of a development framework that enables fast evaluation and integration of emerging E2E AI solutions
  • Guide the transition from research prototypes to production-ready solutions, ensuring performance optimization on automotive-grade hardware and scalability
  • Leverage your prior industry experience in launching AI-based L2+ systems to implement best practices in system validation, testing (SIL/HIL), and continuous improvement
  • Mentor and lead a high-caliber team of AI scientists and engineers, fostering a culture of innovation, collaboration, and technical excellence
What we offer
What we offer
  • health, dental, and vision plans
  • health savings accounts (HSA)
  • flexible spending accounts
  • 401(K) retirement plan with an attractive employer match
  • wellness programs
  • life insurance
  • long term disability insurance
  • paid time off
  • parental leave
  • Fulltime
Read More
Arrow Right

Senior AI Engineer (AI Agents & Applied LLMs)

We are looking for a Senior AI Engineer to design, build, and scale intelligent ...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
infogrowth.in Logo
InfoGrowth
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in software engineering, machine learning, or AI-related roles
  • Strong experience with Python (required)
  • familiarity with JavaScript/TypeScript is a plus
  • Hands-on experience with LLMs, prompt engineering, and AI frameworks (LangChain, LlamaIndex, etc.)
  • Experience building production-grade APIs and services
  • Knowledge of vector databases (Pinecone, FAISS, Weaviate, etc.)
  • Solid understanding of data structures, algorithms, and system design
  • Experience deploying AI systems on cloud platforms (AWS, GCP, or Azure)
Job Responsibility
Job Responsibility
  • Design and develop AI agents capable of reasoning, planning, and executing tasks autonomously
  • Build and deploy LLM-powered applications using models such as GPT, Claude, or open-source LLMs
  • Integrate AI systems with APIs, databases, tools, and third-party services
  • Optimize prompts, workflows, and agent architectures for accuracy, performance, and cost
  • Lead end-to-end AI projects from concept to production
  • Collaborate closely with product managers, designers, and backend teams
  • Establish best practices for AI safety, evaluation, monitoring, and governance
  • Mentor junior engineers and conduct technical reviews
  • Fulltime
Read More
Arrow Right

Applied AI Engineer - AI Solutions

As an Applied AI Engineer, you’ll research and utilize state-of-the-art Gen AI a...
Location
Location
United States , Redwood City; San Francisco
Salary
Salary:
172000.00 - 300000.00 USD / Year
snorkel.ai Logo
Snorkel AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • B.S. degree in a quantitative field such as Computer Science, Engineering, Mathematics, Statistics, or comparable degree/experience
  • 3+ years of customer-facing experience in the design and implementation of AI/ML solutions
  • Proficiency in Python, including strong grounding in software engineering fundamentals (e.g., modular design, testing, profiling, packaging) and experience with modern Python constructs and libraries for type validation and typed data modeling (e.g., pydantic), building type-safe systems (e.g., mypy), testing (e.g., pytest), packaging and environment configuration (e.g., poetry), API and service frameworks (e.g., FastAPI), serialization and structured data handling (e.g., msgspec), and orchestration tooling relevant to ML deployment (e.g., Ray, Airflow)
  • Expertise across the Applied AI stack, spanning classical ML libraries (e.g., scikit-learn), deep learning frameworks (e.g., PyTorch), foundation-model ecosystems (e.g., Hugging Face Transformers), vector/embedding tooling (e.g., FAISS), data processing frameworks (e.g., pandas, Spark), retrieval/RAG tooling (e.g., Chroma, Weaviate), synthetic dataset curation, evaluation workflows, and LLM orchestration, workflow, agent authoring tools (e.g., LlamaIndex, LangGraph, CrewAI)
  • Experience leading strategic, customer-facing initiatives and collaborating with business stakeholders to ensure ML solutions drive successful business outcomes, with a strong focus on teaching and enablement
  • Outstanding presentation skills to technical and executive audiences, whether impromptu on a whiteboard or using presentations and demos
  • Ability to work in a fast-paced environment and balance priorities across multiple projects at once
Job Responsibility
Job Responsibility
  • Partner with customers to build and deploy impactful Gen AI and machine learning solutions, from use case scoping and data exploration to model development and deployment. This may involve leveraging Snorkel Flow or designing custom approaches using state-of-the-art tools, with the goal of delivering real business value and informing the evolution of the Snorkel platform
  • Develop and implement state of the art AI systems such as retrieval-augmented generation (RAG), fine-tuning pipelines, prompt engineering recipes and agentic workflows
  • Create augmented real-world datasets and comprehensive evaluation workflows to ensure model reliability, transparency, and stakeholder trust. A data- and evaluation-first mindset is essential for success in this role
  • Forge and manage relationships with our customers’ leadership and stakeholders to ensure successful development and deployment of AI projects with Snorkel Flow
  • Collaborate closely with pre-sales Solutions and Product teams to map customer needs to existing capabilities, prioritize roadmap gaps, and guide successful project setup
  • Work with other Applied AI Engineers to standardize solutions and contribute to internal tooling and best practices
  • Lead stakeholder education on quantitative capabilities, helping them to understand the strengths and weaknesses of different approaches and what problems are best-suited for Snorkel AI
  • Serve as the voice of our customers for new AI paradigms, data science workflows, and share customer feedback to product teams
  • Conduct one-to-few and one-to-many enablement workshops to transfer knowledge to customers considering or already using Snorkel AI
  • Annual travel up to 25%
What we offer
What we offer
  • equity in the form of employee stock options
  • Fulltime
Read More
Arrow Right

Senior AI Engineer Vertex AI / Gemini

My client is looking for a Senior AI Engineer with proven experience delivering ...
Location
Location
Qatar , Doha
Salary
Salary:
Not provided
welovesalt.com Logo
Salt
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong hands-on experience with Vertex AI and Gemini Enterprise AI Platform
  • Delivered 2-3+ AI use cases to production using the Google AI stack
  • Experience building and deploying RAG (Retrieval-Augmented Generation) solutions
  • Multi-agent AI systems
  • Strong Python and cloud engineering skills
  • Experience with vector databases, embeddings, prompt engineering, and AI orchestration frameworks
  • Excellent English communication skills
Job Responsibility
Job Responsibility
  • Deliver production-grade AI solutions on Google Cloud Platform (GCP)
  • Build and deploy RAG solutions
  • Build and deploy multi-agent AI systems
  • Fulltime
Read More
Arrow Right

Senior AI Engineer (Agentic AI / LLM Engineering)

We’re partnering with a rapidly growing, innovation-focused organization that is...
Location
Location
United States
Salary
Salary:
Not provided
zeektek.com Logo
Zeektek
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in software engineering, data engineering, or AI/ML engineering
  • Hands-on experience building applications using LLMs (e.g., GPT, open-source models)
  • Experience with agentic AI frameworks (e.g., LangChain, AutoGen, CrewAI, or similar)
  • Strong programming skills in Python
  • Experience working with large datasets and distributed data platforms
  • Familiarity with Databricks or similar modern data platforms
  • Experience building production-grade AI systems (not just POCs)
  • Strong understanding of prompt engineering and LLM optimization techniques
Job Responsibility
Job Responsibility
  • Design and build agentic AI systems leveraging large language models (LLMs)
  • Develop scalable AI solutions using modern data and AI platforms (Databricks preferred)
  • Translate business problems into production-ready AI workflows and applications
  • Collaborate with product and architecture teams to define AI use cases and technical approaches
  • Implement and optimize: Prompt engineering strategies
  • Token usage and cost efficiency
  • Model performance and response quality
  • Work with large-scale datasets to support training, fine-tuning, and inference workflows
  • Contribute to the development of AI engineering standards, frameworks, and best practices
  • Partner with data science teams to integrate models into production environments
What we offer
What we offer
  • Weekly Direct Deposit
  • 401K Matching
  • Competitive medical, dental and vision insurance
  • Consistent communication throughout your project
  • ZeekTek Referral Program
Read More
Arrow Right