CrawlJobs Logo

Senior Framework Engineer — Diffusion Inference

amd.com Logo

AMD

Location Icon

Location:
Finland , Helsinki

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

As a Framework Engineer for Diffusion Model Inference, you will design, build, and evolve a production-grade inference framework for Diffusion Transformers (DiTs) powering state-of-the-art image and video generation. You will focus on framework-level engineering—model integration, scalable parallel inference, kernel plumbing, packaging, testing, and release management—ensuring diffusion workloads run out-of-the-box with exceptional performance on modern GPU systems.

Job Responsibility:

  • Develop and maintain a diffusion inference framework for image/video generation with clean APIs and strong compatibility with widely used diffusion ecosystems
  • Own scalable parallel inference features for DiT workloads—single-node and multi-node
  • Integrate optimized operator backends (attention, GEMM, quantized paths) by bridging Python/C++ layers and ensuring correctness and high performance
  • Ship production-grade packaging & releases including containers, versioned artifacts, dependency hygiene, and pip-installable distributions
  • Build continuous testing & benchmarking infrastructure
  • Collaborate across the GPU software stack and translate framework needs into actionable upstream improvements
  • Support strategic customers by mapping real-world inference constraints into framework features, reference configurations, and reproducible deployment recipes
  • Communicate clearly around technical tradeoffs, performance bottlenecks, and roadmap decisions

Requirements:

  • Strong Python and/or C++ engineering skills (debugging, profiling, testing, navigating complex codebases, clean abstractions)
  • Experience with ML frameworks—PyTorch strongly preferred, JAX/TF welcome—and familiarity with diffusion model execution
  • Proven ability to work in GPU-accelerated environments with intuition for performance, memory/compute tradeoffs, and profiling
  • Comfort with containers (Docker) and modern dev workflows (git, CI, build systems)
  • Strong cross-functional collaboration and clear technical communication skills
  • BSc, MSc, PhD, or equivalent experience in Computer Science, Electrical Engineering, or a related field

Nice to have:

  • Experience with diffusion inference engines or parallel inference frameworks for DiTs (sequence, pipeline, CFG-parallel concepts)
  • Exposure to operator libraries such as AITER-style kernel collections (attention/GEMM/quant/comm)
  • GPU kernel development experience (HIP/CUDA/Triton) or familiarity with compiler/codegen backends
  • Knowledge of high-performance networking (RDMA, RoCE, InfiniBand, UCX) for multi-node inference
  • Experience building benchmarking and performance regression systems at scale

Additional Information:

Job Posted:
April 05, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Framework Engineer — Diffusion Inference

Senior Software Engineer – AI

NStarX is seeking a highly skilled Senior Software Engineer – AI with a strong f...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
nstarxinc.com Logo
NStarX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field (PhD is a plus)
  • 9+ years of experience in AI/ML engineering or related roles
  • 3+ years of experience in Generative AI with team leadership responsibilities
  • Proven track record of production-grade ML and GenAI model development and deployment
  • Programming: Python (preferred)
  • GenAI Frameworks: Hugging Face Transformers, Diffusers, LangChain, TGI
  • Serving & Inference: FastAPI, gRPC, NVIDIA Triton, TorchServe
  • Cloud Platforms: AWS (SageMaker, EKS), GCP (Vertex AI, GKE), Azure (Azure ML, AKS)
  • MLOps & DevOps: Kubeflow, MLflow, GitHub Actions, Jenkins, Helm, Terraform
  • Optimization Techniques: Model quantization, distillation, pipeline and tensor parallelism
Job Responsibility
Job Responsibility
  • Design, develop, and deploy machine learning models and AI algorithms to address complex business challenges
  • Lead and mentor a team of AI/ML engineers, ensuring quality and scalability in solution design and implementation
  • Collaborate closely with cross-functional teams including data scientists, software engineers, product managers, and UX designers
  • Lead the development and deployment of Generative AI applications across text, code, image, and audio modalities using state-of-the-art LLMs
  • Design and implement CI/CD pipelines for the GenAI model lifecycle including training, validation, packaging, and deployment
  • Apply best practices for model performance tuning, cost optimization, and scalable deployment in cloud and hybrid environments
  • Develop prompt engineering, fine-tuning strategies (LoRA, QLoRA, PEFT), and evaluation protocols tailored to business use cases
  • Stay current with emerging trends in AI, ML, and Generative AI and drive adoption across teams
  • Document processes, model architectures, and deployment strategies for traceability and knowledge sharing
  • Work closely with cross-functional teams to gather requirements and deliver high-quality solutions
What we offer
What we offer
  • Competitive salary aligned with market standards
  • Opportunities for professional development and skill enhancement
  • A collaborative and innovative work environment
  • Fulltime
Read More
Arrow Right

Senior Principal, Machine Learning & Artificial Intelligence

Xometry is seeking a Senior Principal, Machine Learning & Artificial Intelligenc...
Location
Location
United States , North Bethesda
Salary
Salary:
150000.00 - 196000.00 USD / Year
cherry.vc Logo
Cherry Ventures
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s or PhD in Computer Science, Machine Learning, Applied Mathematics, Electrical Engineering or related field (PhD preferred for deep generative/3D modeling emphasis)
  • 12+ years of professional experience in machine learning, artificial intelligence, or data science roles — with several years in senior or principal capacity leading major programs
  • Demonstrated experience architecting and delivering large scale ML/AI solutions - end-to-end from data ingestion, feature engineering, model training, evaluation, deployment, monitoring & operations
  • Deep expertise in machine learning frameworks (TensorFlow, PyTorch), data engineering, model infrastructure, MLOps, cloud platforms (AWS, GCP, Azure), and scalable production systems
  • Strong exposure to generative AI techniques (large language models, multimodal models, diffusion, GANs) and translating them into business use-cases
  • Excellent cross-functional collaboration skills: you can partner with product, engineering, ops, manufacturing, design, business leadership and translate technical concepts into business language
  • Proven ability to influence without direct authority and drive change across organizations
  • Strong communication and presentation skills
  • you can articulate technical vision, roadmap, trade-offs and outcomes to senior leadership
  • Track record of identifying and delivering measurable business impact via ML/AI - e.g., revenue growth, cost savings, improved efficiency
Job Responsibility
Job Responsibility
  • Serve as the technical leader of multiple large, cross-functional ML/AI solutions with significant, lasting impact across Xometry’s business
  • Define, and drive the 18-24-month ML/AI technical roadmap - balancing breakthrough innovation (e.g., generative 3D, foundation models, large-scale vision/3D pipelines) with reliable business value delivery (e.g., quoting accuracy, lead-time reduction, defect detection, cost optimization)
  • Influence partner roadmaps across engineering, product, operations, and business teams: align priorities, advise on resourcing, champion ML/AI best practices
  • Proactively identify and remove roadblocks for teams and projects — whether technical, operational, data-related, or resource constraints
  • Mentorship of individuals and technical teams
  • Act as a trusted SME with strong cross-functional partnerships: your insights and guidance will shape ML/AI infrastructure, data, model, infrastructure, and tooling decisions
  • Play a leadership role in identifying areas of opportunity — e.g., using ML/AI to unlock new revenue streams (e.g., rapid quoting for new manufacturing modalities, generative design for customers), reduce cost (e.g., automated quality inspection), or optimize efficiency (e.g., 3D-geometry classification, defect detection, generating manufacturing ready models)
  • Address problems adjacent to your sphere of immediate influence: proactively tackle challenges outside direct scope and champion holistic solutions
  • Stay ahead of industry developments in ML, AI, generative AI, 2D/3D modeling and manufacturing tech
  • translate insights into the improvement of internal best practices, tooling, frameworks, model governance, data pipelines, and operationalization
What we offer
What we offer
  • annual bonus
  • 401(k) match
  • medical, dental and vision insurance
  • life and disability insurance
  • generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave
  • EAP, other wellbeing resources
  • Fulltime
Read More
Arrow Right

Senior Principal, Machine Learning & Artificial Intelligence

Xometry is seeking a Senior Principal, Machine Learning & Artificial Intelligenc...
Location
Location
United States , Waltham
Salary
Salary:
150000.00 - 196000.00 USD / Year
cherry.vc Logo
Cherry Ventures
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s or PhD in Computer Science, Machine Learning, Applied Mathematics, Electrical Engineering or related field (PhD preferred for deep generative/3D modeling emphasis)
  • 12+ years of professional experience in machine learning, artificial intelligence, or data science roles — with several years in senior or principal capacity leading major programs
  • Demonstrated experience architecting and delivering large scale ML/AI solutions - end-to-end from data ingestion, feature engineering, model training, evaluation, deployment, monitoring & operations
  • Deep expertise in machine learning frameworks (TensorFlow, PyTorch), data engineering, model infrastructure, MLOps, cloud platforms (AWS, GCP, Azure), and scalable production systems
  • Experience in 3D modeling / geometry / computer vision / generative models (e.g., point-cloud processing, mesh processing, text23D, image23D, CAD/CAM integration) is highly desirable
  • Strong exposure to generative AI techniques (large language models, multimodal models, diffusion, GANs) and translating them into business use-cases
  • Excellent cross-functional collaboration skills: you can partner with product, engineering, ops, manufacturing, design, business leadership and translate technical concepts into business language
  • Proven ability to influence without direct authority and drive change across organizations
  • Strong communication and presentation skills
  • you can articulate technical vision, roadmap, trade-offs and outcomes to senior leadership
Job Responsibility
Job Responsibility
  • Serve as the technical leader of multiple large, cross-functional ML/AI solutions with significant, lasting impact across Xometry’s business
  • Define, and drive the 18-24-month ML/AI technical roadmap - balancing breakthrough innovation (e.g., generative 3D, foundation models, large-scale vision/3D pipelines) with reliable business value delivery (e.g., quoting accuracy, lead-time reduction, defect detection, cost optimization)
  • Influence partner roadmaps across engineering, product, operations, and business teams: align priorities, advise on resourcing, champion ML/AI best practices
  • Proactively identify and remove roadblocks for teams and projects — whether technical, operational, data-related, or resource constraints
  • Mentorship of individuals and technical teams
  • Act as a trusted SME with strong cross-functional partnerships: your insights and guidance will shape ML/AI infrastructure, data, model, infrastructure, and tooling decisions
  • Play a leadership role in identifying areas of opportunity — e.g., using ML/AI to unlock new revenue streams (e.g., rapid quoting for new manufacturing modalities, generative design for customers), reduce cost (e.g., automated quality inspection), or optimize efficiency (e.g., 3D-geometry classification, defect detection, generating manufacturing ready models)
  • Address problems adjacent to your sphere of immediate influence: proactively tackle challenges outside direct scope and champion holistic solutions
  • Stay ahead of industry developments in ML, AI, generative AI, 2D/3D modeling and manufacturing tech
  • translate insights into the improvement of internal best practices, tooling, frameworks, model governance, data pipelines, and operationalization
What we offer
What we offer
  • 401(k) match
  • medical, dental and vision insurance
  • life and disability insurance
  • generous paid time off including vacation, sick leave, floating and fixed holidays, maternity and bonding leave
  • EAP, other wellbeing resources
  • Fulltime
Read More
Arrow Right

Senior Applied AI Engineer, Image Generation

We’re hiring a Senior Applied AI Engineer, Image Generation to join a fast‑movin...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Master’s Degree AND 3+ years of experience in engineering, problem solving, model building, evaluation, data analysis OR equivalent experience.
  • PhD in engineering, applied math, statistics, or related analytical field.
  • 2+ years shipping production-level code, models, or data analysis.
  • 1+ years using AI-assisted coding and analysis techniques.
  • Solid grasp of deep learning: loss functions, optimization, regularization, training stability
  • Experience deploying ML models at scale (inference optimization, quantization, distillation)
  • Familiarity with image preprocessing pipelines, data augmentation, and dataset curation
  • Experience working on small teams and mid-stage startup environments.
  • Experience working on AI products.
Job Responsibility
Job Responsibility
  • Model Development & Training
  • Train, fine-tune, and evaluate image generation models (diffusion, GAN, transformer-based)
  • Implement and adapt techniques from research papers into working production systems
  • Design and run experiments to improve image quality, diversity, and controllability
  • Curate, clean, and manage large-scale image-text training datasets
  • Evaluation, Hillclimbing & Quality Systems
  • Build and maintain evaluation frameworks for correctness, safety, grounding, and UX quality.
  • Run hillclimbing loops across prompts, models, and tool‑use strategies to continuously improve assistant performance.
  • Analyze failure modes, design mitigations, and drive systematic improvements across the stack.
  • LLM Tooling & Internal Infrastructure
  • Fulltime
Read More
Arrow Right

Senior AI Software Architect

Do you want to be at the forefront of innovating the latest hardware designs to ...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Port and optimize large-scale AI models (e.g., foundation models, diffusion models, YOLO) to run efficiently on Maia hardware
  • Integrate models using frameworks such as PyTorch, ONNX, vLLM, and SGLang
  • Apply techniques like KV cache quantization (e.g., BF16 → FP8), checkpointing, and re-sharding for efficient inference and training
  • Experiment with parallelism strategies (TP, PP) and analyze performance impacts across interconnects (NVLink vs PCIe)
  • Collaborate on improving inference pipelines, including KV caching in sglang/vllm and performance tuning at the PyTorch level
  • Work with Triton kernels for basic operations (e.g., FP8 dequantization) and assist in kernel performance analysis
  • Partner with hardware architects and kernel developers for co-design discussions
  • Communicate effectively with multiple stakeholders to align on performance goals and deliverables
  • Fulltime
Read More
Arrow Right
New

Senior Accountant

Our holding company team is looking for a Senior Accountant to help support our ...
Location
Location
United States , Chicago
Salary
Salary:
105000.00 - 115000.00 USD / Year
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in accounting
  • Solid understanding of GAAP compliance
  • Relevant experience
  • Proficiency in MS Office Suite, with advanced working knowledge of Excel
  • Systems Integrations and Mergers and Acquisitions knowledge
  • QuickBooks knowledge
  • Microsoft Business Central knowledge preferred
Job Responsibility
Job Responsibility
  • Implement and maintain consistent accounting practices across portfolio companies
  • Ensure timely and accurate month-end accounting closes for portfolio companies and holding company
  • Assist with preparation of consolidated financial reports for the holding company
  • Oversee offshore and potential US-based resources
  • Support the financial onboarding of newly acquired companies and their finance leaders
  • Manage audit work for the holding company and its portfolio companies
  • Support the building of annual budgets across the portfolio
  • Assist organization-wide initiatives such as system migrations, improving processes & controls, compliance with accounting guidelines, metrics tracking, etc.
What we offer
What we offer
  • medical, vision, dental, life, and disability insurance
  • bonus
  • Fulltime
Read More
Arrow Right
New

Volunteer Vinted Assistant

We’re looking for a Volunteer Vinted Assistant to help source, sell and send clo...
Location
Location
United Kingdom , Chorley
Salary
Salary:
Not provided
jobs.360resourcing.co.uk Logo
360 Resourcing Solutions
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Confident using smartphones, tablets, or computers
  • Enjoys fundraising, but is looking for something that’s a little bit different
  • Good attention to detail and organisational skills
  • A friendly and reliable volunteer
  • Minimum age: 16+
Job Responsibility
Job Responsibility
  • Listing items for sale on Vinted and other online platforms
  • Taking clear photos of donations and writing engaging descriptions
  • Assisting with stock organisation and preparing items for sale
  • Packing sold items
What we offer
What we offer
  • Gain experience in online sales and social media platforms
  • Be part of a friendly and supportive volunteer team
  • Help raise funds to support our vital work
  • Join in with volunteer events and social activities
  • Lunch/travel expenses provided (minimum hours apply)
  • Full training and ongoing support provided
  • Parttime
Read More
Arrow Right
New

Teacher of English

Are you an inspiring and cheerful classroom practitioner with a passion for lite...
Location
Location
United Kingdom , Steyning
Salary
Salary:
31650.00 - 49084.00 GBP / Year
https://www.randstad.com Logo
Randstad
Expiration Date
June 10, 2026
Flip Icon
Requirements
Requirements
  • Subject Expertise: Strong knowledge of English literature and language across the key stages
  • Outstanding Potential: A track record of-or the clear potential to become-an outstanding practitioner
  • Collaborative Mindset: A desire to work within a dedicated team that supports one another to achieve excellence
  • High Expectations: A commitment to the school's "highest expectations" policy for student behavior and academic progress
  • BA Hons (QTS), BEd, BSc Hons (QTS), PGCE, PGDE (Scotland), QTLS, QTS, Schools direct, SCITT
Job Responsibility
Job Responsibility
  • Inspire & Innovate: Deliver engaging English lessons that challenge students and encourage new ways of thinking
  • Nurture Development: Commit to your own professional growth while nurturing the academic and personal development of your students
  • Raise Achievement: Maintain and further improve high levels of motivation and attainment across the department
  • Optimise Learning: Tailor your teaching to ensure that every student, regardless of their starting point, has the tools to be their best
What we offer
What we offer
  • Access to the Teachers' Pension Scheme
  • Free Employee Assistance Programme
  • Retail discounts
  • Cycle to Work scheme
  • Referral Bonus
Read More
Arrow Right