
Principal Open Source AI/ML Solutions Engineer

India, Bangalore · Job Posted March 25, 2026

Job Description

The Senior Member in the GPU domain is a technical role responsible for owning the design, development, and implementation of GPU-related technologies. This position requires a strong understanding of GPU architecture and software development, with the ability to drive innovation in high-performance computing applications. You will develop and optimize the software ecosystem for the next generation of GPU computational accelerators, working closely with platforms like ROCm (https://github.com/ROCm/ROCm).

Job Responsibility

  • Architectural Design: Own architectural design and development of GPU software components, ensuring alignment with industry standards and best practices
  • Technical Leadership: Act as one of the subject matter experts in GPU technologies, providing guidance and mentorship to junior engineers in the team on complex technical challenges
  • Software Development: Design, write, and deliver high-quality open software solutions that enhance GPU performance and capabilities. This includes developing drivers, APIs, and other critical software components
  • Research and Innovation: Conduct research to explore new technologies and methodologies that can improve GPU performance and efficiency. Propose innovative solutions to meet evolving market demands
  • Collaboration: Work collaboratively with cross-functional teams, including hardware engineers, system architects, and product managers, to ensure successful integration of GPU technologies into broader systems
  • Documentation and Standards: Develop comprehensive technical documentation and establish coding standards to ensure maintainability and scalability of software products

Requirements

  • Strong C++ and Python programming skills
  • Performance analysis skills for both CPU and GPU
  • Good knowledge of AI/ML Frameworks and Architecture
  • Basic GPU kernel programming knowledge
  • Experience with software engineering methodologies such as Agile, Scrum, Kanban
  • Experience in all the phases of software development, from requirement gathering, analysis, design, development, testing to final release
  • Experience developing software in an end customer product delivery environment
  • Experience with open-source software development including collaboration with community maintainers and submitting contributions
  • Excellent analytical and problem-solving skills
  • Strong communication skills to effectively convey complex technical concepts to both technical and non-technical stakeholders
  • Ability to work independently and as part of a team
  • Willingness to learn skills, tools, and methods to advance the quality, consistency, and timeliness of AMD software products
  • BE / B-Tech with several years of related experience or M-Tech with years of related experience or PhD with years of related experience in Computer Science or Computer Engineering or related equivalent
  • 18+ years of overall experience

Nice to have

  • Experience with GPU kernel programming using CUDA, HIP or OpenCL
  • Experience in implementing and optimizing parallel methods on GPU accelerators (NCCL/RCCL, OpenMP, MPI)
  • Experience in PyTorch, TensorFlow, JAX
  • Experience with Singularity, Docker, and/or Kubernetes

Similar Jobs for

Principal Open Source AI/ML Solutions Engineer

8 matching positions

Principal AI/ML & Innovation Engineer

We are seeking Principal AI/ML & Innovation Engineer who will be leading initiat...
Location
Puerto Rico, Aguadilla
Salary:
Not provided
https://www.hpe.com/
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 10-15 years’ experience
  • Solid understanding of fundamental AI and machine learning concepts, including supervised and unsupervised learning, deep learning, reinforcement learning, natural language processing, computer vision, and statistical modeling
  • Proficient in implementing and deploying various machine learning algorithms, such as decision trees, random forests, support vector machines, and neural networks
  • Knowledge of popular machine learning frameworks and libraries like TensorFlow, PyTorch, or scikit-learn
  • Strong understanding of GitHub Copilot, Cursor, n8n, vibe coding, Windsurf, and similar technologies
  • Experience in Cloud Infrastructure (AWS, Azure, etc)
  • Knowledge of Open Source, Linux, etc
  • Understanding of DevOps and SRE
  • Expertise in deep learning techniques, architectures, and frameworks (e.g., convolutional neural networks (CNN), recurrent neural networks (RNN), generative adversarial networks (GAN), etc.)
Job Responsibility
  • Designing, developing, and deploying advanced machine learning models and algorithms
  • Leading research initiatives to explore novel approaches and technologies
  • Designing the architecture of AI systems and ensuring scalability, performance, and reliability
  • Collaborating with other teams, such as data scientists, software engineers, and product managers
  • Providing technical leadership and mentorship to junior engineers
  • Overseeing and guiding multiple design review sessions across different projects
  • Partnering with the engineering manager and team lead to establish long-term design and implementation strategies
  • Leading efforts to incorporate feedback loops and continuous improvement processes
  • Leading meetings, ensuring efficient progress tracking, issue resolution, and team coordination
  • Creating and delivering high-level presentations and reports to executive stakeholders
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Fulltime

Principal Software Engineer - Azure Core

We are seeking experienced engineers to help build cloud‑native, open‑source AI ...
Location
United States, Multiple Locations
Salary:
163000.00 - 296400.00 USD / Year
https://www.microsoft.com/
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Go, or Python
  • OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
  • Master's Degree in Computer Science or related technical field AND 12+ years technical engineering experience
  • OR Bachelor's Degree in Computer Science or related technical field AND 15+ years technical engineering experience
  • OR equivalent experience
  • Hands‑on experience building or operating AI/ML training, fine-tuning, and inference platforms in cloud‑native environments
  • Proficiency with Go and/or Python for building platform components, Kubernetes operators/controllers, and integrations in production environments
  • Demonstrated experience contributing to or maintaining open‑source software, especially in the Kubernetes, AI/ML, or cloud‑native ecosystem
Job Responsibility
  • Design, implement, and maintain Kubernetes operators and controllers for AI/ML workloads
  • Partner with product managers, business stakeholders, and users to understand user pain points deeply and create innovative solutions that delight your customers in an agile development environment
  • Contribute to applicable upstream open-source projects
  • Write technical design documents and participate in architecture reviews
  • Mentor team members and external contributors through code reviews
  • Debug and optimize distributed AI systems running at scale
  • Strive for excellence in everything you do: culture, collaboration, process, tools, design, engineering practices, customer experience, performance, security etc.
  • Fulltime

Principal Applied AI Engineer

Security represents the most critical priorities for our customers in a world aw...
Location
India, Hyderabad
Salary:
Not provided
https://www.microsoft.com/
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Data Science or related technical field
  • 9+ years of total experience
  • 5+ years technical engineering experience with coding in languages including C#, Java AND Python
  • 5+ years of data science experience
  • 3 years of experience with LLMs and open-source GenAI frameworks, such as LangChain, LlamaIndex, Haystack, or equivalents (e.g., Transformers, AutoGen, DSPy), including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and fine-tuning and evaluation
  • Proficiency in writing production-quality software code in one or more modern programming languages (Python, C#)
  • 3+ years experience developing software systems end-to-end, from design to implementation
  • 2+ years experience in shipping at least 2 large scale ML/AI-based services or applications on cloud platforms (Azure, AWS, GCP, etc.)
Job Responsibility
  • Design, develop, and deploy end-to-end AI/ML systems, including data ingestion, model training, evaluation, and integration into production environments
  • Build and optimize applications leveraging LLMs and open-source GenAI frameworks such as LangChain, LlamaIndex, Haystack, Transformers, AutoGen, and DSPy
  • Implement advanced GenAI techniques including agent-based orchestration, prompt engineering, retrieval-augmented generation (RAG), and model fine-tuning
  • Write production-grade software in Python and C# or Java, ensuring maintainability, scalability, and performance
  • Collaborate with cross-functional teams to translate business requirements into technical solutions
  • Ship and maintain large-scale AI applications, with a focus on performance monitoring and continuous improvement
  • Conduct rigorous evaluation of AI models using appropriate metrics and benchmarks
  • Optimize models for latency, throughput, and accuracy in real-world scenarios
  • Work closely with data scientists, product managers, and other engineers to drive AI initiatives
  • Stay current with the latest advancements in GenAI, LLMs, and AI frameworks
  • Fulltime

Principal Technical Program Manager - Forward Deployed Engineering

The Industry Solutions Engineering (ISE) team is a global engineering organizati...
Location
Netherlands, Amsterdam
Salary:
99800.00 - 166900.00 EUR / Year
https://www.microsoft.com/
Microsoft Corporation
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree AND demonstrated years experience in engineering, product/technical program management, data analysis, or product development
  • Or equivalent experience
  • Years of experience managing cross-functional and/or cross-team projects
  • Experience in customer-facing technical roles (e.g., consulting, solutions engineering, field engineering, or similar)
  • Experience with AI/ML technologies, cloud architecture, or data engineering
Job Responsibility
  • Embed with strategic customers to lead complex technical programs with high autonomy, aligning business needs with engineering solutions across AI, cloud, and data
  • Take ownership of the customer journey end-to-end, from problem definition through co-engineering delivery to measurable business outcomes
  • Apply AI assisted engineering practices as a daily operating mode
  • Operate fluently in engineering tooling as primary work surfaces for planning, documentation, and technical program leadership
  • Navigate ambiguity in customer environments with self-direction and high ownership
  • Champion and cultivate reusable solution patterns, open-source assets, and engineering playbooks that scale impact beyond individual engagements
  • Partner with multi-disciplinary engineering teams to evaluate, design, and deliver AI-powered cloud solutions side-by-side with customer engineers
  • Fulltime

Artificial Intelligence Security Specialist EMEA

Citi, the leading global bank, has approximately 200 million customer accounts a...
Location
United Kingdom, London; Belfast
Salary:
Not provided
https://www.citi.com/
Citi
Expiration Date
Until further notice
Requirements
  • 5-7+ years for Assistant Vice President (C12 Mid - Senior Level)
  • 8-10+ years for Vice President (C13 Senior - Lead/Staff Level)
  • 10+ years for Senior Vice President (C14 Lead/Staff - Principal Level)
  • Depth in at least one of AI/ML engineering, offensive security, detection engineering, software engineering, or security research
  • Hands-on LLM API experience (context management, tool use, evaluation, failure modes) for AI/ML Engineering
  • Agentic systems design
  • AI safety at the infrastructure level
  • Vulnerability research, exploit development, or pen testing with real depth for Cyber Security
  • Detection engineering for novel attack patterns
  • Threat modelling (STRIDE, ATT&CK)
Job Responsibility
  • Depends on team: Offensive Security & Vulnerability Management — AI-assisted pen testing at a scale previously impossible
  • Automated exploit validation
  • Bridge the gap from 'AI found a vulnerability' to 'the application team has a PR to fix it'
  • AI & Emerging Technology Security — Define how the bank deploys AI safely
  • Security architecture and assurance for new implementations
  • Building the next generation of AI-powered tools for CISO colleagues
  • Test new models at the cutting edge of creation and influence
  • Cyber Security AI Services — Own the AI products CISO depends on in production — security assurance, cyber security operations, governance and controls, vulnerability assessment
  • Keep them reliable, evolve them fast
  • Cyber Security Operations — Detection, triage, and response for a world where adversaries use AI to find and exploit vulnerabilities faster than traditional detection can keep up
What we offer
  • Business casual workplace
  • Hybrid working model (up to 2 days working at home per week)
  • Competitive base salary (annually reviewed)
  • 27 days annual leave (plus bank holidays)
  • Discretionary annual performance-related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Fulltime

Artificial Intelligence Security Specialist EMEA

Job Overview Why Citi Citi, the leading global bank, has approximately 200 mil...
Location
Poland, Warsaw
Salary:
165020.00 PLN / Year
https://www.citi.com/
Citi
Expiration Date
Until further notice
Requirements
  • AI/ML Engineering — Hands-on LLM API experience (context management, tool use, evaluation, failure modes). Agentic systems design. AI safety at the infrastructure level, not just the prompt level.
  • Cyber Security — Vulnerability research, exploit development, or pen testing with real depth. Detection engineering for novel attack patterns. Threat modelling (STRIDE, ATT&CK). Security architecture.
  • Software Engineering — You've built and operated production systems, not just prototypes. Strong Python and/or systems programming. Bonus if you're comfortable reading disassembly or tracing through kernel code.
  • Research & Communication — Can digest dense technical research and turn it into actionable security recommendations. Published research, conference talks, or open-source contributions.
  • Mindset - You love to engineer solutions to problems vs purchasing tools, and you see problems as opportunities
  • At any level: genuinely curious, comfortable with ambiguity, biased toward building, able to work across disciplines.
  • Assistant Vice President (C12 Mid - Senior Level): 5-7+ years. Own workstreams end-to-end with real autonomy. You'll go deep on problems that most organizations don't even know they have yet.
  • Vice President (C13 Senior - Lead/Staff Level): 8-10+ years. Define technical approach, make architectural decisions, mentor others. The scope here is wider than most senior IC roles: you're not optimizing an existing system; you're designing ones that don't exist yet.
  • Senior Vice President (C14 Lead/Staff - Principal Level): 10+ years. Set technical direction for a function and influence the firm's approach to AI security. If you've hit a ceiling elsewhere because the problem space isn't big enough, it's big enough here.
What we offer
  • Private Medical Care Program
  • Life Insurance Program
  • Pension Plan contribution (PPE Program)
  • Employee Assistance Program
  • Paid Parental Leave Program (maternity and paternity leave)
  • Sport Card
  • Holidays Allowance
  • Sport and team recreation activities
  • Special offers and discounts for employees
  • Access to an array of learning and development resources
  • Fulltime

Principal Generative AI Scientist

We are hiring a Data Scientist Specialist! Role: Data Scientist Specialist (Prin...
Location
United States, McLean
Salary:
80.00 - 95.73 USD / Hour
apexsystems.com
Apex Systems
Expiration Date
Until further notice
Requirements
  • PhD in AI, Data Science, or a closely related field (required)
  • 10+ years of AI/ML experience, including 3+ years in applied Generative AI or LLM-based solutions
  • Strong experience with AWS cloud-native AI development, including SageMaker, Bedrock, and MLflow on EKS
  • Deep hands-on expertise with GenAI system design, RAG, agents, prompt engineering, and vector databases
  • Advanced Python skills with extensive use of Jupyter notebooks
  • Experience with Transformers, LangChain, or similar GenAI frameworks
  • Strong understanding of GenAI architectural patterns and best practices
  • GitHub repository link required demonstrating hands-on technical work
Job Responsibility
  • Architect and implement AI agents, agentic workflows, and full-scale GenAI applications
  • Develop, fine-tune, and evaluate LLMs, including Claude, Azure OpenAI models, and open-source alternatives
  • Design and deploy Retrieval-Augmented Generation (RAG) and Graph RAG solutions using vector databases and knowledge bases
  • Implement Model Context Protocol (MCP) and Agent-to-Agent (A2A) communication patterns
  • Build and maintain Jupyter-based workflows using SageMaker, MLflow, Kubeflow, and Kubernetes (EKS)
  • Collaborate with UI, microservices, product, and data engineering teams to deliver end-to-end GenAI experiences
  • Integrate GenAI services via API-driven enterprise integration patterns
  • Establish evaluation frameworks, safety guardrails, bias mitigation, and validation processes for production deployment
  • Design and own data ingestion pipelines (extracting, chunking, enriching, anonymizing, and embedding data)
  • Orchestrate multimodal ETL/ELT pipelines and scalable ingestion workflows
What we offer
  • Medical, dental, vision, life, disability, and other insurance plans
  • ESPP (employee stock purchase program)
  • 401K program
  • HSA (Health Savings Account on the HDHP plan)
  • SupportLinc Employee Assistance Program (EAP) with up to 8 free counseling sessions
  • Fulltime