CrawlJobs Logo

Fellow, AI Software Architecture

United States, San Jose Employment contract 268000.00 - 402000.00 USD / Year · Job Posted April 24, 2026
Apply Position
Job Link Share

Job Description

AMD AI Group is seeking a highly influential technical leader for the role of AMD Fellow, OneROCm — driving a unified ROCm software stack across AMD's broad product portfolio, including Instinct, Radeon, Ryzen, Embedded, Game Consoles, and Autonomous Driving platforms. This is a rare opportunity to lead architecture, drive innovation, and help develop next-generation products at company-wide scale. The ideal candidate will define the end-to-end ROCm software strategy and influence the full stack, spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers. The role also requires strong hardware/software co-design leadership to maximize performance across diverse AMD products and workloads. The ideal candidate is expected to be highly hands-on and embrace agentic AI workflows.

Job Responsibility

  • Strategic Leadership: Set the technical vision and roadmap for AI software stack across all AMD products, ensuring AMD remains the platform of choice for top-tier AI customers
  • Hardware-Software Co-design: Collaborate across hardware architecture, compiler, math libraries, kernel and framework teams to influence future silicon features based on evolving AI workload trends
  • Workload Performance Engineering: Lead the profiling, analysis, and tuning of large-scale models (LLMs, Diffusion, Multimodal, and MoE) to ensure 'out-of-the-box' performance excellence on AMD hardware
  • Ecosystem Innovation: Drive the development of advanced tools and frameworks for performance estimation, modeling, and automated reporting
  • Customer Engagement: Partner with top customers and hyperscalers to understand their unique workload requirements and deliver tailored architectural wins and software optimizations
  • Community & Mentorship: Act as a technical ambassador in industry forums and open-source communities. Mentor and inspire the next generation of AMD's technical leaders and engineers.

Requirements

  • Knowledge in GPU architectures, basic knowledge of CPU architecture
  • Experience in AI/ML software stack spanning compilers, kernels, runtime, libraries, models, frameworks, and performance optimization layers
  • Understanding of GPU programming such as ROCm, CUDA, OpenCL, etc
  • Experience in hardware/software co-design, building high-performance products across the full product lifecycle
  • Experience with operating systems (OS) and device driver development is a plus
  • Undergrad degree required. Bachelor of Science, Masters, or PhD degree with emphasis in Electrical Engineering, Computer architecture, or Computer Science with relevant experience preferred

Nice to have

Experience with operating systems (OS) and device driver development is a plus

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Fellow, AI Software Architecture

8 matching positions

Ai & Software Engineer Lead - Backend

As an Lead Software Engineer at PMG, you’ll join a dynamic team that thrives on ...
Location
Location
United States , Dallas
Salary
Salary:
Not provided
pmg.com Logo
PMG
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of backend development experience, with a strong focus on building scalable, high-performance systems, or a related field, with a bachelor’s or master’s degree or equivalent experience
  • 3+ years working with modern backend frameworks such as Django, Flask, or Express.js
  • Proven experience designing and integrating RESTful APIs and microservices architectures
  • Hands-on experience working across all stages of software development, including requirements, design, coding, testing, implementation, and support
  • Experience mentoring and guiding engineers to achieve technical excellence
  • Proficiency in backend languages like Node.js, Python, or Go with a focus on clean and maintainable code
  • Advanced skills in working with databases such as PostgreSQL, MongoDB, Redis, or MySQL, including query optimization and schema design
  • Familiarity with AI technologies like OpenAI, Bedrock, Vertex, and LangChain to explore and implement innovative backend solutions
  • Experience with version control tools like Git and CI/CD pipelines to automate testing and deployments
  • Knowledge of infrastructure-as-code tools such as Terraform or Ansible to manage infrastructure efficiently
Job Responsibility
Job Responsibility
  • Leading the design, development, and deployment of innovative, scalable features within the Alli suite of applications
  • Drafting and implementing architectural designs to produce reliable, complex, and highly scalable technical solutions
  • Working closely with Product Managers, Designers, and cross-functional teams to integrate AI-driven solutions that improve performance and drive business outcomes
  • Building scalable products that drive measurable results for our clients
  • Fostering a culture of collaboration and innovation by mentoring junior and mid-level engineers and promoting development best practices
  • Conducting coding, debugging, testing, and troubleshooting across the application development lifecycle
  • Conducting code reviews and providing fellow backend engineers critical feedback to ensure high-quality, maintainable code across Alli
  • Pioneering development in state-of-the-art technologies that transform the marketing technology space
What we offer
What we offer
  • Professional Development: Take advantage of our learning and development programs, mentorship opportunities, and career advancement support
  • Generous Time Off: Enjoy generous paid time off and holiday allowances to recharge and spend time with loved ones
  • Parental Leave: We provide paid parental leave to support your family during important life events
  • Retirement & Pension Plans: Plan for your future with competitive retirement or pension programs, including contribution matching
  • Fertility and Family Support: Access fertility benefits for all team members and their spouses
  • Healthcare: Coverage and support for everyday medical expenses and routine care, tailored by geography
  • Pet Insurance: Protect your pet's health and your finances
  • Lifestyle Spending Accounts: Enjoy 100% company-funded accounts to promote healthy habits and well-being
  • Commuter Benefits: Access support for travel and commuting needs, where available
  • Annual Bonus: All employees are eligible for an annual bonus
Read More
Arrow Right

Senior Software Engineer - AI Engineering and Productivity

The Role The AI Engineering and Productivity team in the Global Planning, Desig...
Location
Location
United States , Austin; Warren
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, Software Engineering, Information Systems, Engineering, or a related field, OR equivalent experience
  • 6+ years of experience delivering enterprise or full stack software solutions using Java / JEE, Python , and preferably Angular
  • 3+ years of experience working with complex SQL queries, functions, and stored procedures, including performance tuning and optimization against large datasets
  • Experience building or supporting data pipelines , ETL/ELT processes, or datacentric applications on distributed or cloud platforms (e.g., Databricks, Spark , or similar)
  • 3+ years of experience with Kubernetes/Docker, Quarkus , and cloud platforms such as Azure, AWS , or GCP
  • Experience working in Agile/SCRUM development methodologies, including backlog refinement, sprint planning, and incremental delivery
  • Hands on experience with modern DevOps practices such as Git/GitHub, code reviews, automated builds, automated testing, and CI/CD pipelines (e.g., GitHub Actions)
  • Willingness and demonstrated ability to learn and apply AI concepts , including working with data and APIs that support AI/ML and LLM based solutions
  • Strong problem solving skills with the ability to break down complex technical and data challenges into clear, actionable steps and deliver high quality solutions
  • Excellent written and verbal communication skills with the ability to collaborate with both technical and nontechnical stakeholders
Job Responsibility
Job Responsibility
  • Design, develop, and maintain data driven and AI-enabled applications and services that support Product Development engineering teams
  • Write high-quality, performant SQL (queries, functions, stored procedures) for complex data transformations and modeling across enterprise data platforms (e.g., SQL Server, Oracle, PostgreSQL)
  • Build and optimize data pipelines and workflows in Databricks (DBX) and related tools to support batch and near realtime data processing
  • Develop backend services and APIs in Java and/or Python that integrate data, business rules, and user workflows into robust, reusable components
  • Develop enterprise grade applications using Kubernetes/Docker, Quarkus, Java, Angular, PostgreSQL, and other GM approved tools
  • Partner with data science and AI teams to productionize AI/ML and LLM based solutions, including feature pipelines, inference integrations, monitoring, and continuous improvement
  • Proactively identify and remediate issues related to code quality, patterns, performance, security, and data correctness, using code quality analysis tools and remediation techniques
  • Lead or contribute to solution design, including architecture, patterns, and technology choices aligned with GM standards and Statement of Technical Direction
  • Apply and champion software engineering best practices, including code reviews, automated testing, branching strategies, CI/CD pipelines (e.g., GitHub Actions), observability, and secure coding practices
  • Collaborate with cross-functional teams (product owners, data engineers, architects, business stakeholders) to refine requirements, define acceptance criteria, and deliver incremental value in an Agile/SCRUM environment
What we offer
What we offer
  • Relocation benefits
  • Fulltime
Read More
Arrow Right

Research Intern - AI Systems & Architecture

Research Internships at Microsoft provide a dynamic environment for research car...
Location
Location
United States , Mountain View
Salary
Salary:
6710.00 - 13270.00 USD / Month
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Currently enrolled in a PhD program in Computer Science, Electrical/Computer Engineering, or a related field
  • Research Interns are expected to be physically located in their manager’s Microsoft worksite location for the duration of their internship
  • submit a minimum of two reference letters for this position as well as a cover letter and any relevant work or research samples
Job Responsibility
Job Responsibility
  • Investigate emerging AI system architectures and analyze how hardware, software, and model behavior interact across large-scale inference workloads
  • Develop and evaluate analytical or simulation-based performance models to identify system bottlenecks, scalability limits, and optimization opportunities
  • Prototype or assess new inference mechanisms, including disaggregated execution, sparse/expert model scaling, and hierarchical attention techniques
  • Explore next-generation accelerator, memory-architecture, and interconnect technologies, assessing their architectural trade-offs and cost implications
  • Conduct experiments, synthesize research findings, and communicate results to mentors and collaborating researchers
  • Collaborate with fellow interns and researchers to advance new ideas in AI systems and architectural design
  • Fulltime
Read More
Arrow Right

Software Engineer, AI Agent

We are looking for senior and staff-level fullstack engineers eager to join the ...
Location
Location
United States , San Francisco; New York
Salary
Salary:
160000.00 - 270000.00 USD / Year
hex.tech Logo
Her
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • At least 4+ years of software engineering experience
  • Proficiency with React and Typescript in addition to knowledge around architecture and systems design
  • Experience leading and executing on major product initiatives
  • Excitement for getting your hands dirty and diving deep into how the browser works to develop high-quality and performant user experiences
  • An aptitude for how web infrastructure works to develop high-quality internal developer and external user experiences
  • An instinct for strategic thinking and aligning with business and product goals while keeping a healthy balance of velocity and engineering excellence
  • Interest in the data space, and a love of shipping great products and building tools that empower end users to do more
  • Experience maintaining a high quality bar for design, correctness, and testing
  • An ability to lead product initiatives while collaborating and mentoring fellow engineers
  • Curiosity and an interest in diving into the bigger picture of building a company, including go-to-market, customer development, people, and marketing
Job Responsibility
Job Responsibility
  • Take the lead on building AI-powered features into Hex
  • Partner closely with our Research team on context and model prompting, and the rest of our product engineering teams on integrating into the platform
  • Own the scalable agent infrastructure powering today’s most powerful data agents, as well as the cutting-edge experimentation tooling that allows the rest of the team to rapidly deliver new AI features in the product
What we offer
What we offer
  • Market-benched salary & equity
  • Comprehensive health benefits
  • Flexible paid time off
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Fellow Experience

As a Senior Software Engineer on the Fellows Allocations team, you will architec...
Location
Location
United States , San Francisco
Salary
Salary:
200000.00 - 235000.00 USD / Year
joinhandshake.com Logo
Handshake
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software engineering experience with a backend-leaning orientation and demonstrated tech lead capabilities
  • Strong proficiency in TypeScript and experience with full-stack development
  • our stack includes Next.js, TRPC, GraphQL, and PostgreSQL
  • Experience building and scaling larger workflows and asynchronous job orchestration (experience with Temporal or similar systems is a plus)
  • Proven ability to architect systems at scale, ideally with experience at larger, more established companies solving complex problems
  • Strong ownership mentality with high availability and dedication to the team's success
  • Product engineer mindset
  • you think beyond the code to understand user impact and business outcomes
  • Excellent communication skills and ability to partner effectively with product managers and cross-functional teams
  • Opinionated about engineering best practices with the confidence to question existing code structures and drive improvements
Job Responsibility
Job Responsibility
  • Lead the architecture and long-term technical vision for scaling our automatic allocations system, designing solutions that match fellows to projects with precision and efficiency
  • Drive engineering excellence by establishing standards and best practices that elevate the team's technical capabilities
  • Build and own the full journey from fellow verification through successful onboarding to the start of tasking
  • Develop high-quality, high-velocity code while maintaining a strong balance between speed and product reliability
  • Partner closely with product managers to deliver seamless user experiences, bringing strong product sense to technical decisions
  • Mentor and grow engineers on the team, fostering a culture of ownership and technical excellence
  • Respond to and resolve emergency situations tied to fellow onboarding volume, quality, and project matching
What we offer
What we offer
  • Ownership: Equity in a fast-growing company
  • Financial Wellness: 401(k) match, competitive compensation, financial coaching
  • Family Support: Paid parental leave, fertility benefits, parental coaching
  • Wellbeing: Medical, dental, and vision, mental health support, wellness stipend
  • Growth: Learning stipend, ongoing development
  • Remote & Office: Internet, commuting, and free lunch/gym in our SF office
  • Time Off: Flexible PTO, 15 holidays + 2 flex days
  • Connection: Team outings & referral bonuses
  • Fulltime
Read More
Arrow Right

Senior Software Engineer (Applied AI)

As a Senior Software Engineer (Applied AI), you will play a critical hands-on ro...
Location
Location
United States , Seattle; New York City; Boston
Salary
Salary:
130000.00 - 200000.00 USD / Year
pearlhealth.com Logo
Pearl Health
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5-8+ years of professional experience in software engineering, with a strong foundation in service-oriented architectures and distributed systems
  • Hands-on experience building and productionizing Applied AI/LLM features, including working with RAG architectures, vector databases, embedding models, and/or Agentic workflows
  • Experience with observability and evaluation practices for production LLM systems (prompt tracking, quality metrics, cost monitoring)
  • Strong proficiency in Python, relational databases, and a major cloud platform (AWS preferred)
  • A deep understanding of modern service design principles, including RESTful and event-driven architectures
  • Proven experience designing, building, and optimizing data-intensive applications
  • A demonstrated history of mentoring engineers and driving technical best practices within a team
  • A strong background in performance optimization, reliability engineering, and security best practices
Job Responsibility
Job Responsibility
  • Design, build, and own production AI features powered by LLMs, including RAG architectures (chunking strategies, semantic search, vector databases) and Agentic workflows
  • Develop high-performance data pipelines, APIs, and microservices that process healthcare data at scale and securely integrate LLM outputs into user-facing experiences
  • Execute Proof-of-Concepts (POCs) and technical evaluations of new AI technologies to validate product viability and scalability
  • Build responsive web applications using modern frontend frameworks to deliver intuitive, user-facing intelligence and analytic features
  • Ensure observability, monitoring, and operational excellence for AI-powered services, championing security and regulatory compliance (HIPAA, SOC2)
  • Drive architectural decisions and system optimizations for AI features in close collaboration with product and engineering leadership
  • Own technical projects from discovery to delivery with autonomy, ensuring solutions align with business needs and long-term scalability
  • Mentor and upskill fellow engineers on Applied AI best practices, fostering a strong culture of technical excellence and collaborative growth
  • Contribute to the team's understanding of LLM capabilities, limitations, and best practices within the healthcare domain
  • Participate in thorough design and code reviews, raising the bar for technical quality across the team
What we offer
What we offer
  • discretionary performance bonus
  • equity options
  • competitive benefits package
  • Fulltime
Read More
Arrow Right

Intermediate Software Engineer - Artificial Intelligence (AI)

Tucows Domains is the world’s largest wholesale domain registrar, responsible fo...
Location
Location
Canada , Toronto
Salary
Salary:
100350.00 - 111500.00 CAD / Year
tucows.com Logo
Tucows
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Software Engineering, Computer Science, or a related field
  • 3+ years of professional software engineering experience in production environments
  • Strong proficiency in Python and Golang
  • Solid foundation in software design principles, patterns, and service-oriented architecture
  • Experience contributing to scalable systems and component-level architecture
  • Ability to design and build RESTful APIs for model serving and AI-enabled workflows
  • Working knowledge of relational/SQL databases (preferably PostgreSQL) and data modeling for AI use cases
  • Strong understanding of modern LLM concepts, including transformer architectures and attention mechanisms
  • Hands-on experience adapting and deploying open-source models (e.g., LLaMA, Mistral, Mixtral) using tools like Ollama or Hugging Face Transformers
  • Experience with fine-tuning techniques (e.g., LoRA, QLoRA, PEFT) for domain-specific adaptation
Job Responsibility
Job Responsibility
  • Design and build AI-driven features for our domain services platform using Python and Golang
  • Integrate and fine-tune open-source models such as LLaMA 3.2 and similar cutting-edge architectures via tools like Ollama
  • Research, evaluate, and implement emerging AI technologies that align with our vision for smarter, more intuitive products and services
  • Collaborate with internal stakeholders and fellow engineers to rapidly prototype and iterate on machine learning and LLM-based features
  • Contribute to a modern AI development stack, ensuring scalability, performance, and ethical usage of models
  • Actively participate in the open-source ecosystem and bring relevant tools and techniques back to the team
What we offer
What we offer
  • Fair compensation and generous benefits
  • Commitment to inclusion across race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status or disability status
  • Reasonable accommodation for individuals with disabilities
  • Fulltime
Read More
Arrow Right

Core Engineering Java Development Lead – FX Tech

Engineer the future of global finance. At Citi, our Tech team doesn't just suppo...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience building backend systems using Java and frameworks such as Spring with deep understanding of the JVM ecosystem
  • experience using AI tooling to accelerate design exploration, prototyping and embedding AI capabilities into product workflows and services
  • Experience designing systems using event-driven approaches (e.g. Solace, Kafka, Web Sockets or similar technologies), with a solid grounding in Domain-Driven Design
  • Strong grasp of SOLID principles, design patterns, dependency injection and persistence technologies
  • Solid knowledge of algorithms and data structures, with the ability to reason about performance, complexity and scalability
  • Experience building software for cloud environments, including containerization and modern deployment practices
  • Understanding of secure coding practices, fault tolerance and building reliable systems in distributed environment
Job Responsibility
Job Responsibility
  • Designing, developing and maintaining high-performance Java services that are scalable, secure and resilient
  • Applying Domain-Driven Design and message-/event-driven design principles to build loosely coupled, well-structured systems
  • Writing clean, maintainable code and contributing to peer reviews, championing best practices and continuous improvement
  • Helping modernise and evolve existing platforms, balancing pragmatic delivery with long-term architectural health
  • Working closely with trading, quants and fellow engineers to turn complex business requirements into robust technical solutions
  • Contributing to improvements in build tooling, CI/CD pipelines, testing approaches and overall engineering productivity
  • Supporting systems in production, improving observability, performance and resilience
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right