Senior AI Software Architect

United States, Redmond · 119800.00 - 234700.00 USD / Year · Job Posted January 29, 2026

Job Description

Do you want to be at the forefront of innovating the latest hardware designs to propel Microsoft’s cloud growth? Are you seeking a unique career opportunity that combines technical capabilities and cross-team collaboration with business insight and strategy? Join the Systems Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft’s expanding cloud infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission.

We are seeking a highly skilled Senior AI Software Architect to join our team focused on model enablement and performance optimization for Maia accelerators. This role is ideal for someone with strong experience in PyTorch-based model development, quantization techniques, and parallelization strategies at the framework level. You will work closely with hardware and software teams to bring up models on Maia and ensure they run efficiently.

Job Responsibility

  • Port and optimize large-scale AI models (e.g., foundation models, diffusion models, YOLO) to run efficiently on Maia hardware
  • Integrate models using frameworks such as PyTorch, ONNX, vLLM, and SGLang
  • Apply techniques like KV cache quantization (e.g., BF16 → FP8), checkpointing, and re-sharding for efficient inference and training
  • Experiment with parallelism strategies (TP, PP) and analyze performance impacts across interconnects (NVLink vs PCIe)
  • Collaborate on improving inference pipelines, including KV caching in SGLang/vLLM and performance tuning at the PyTorch level
  • Work with Triton kernels for basic operations (e.g., FP8 dequantization) and assist in kernel performance analysis
  • Partner with hardware architects and kernel developers for co-design discussions
  • Communicate effectively with multiple stakeholders to align on performance goals and deliverables

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Nice to have

  • Bachelor's Degree in Computer Science or Engineering
  • 3+ years of strong hands-on experience with PyTorch and model optimization techniques
  • Practical knowledge of quantization techniques such as PTQ/QAT, especially for KV cache quantization
  • Familiarity with parallelization strategies and distributed training concepts (e.g., sharding, allreduce)
  • 2+ years of experience with AI inference stacks like SGLang/vLLM and performance profiling
  • Excellent problem-solving and communication skills, and the ability to work in a collaborative team environment
  • 3+ years of experience in Triton kernels and CUDA programming (basic understanding is acceptable but willingness to learn is essential)
  • Experience with AI accelerator hardware and embedded systems
  • 3+ years of prior work on efficient model checkpointing, resharding scripts, and large-scale model deployments for serving at scale

Similar Jobs for Senior AI Software Architect

8 matching positions

Senior Software Engineer, Managed AI - AI Platform

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'...
Location: United States, San Francisco, CA; Sunnyvale, CA
Salary: 172425.00 - 209000.00 USD / Year
Company: Crusoe (crusoe.ai)
Expiration Date: Until further notice
Requirements
  • Advanced degree in Computer Science/Engineering
  • 4-5+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
  • Experience with distributed systems, cloud services (compute, storage, networking, database), and delivering early-stage projects quickly
  • Experience with Generative AI (LLMs, Multimodal) and familiar with AI infrastructure (training, inference, ETL pipelines)
  • Proficient with container runtimes (e.g., Kubernetes), microservices, REST APIs, gRPC, and the full software development lifecycle including CI/CD
Job Responsibility
  • Lead the design and implementation of core AI services, including resilient fault-tolerant queues for efficient task distribution, model catalogs for managing and versioning AI models, and scheduling mechanisms optimized for cost and performance
  • Architect and scale infrastructure to handle millions of API requests per second
  • Implement robust monitoring and alerting to ensure system health and 24/7 availability
  • Collaborate closely with product management, business strategy, and other engineering teams to define the AI platform roadmap
  • Influence the long-term vision and architectural decisions of the platform
  • Contribute to open-source AI frameworks and actively participate in the AI community
  • Prototype and rapidly iterate on emerging technologies and new features
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
Employment type: Full-time

Software Engineer II and Senior Software Engineer - AI Compilers

The AI Frameworks team at Microsoft develops the AI software used to train and d...
Location: United States, Mountain View
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
  • Invent and implement innovative compiler features and advanced optimization passes, leveraging tools such as LLVM, MLIR, Torch Dynamo, and Triton
  • Develop code generation techniques for new hardware platforms
  • Design and develop cutting edge AI software in C++ and Python
  • Optimize AI workloads
  • Design new programming abstractions for AI
  • Collaborate broadly across multiple disciplines from hardware architects to ML developers
  • Identify requirements, plan and design solutions, estimate effort, and schedule deliverables
  • Help establish and drive the adoption of outstanding coding standards and patterns and help enhance our inclusive engineering culture
  • Embody Microsoft's culture and values
Employment type: Full-time

Cloud Solution Architect and Senior Cloud Solution Architect - Data and AI

We are looking for Cloud Solution Architect (CSA) and Senior Cloud Solution Arch...
Location: United States, Multiple Locations
Salary: 85100.00 - 169800.00 USD / Year
Company: Microsoft Corporation (https://www.microsoft.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field AND 2+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • Authorization to work in the United States that does not now or in the future require employer sponsorship
  • 1+ years of experience with AI/ML and/or Generative AI technology
  • 2+ years of customer-facing experience providing recommendations to, or collaborating with, mid-to-senior level executives to address and advance technology transformation initiatives, enablements, and outcomes, including Data and AI solutions
  • 2+ years of enterprise experience in ANY of the following: Microsoft Fabric, Azure Databricks, Microsoft Purview, Azure SQL, PostgreSQL, MySQL, and Cosmos DB
Job Responsibility
  • Seek to understand customers’ overall data estate, business and IT priorities, and success measures to design Data & Analytics solutions that drive business value and positive customer satisfaction, and become a trusted advisor
  • Ensure that solution exhibits high levels of performance, security, scalability, maintainability, repeatability, appropriate reusability, and reliability upon deployment and provide feedback and insights from customers/partners
  • Develop opportunities to drive Customer Success business results & help Customers get value from their Microsoft investments and identify resolutions to Customer blockers by leveraging SA subject matter expertise
  • Deliver according to MS best practices & using repeatable Intellectual Property (IP)
  • Apply technical knowledge to architect and design solutions that meet business and IT needs, create AI roadmaps, drive Proof of Concepts (POC) and Minimal Viable Product (MVP), and ensure long term technical viability of new deployments, infusing key AI technologies where appropriate
  • Be the Voice of Customer to share insights and best practices, connect with Engineering team to remove key blockers and drive product improvements
  • Maintain technical skills and knowledge, keep up to date with market trends and competitive insights, and collaborate and share with the AI technical community while educating customers on the Azure platform
  • Accelerate customer outcomes - Share expertise, contribute to IP creation & re-use to accelerate customer outcomes and obtain relevant accreditations and certifications
Employment type: Full-time

Senior AI Software Developer

The Senior AI Engineer owns end-to-end delivery of AI features—from design to pr...
Salary: Not provided
Company: Hewlett Packard Enterprise (https://www.hpe.com/)
Expiration Date: Until further notice
Requirements
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 7-10 years’ experience
  • LLMs & Agents: Prompt engineering, function/tool calling, orchestration frameworks, RAG
  • ML/DS: Evaluation metrics (precision/recall, BLEU/ROUGE where relevant), error analysis
  • Data/RAG: Embeddings, similarity (cosine/IP), chunking, rerankers, vector DB operations
  • Backend: Python (FastAPI/Flask), microservices patterns
  • MLOps/Infra: Docker, Kubernetes, CI/CD, artifact management, GPU scheduling
  • Observability: Metrics/logging/tracing, dashboards, automated evaluation pipelines
  • Frameworks: PyTorch/TensorFlow, Hugging Face, LangChain/LlamaIndex
  • Data: Pandas, SQL/NoSQL, Parquet/Arrow, Kafka/queues
Job Responsibility
  • Translate high-level designs into clear component contracts, APIs, and service boundaries
  • Implement LLM integrations, RAG pipelines, agents, tool/function calling, and prompt strategies
  • Own feature delivery for sprints/releases; maintain high code quality and documentation
  • Fine-tune models when needed; design evaluation harnesses and metrics
  • Build A/B testing setups; track accuracy, latency, robustness, and task success rates
  • Conduct error analysis; iterate using feedback efficacy loops and prompt refinement
What we offer
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion

Senior AI Software Engineer

AnaVation is seeking a Senior Agentic-AI Software engineer to join our team that...
Location: United States, Chantilly
Salary: Not provided
Company: AnaVation (anavationllc.com)
Expiration Date: Until further notice
Requirements
  • Active TS/SCI clearance within last 24 months
  • BA/BS in Computer Science or related field
  • BS + 10 Yrs or MS + 8 Yrs experience in computer science, AI, Machine Learning, or related field
  • 5+ years of experience in AI/ML development
  • At least 2 years focused on Agentic AI or autonomous systems
  • Proven track record of deploying production-grade AI systems
  • Strong problem-solving skills
  • Ability to work in a fast-paced, collaborative environment
Job Responsibility
  • Design, develop, and deploy advanced Agentic AI systems that autonomously perform complex tasks, make decisions, and interact with dynamic environments
  • Collaborate with cross-functional teams to deliver scalable, efficient, and ethical AI solutions
  • Architect and implement agentic AI systems capable of autonomous decision-making, task planning, and execution
  • Design and integrate multi-agent systems to solve complex problems
  • Develop and fine-tune large language models (LLMs) and reinforcement learning (RL) models
  • Implement robust APIs and interfaces to integrate AI agents with external systems
  • Optimize AI models for performance, scalability, and low-latency inference
  • Conduct rigorous testing, validation, and monitoring of AI agents
  • Collaborate with product managers, data scientists, and software engineers
  • Stay updated on latest advancements in Agentic AI, LLMs, and RL
What we offer
  • Generous cost sharing for medical insurance for employee and dependents
  • 100% company paid dental insurance for employees and dependents
  • 100% company paid long-term and short term disability insurance
  • 100% company paid vision insurance for employees and dependents
  • 401k plan with generous match and 100% immediate vesting
  • Competitive Pay
  • Generous paid leave and holiday package
  • Tuition and training reimbursement
  • Life and AD&D Insurance
Employment type: Full-time

Senior Software Engineer - AI Engineering

RTB House is a global company that provides state-of-the-art marketing technolog...
Salary: Not provided
Company: RTB House (rtbhouse.com)
Expiration Date: Until further notice
Requirements
  • Pragmatic Architect: Proven ability to evaluate third-party tools and vendor solutions against custom-built software to find the most efficient path forward
  • Technical Stack: High proficiency in Python is mandatory. Significant experience with at least one other language (Java, Go, TypeScript, or Scala) is highly preferred
  • AI/ML Implementation: Hands-on experience integrating LLMs into production environments
  • Systems Thinker: Ability to define technical roadmaps for specific features and drive high-level design choices that prioritize maintainability and performance
  • C1 level in English and Polish.
Job Responsibility
  • Drive Technical Excellence: Act as a technical pillar within the Lab, implementing high-standard code and sophisticated system designs. You will mentor mid-level peers and lead deep-dive code reviews
  • Architect Multi-Agent Systems: Design and deploy distributed systems and multi-agent architectures that automate complex engineering tasks. You will own the architectural decisions for 'build vs. integrate' strategies
  • Innovate with Agentic AI: Spearhead the evaluation and prototyping of LLMs, Agentic frameworks, and Model Context Protocols (MCPs). You will transform theoretical AI advancements into production-ready tools
  • Own the Full Lifecycle: Take responsibility for the entire development cycle, from initial concept and API integration to production deployment and long-term scalability
  • Influence Product Strategy: Partner with Product and Engineering Managers to ensure the Lab's innovations align with the broader company roadmap and provide measurable ROI to our developers.
What we offer
  • Projects focused on extreme performance and high code quality – clean code and solid code reviews are our standard
  • Collaboration within an interdisciplinary, self-sufficient team (including DevOps, database experts, backend developers, product designers, and QA engineers)
  • Access to modern technologies and the opportunity to apply them in large-scale, high-impact projects.
Employment type: Full-time

Senior Software Engineer - AI Engineering and Productivity

The Role The AI Engineering and Productivity team in the Global Planning, Desig...
Location: United States, Austin; Warren
Salary: Not provided
Company: General Motors (gm.com)
Expiration Date: Until further notice
Requirements
  • Bachelor’s Degree in Computer Science, Software Engineering, Information Systems, Engineering, or a related field, OR equivalent experience
  • 6+ years of experience delivering enterprise or full-stack software solutions using Java / JEE, Python, and preferably Angular
  • 3+ years of experience working with complex SQL queries, functions, and stored procedures, including performance tuning and optimization against large datasets
  • Experience building or supporting data pipelines, ETL/ELT processes, or data-centric applications on distributed or cloud platforms (e.g., Databricks, Spark, or similar)
  • 3+ years of experience with Kubernetes/Docker, Quarkus, and cloud platforms such as Azure, AWS, or GCP
  • Experience working in Agile/SCRUM development methodologies, including backlog refinement, sprint planning, and incremental delivery
  • Hands-on experience with modern DevOps practices such as Git/GitHub, code reviews, automated builds, automated testing, and CI/CD pipelines (e.g., GitHub Actions)
  • Willingness and demonstrated ability to learn and apply AI concepts, including working with data and APIs that support AI/ML and LLM-based solutions
  • Strong problem-solving skills with the ability to break down complex technical and data challenges into clear, actionable steps and deliver high-quality solutions
  • Excellent written and verbal communication skills with the ability to collaborate with both technical and nontechnical stakeholders
Job Responsibility
  • Design, develop, and maintain data-driven and AI-enabled applications and services that support Product Development engineering teams
  • Write high-quality, performant SQL (queries, functions, stored procedures) for complex data transformations and modeling across enterprise data platforms (e.g., SQL Server, Oracle, PostgreSQL)
  • Build and optimize data pipelines and workflows in Databricks (DBX) and related tools to support batch and near real-time data processing
  • Develop backend services and APIs in Java and/or Python that integrate data, business rules, and user workflows into robust, reusable components
  • Develop enterprise-grade applications using Kubernetes/Docker, Quarkus, Java, Angular, PostgreSQL, and other GM-approved tools
  • Partner with data science and AI teams to productionize AI/ML and LLM-based solutions, including feature pipelines, inference integrations, monitoring, and continuous improvement
  • Proactively identify and remediate issues related to code quality, patterns, performance, security, and data correctness, using code quality analysis tools and remediation techniques
  • Lead or contribute to solution design, including architecture, patterns, and technology choices aligned with GM standards and Statement of Technical Direction
  • Apply and champion software engineering best practices, including code reviews, automated testing, branching strategies, CI/CD pipelines (e.g., GitHub Actions), observability, and secure coding practices
  • Collaborate with cross-functional teams (product owners, data engineers, architects, business stakeholders) to refine requirements, define acceptance criteria, and deliver incremental value in an Agile/SCRUM environment
What we offer
  • Relocation benefits
Employment type: Full-time

Principal AI Software Engineer, Senior Vice President

Are you looking for a career move that will put you at the heart of a global fin...
Location: United Kingdom, London
Salary: Not provided
Company: Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • Exceptional Python Expertise: Demonstrated mastery of core Python, including advanced features, performance optimization, and a deep understanding of the FastAPI framework
  • Prior hands-on experience with Generative AI, Large Language Model (LLM) frameworks (e.g. LangChain, LlamaIndex), and their application in enterprise environments is a must. This must be underpinned by a profound understanding of core machine learning principles, algorithms, and data science methodologies
  • Full Lifecycle Ownership: Extensive hands-on experience and technical authority throughout the entire software development lifecycle, from conceptualization and design to implementation, deployment, and operational ownership of enterprise software solutions, involving significant cross-functional collaboration
  • Strategic System Design: Significant hands-on experience in architecting and designing (architecture, design patterns, reliability, scaling) highly complex new and current systems with broad technical impact
  • Hands-on expertise with containerized deployment technologies (e.g. Kubernetes, OpenShift, Docker) and orchestration strategies
  • Hands-on experience and in-depth understanding of C++ is a significant bonus, particularly for complex code analysis, parsing, and integration into knowledge graph structures
Job Responsibility
  • Architect and implement cutting-edge software systems, defining the technical design for our AI solutions to ensure scalability, performance, and reliability
  • Drive the hands-on design, implementation, and deployment of sophisticated systems that automate the analysis of data, code, and documentation
  • Apply deep expertise to structure extracted knowledge within a Credit Risk Domain-aware knowledge graph, including advanced strategies for effectively modelling complex codebases, particularly C++, within this graph
  • Act as a critical technical partner with data scientists, business analysts, and other engineering teams to translate challenging business requirements into robust technical solutions and ensure successful, high-quality project delivery
  • Tackle the most complex technical challenges within our AI initiatives, providing solutions that set the standard for engineering excellence
What we offer
  • Generous holiday allowance starting at 27 days plus bank holidays, increasing with tenure
  • A discretionary annual performance-related bonus
  • Private medical insurance packages to suit your personal circumstances
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
Employment type: Full-time