CrawlJobs Logo

Senior AI Software Architect

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
United States , Redmond

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

119800.00 - 234700.00 USD / Year

Job Description:

Do you want to be at the forefront of innovating the latest hardware designs to propel Microsoft’s cloud growth? Are you seeking a unique career opportunity that combines technical capabilities, cross-team collaboration with business insight and strategy? Join the Systems Planning and Architecture (SPARC) team within Microsoft’s Azure Hardware Systems and Infrastructure (AHSI) organization, the team behind Microsoft’s expanding Cloud Infrastructure and for powering Microsoft’s “Intelligent Cloud” mission. We are seeking a highly skilled Senior AI Software Architect to join our team focused on model enablement and performance optimization for Maia accelerators. This role is ideal for someone with strong experience in PyTorch-based model development, quantization techniques, and parallelization strategies at the framework level. You will work closely with hardware and software teams to bring up models on Maia and ensure they run efficiently.

Job Responsibility:

  • Port and optimize large-scale AI models (e.g., foundation models, diffusion models, YOLO) to run efficiently on Maia hardware
  • Integrate models using frameworks such as PyTorch, ONNX, vLLM, and SGLang
  • Apply techniques like KV cache quantization (e.g., BF16 → FP8), checkpointing, and re-sharding for efficient inference and training
  • Experiment with parallelism strategies (TP, PP) and analyze performance impacts across interconnects (NVLink vs PCIe)
  • Collaborate on improving inference pipelines, including KV caching in sglang/vllm and performance tuning at the PyTorch level
  • Work with Triton kernels for basic operations (e.g., FP8 dequantization) and assist in kernel performance analysis
  • Partner with hardware architects and kernel developers for co-design discussions
  • Communicate effectively with multiple stakeholders to align on performance goals and deliverables

Requirements:

  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Nice to have:

  • Bachelor's Degree in Computer Science or Engineering
  • 3+ years of strong hands-on experience with PyTorch and model optimization techniques
  • Practical knowledge of quantization techniques like PTQ/QAT especially for KV cache quantization
  • Familiarity with parallelization strategies and distributed training concepts (e.g., sharding, allreduce)
  • 2+ years of experience with AI inference stacks like SGLang/vLLM and performance profiling
  • Excellent problem-solving and communication skills
  • ability to work in a collaborative team environment
  • 3+ years of experience in Triton kernels and CUDA programming (basic understanding is acceptable but willingness to learn is essential)
  • Experience with AI accelerator hardware and embedded systems
  • 3+ years of prior work on efficient model checkpointing, resharding scripts, and large-scale model deployments for serving at scale

Additional Information:

Job Posted:
January 29, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior AI Software Architect

Senior AI Software Engineer

AnaVation is seeking a Senior Agentic-AI Software engineer to join our team that...
Location
Location
United States , Chantilly
Salary
Salary:
Not provided
anavationllc.com Logo
AnaVation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Active TS/SCI clearance within last 24 months
  • BA/BS in Computer Science or related field
  • BS + 10 Yrs or MS + 8 Yrs experience in computer science, AI, Machine Learning, or related field
  • 5+ years of experience in AI/ML development
  • At least 2 years focused on Agentic AI or autonomous systems
  • Proven track record of deploying production-grade AI systems
  • Strong problem-solving skills
  • Ability to work in a fast-paced, collaborative environment
Job Responsibility
Job Responsibility
  • Design, develop, and deploy advanced Agentic AI systems that autonomously perform complex tasks, make decisions, and interact with dynamic environments
  • Collaborate with cross-functional teams to deliver scalable, efficient, and ethical AI solutions
  • Architect and implement agentic AI systems capable of autonomous decision-making, task planning, and execution
  • Design and integrate multi-agent systems to solve complex problems
  • Develop and fine-tune large language models (LLMs) and reinforcement learning (RL) models
  • Implement robust APIs and interfaces to integrate AI agents with external systems
  • Optimize AI models for performance, scalability, and low-latency inference
  • Conduct rigorous testing, validation, and monitoring of AI agents
  • Collaborate with product managers, data scientists, and software engineers
  • Stay updated on latest advancements in Agentic AI, LLMs, and RL
What we offer
What we offer
  • Generous cost sharing for medical insurance for employee and dependents
  • 100% company paid dental insurance for employees and dependents
  • 100% company paid long-term and short term disability insurance
  • 100% company paid vision insurance for employees and dependents
  • 401k plan with generous match and 100% immediate vesting
  • Competitive Pay
  • Generous paid leave and holiday package
  • Tuition and training reimbursement
  • Life and AD&D Insurance
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, AI

As a Senior AI Engineer on our Core AI team, you will be a cornerstone of FloQas...
Location
Location
India , Pune
Salary
Salary:
Not provided
floqast.com Logo
FloQast
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years of professional software engineering experience
  • 3+ years focused on building backend for production applications
  • Mastery of Python
  • Familiarity with some AI application frameworks, context engineering, and scalable system design for AI products
  • Expertise in designing products that integrate with multiple technologies, APIs, and data sources in cloud-native environments (AWS preferred)
  • Strong desire to develop deep hands-on experience with LLM APIs, retrieval-augmented generation (RAG), conversational AI, document processing, and MCP integrations
  • Proven ability to lead tech product initiatives, establish technical standards and communicate complex system designs to both technical and business stakeholders
Job Responsibility
Job Responsibility
  • Architect and lead development of production AI products including intelligent chatbots, document processing systems, and agentic workflows using Python and modern AI frameworks
  • Design and implement our centralized AI platform including model routing, provider management, vector search, and AI application frameworks with seamless MCP (Model Context Protocol) integrations
  • Build scalable AI products that integrate with diverse technologies including accounting systems, document repositories, and external APIs while maintaining robust monitoring and observability
  • Master context engineering and system design for AI applications, ensuring optimal information retrieval, context assembly, and multi-turn conversation management
  • Collaborate with Product, Engineering, and Security teams to ensure AI products are robust, compliant, and aligned with business objectives in the regulated accounting space
  • Provide technical leadership and mentorship to the growing AI team, establishing best practices for AI product development, deployment, and governance
  • Fulltime
Read More
Arrow Right

Senior Software Developer

Senior Software Developer role at Hewlett Packard Enterprise focused on AI and m...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically 4-7 years' experience
  • Deep understanding of machine learning algorithms (linear regression, decision trees, support vector machines, random forests, deep learning models, reinforcement learning)
  • Strong foundation in mathematics and statistics (linear algebra, calculus, probability theory)
  • Proficiency in programming languages such as Python, R, or Java
  • Experience with software engineering best practices and version control systems (Git)
  • Knowledge of libraries and frameworks like TensorFlow, PyTorch, sci-kit, Keras
  • Advanced knowledge in deep learning and neural network architectures
  • Proficiency in using agentic frameworks like langGraph
  • Knowledge of evaluation of traditional AI/ML and Gen-AI based applications
Job Responsibility
Job Responsibility
  • Conduct advanced research in AI and machine learning
  • Design and architect AI solutions for complex problems
  • Provide technical guidance and mentorship to junior team members
  • Work with stakeholders to translate requirements into technical solutions
  • Drive continuous improvement and innovation in AI/ML practices
  • Evaluate and integrate third-party tools or services
  • Facilitate design review sessions
  • Collaborate with engineering manager and team lead
  • Prepare and deliver presentations to stakeholders
  • Design and develop solutions to complex application problems
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Comprehensive benefits suite supporting physical, financial and emotional wellbeing
  • Fulltime
Read More
Arrow Right

Senior AI Product Manager

We are investing massively in developing next-generation, agentic and generative...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
satalia.com Logo
Satalia
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven relevant Product Management experience
  • Experience developing AI products
  • Conceptual understanding of AI
  • BA/BS Computer Science/ related technical degree, or 3+ years equivalent technical experience
  • Success defining and launching software products
  • An understanding of the software development lifecycle
  • Experience working with data scientists
  • Knowledge of product methodologies such as user story mapping, user personas
  • Demonstrable ability to determine value
  • Strong problem solving and analytical capabilities
Job Responsibility
Job Responsibility
  • Design the start-to-end experience of an early stage, B2B AI product
  • Manage the entire product life cycle
  • Own roadmap plans and prioritise features to support ongoing delivery in line with client expectations
  • Work closely with development and data science teams to ensure on time delivery of product releases and oversee the product backlog
  • Conduct requirements gathering, reviews, and design review meetings with stakeholders, domain experts and technical architects
  • Analyse market trends and closely monitor product competitors
  • Perform business and technology analyses, based on product and user needs
  • Act as the subject matter expert of the product area and market internally (engineering, sales, pre-sales, and marketing) as well as externally (business partners, agencies, customers)
  • Research, gather, and define product's business requirements from discussions with users, clients, partners, sales and other sources for current and future product releases
  • Cooperate with other teams on product integrations and understand how products can be made so they are are reusable for other clients, not as an off-the-shelf product but a customisable product
What we offer
What we offer
  • Development - annual development budget to upskill yourself
  • Annual bonus - when Satalia does well, we all do well
  • Remote working - café, bedroom, beach - wherever works
  • Impactful projects - focus on bringing meaningful social and environmental change
  • People oriented culture - wellbeing is a priority, as is being a nice person
  • Truly flexible working hours - school pick up, volunteering, gym - no problem
  • Generous leave - 27 days’ holiday plus bank holidays and enhanced family leave
  • Transparent and open culture - you will be listened to and heard
  • Fulltime
Read More
Arrow Right

Senior Software Engineer II - AI/ML

As a Senior Software Engineer II at Aledade, we maintain, improve, and expand ou...
Location
Location
United States
Salary
Salary:
Not provided
aledade.com Logo
Aledade, Inc.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS/BTech (or higher) in Computer Science, Engineering or a related field
  • 6+ years experience as an engineer building full-stack web applications as part of a cross-functional team
  • 3+ years of experience working with SQL or other database querying language on large multi-table data sets
  • 3+ years of experience acting as a trusted technical decision-maker in a team setting, solving for short-term and long-term business value
  • 3+ years of experience coaching other engineers
Job Responsibility
Job Responsibility
  • Develop and implement scalable and performant solutions
  • Partner, as a peer, with Engineering Managers, Product Managers, and stakeholders throughout Aledade to develop and execute technical roadmaps using Agile processes
  • Mentor and coach more junior engineers including thorough pull request reviews for other developers and be receptive to critical feedback on your own work
  • Improve AI/ML infrastructure for model development, training, and deployment, with a focus on large language models and other generative AI architectures
  • Design multi-year vision, shaping the direction of crucial generative AI areas - text generation, image synthesis, multimodal models, and personalized content creation
  • Architect systems to enhance the capabilities and relevance of AI models, making complex data sets more accessible and actionable
  • Design and implement prompt engineering strategies to effectively guide generative AI models
  • Work closely with Product Management, Practices, Sales, Customer Success, and other stakeholders to identify and prioritize applied AI use cases within the organization
  • Analyze product usage patterns and trends to make data-driven decisions and forecasts for generative AI applications
  • Maintain the security of protected patient health information and ensure compliance with relevant regulations in the context of AI
  • Fulltime
Read More
Arrow Right

Senior Agentic AI Developer

Senior Agentic AI Developer to build production-grade autonomous agents and orch...
Location
Location
United States , San Francisco
Salary
Salary:
156000.00 - 234000.00 USD / Year
gofundme.com Logo
GoFundMe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6+ years in software engineering, with 3+ years hands-on building production-grade AI/ML systems
  • Deep expertise in LLM agent frameworks (LangGraph, Google ADK, CrewAI, AutoGen, Pydantic AI, LangChain)
  • Proven ability to architect agentic systems from scratch, including planning/reasoning flows and multi-agent orchestration
  • Strong Python/TypeScript skills and experience in microservices and distributed infrastructure
  • Familiarity with vector databases (e.g., Pinecone, ChromaDB), knowledge graphs (Neo4j, Graphiti), and RAG pipelines
  • Deep autonomous agent architecture knowledge including advanced reasoning, planning, task decomposition, multi-step automation, tool-use patterns, evaluation systems beyond simple response generation
  • Enterprise integration expertise with RESTful APIs, A2A interoperability, webhook systems, Model Context Protocol (MCP) tools for dynamic tool discovery
  • Production experience with cloud-native infrastructure (Docker, Kubernetes, AWS/GCP)
  • Demonstrated experience with reinforcement learning, agent optimization, and agent evaluation techniques
  • Strong understanding of agent safety, bias mitigation, transparency, and ethical design practices
Job Responsibility
Job Responsibility
  • Design and implement LLM-based autonomous agents that perform beyond simple responses — including planning, reasoning, tool-use, and long-term task decomposition
  • Engineer context- and memory-rich agents that integrate structured data, external APIs, and conversational context to optimize outcomes
  • Build production-scale orchestration frameworks using tools such as LangGraph, Pydantic AI, Google ADK, and CrewAI to orchestrate complex workflows across campaign creation, optimization, and tracking in distributed microservices architectures
  • Implement agent-to-agent (A2A) communication protocols using Model Context Protocol (MCP), enabling tool discovery and dynamic peer -to-peer task delegation between specialized agents in our fundraising ecosystem
  • Develop microservice-based, cloud-native infrastructure for autonomous agent deployment with Docker/Kubernetes, observability tooling, and cloud platforms (AWS/GCP)
  • Ensure enterprise-grade performance, monitoring, and fault tolerance in agent systems supporting large-scale fundraising workflows
  • Design sophisticated prompt strategies, agent memory architectures, vector and graph database implementations for maintaining context across multi-turn conversations and long-running campaign lifecycles
  • Architect and optimize vector and graph database integrations (e.g., Pinecone, Weaviate, ChromaDB) for agent memory and semantic recall
  • Build custom agentic frameworks for use cases across campaign storytelling, optimization, and donor engagement
  • Architect frameworks from scratch tailored to fundraising domain challenges, incorporating advanced tool-use capabilities that integrate with platform APIs, knowledge graphs, and external systems
What we offer
What we offer
  • Make an Impact: Be part of a mission-driven organization making a positive difference in millions of lives every year
  • Innovative Environment: Work with a diverse, passionate, and talented team in a fast-paced, forward-thinking atmosphere
  • Collaborative Team: Join a fun and collaborative team that works hard and celebrates success together
  • Competitive Benefits: Enjoy competitive pay and comprehensive healthcare benefits
  • Holistic Support: Enjoy financial assistance for things like hybrid work, family planning, along with generous parental leave, flexible time-off policies, and mental health and wellness resources to support your overall well-being
  • Growth Opportunities: Participate in learning, development, and recognition programs to help you thrive and grow
  • Commitment to DEI: Contribute to diversity, equity, and inclusion through ongoing initiatives and employee resource groups
  • Community Engagement: Make a difference through our volunteering program
  • Fulltime
Read More
Arrow Right

Senior AI Product Engineer

As a one person team to start, you are both a product lead and the technical SME...
Location
Location
Australia , Melbourne
Salary
Salary:
Not provided
frankieone.com Logo
FrankieOne
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Recent experience in building AI applications with end to end ownership
  • 10 plus years experience in engineering teams in an Agile environment using JS based frameworks like React
  • 5 plus years developing & supporting Full Stack TS-based SaaS applications production in AWS/Cloud ecosystem
  • Experience in HTML5, ES6, CSS3/Sass, javascript, typescript, React, React Native for Web (optional), npm and other front-end technologies to deliver enterprise grade frontend applications
  • Experience in depth of back-end oriented technologies such as nodejs, typescript for managing BFF
  • Knowledge and experience in tracking technological developments especially AI with vendor offerings and ability to quickly evaluate their value proposition e.g AWS Bedrock
  • Experience in architecting & building enterprise grade AI applications with Data and AI governance, AI gateways, context management (RAG), inference/prompt management, tools/functions (MCP, A2A), memory & fine-tuning
  • Experience in capturing business requirements from stakeholders to documenting, architect and building AI applications for both internal and external users
  • Experience in designing web applications based on AWS well architectured principles, 12 factor web application principles, cloud based software architecture patterns (pub-sub, saga, circuit breaker etc)
  • Experience in designing reactjs based frontend and backend in nodejs or golang
Job Responsibility
Job Responsibility
  • Inspire others
  • Design with quality
  • Collaborate
  • Be proactive
  • Be an advocate for FrankieOne, for our product, and our values
Read More
Arrow Right

Senior Full Stack Software Engineer

Tutor Intelligence builds software to enable ordinary robots to achieve extraord...
Location
Location
United States , Watertown
Salary
Salary:
140000.00 - 190000.00 USD / Year
tutorintelligence.com Logo
Tutor Intelligence
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong programming skills in Python
  • Software engineering tooling: git, unix shell, etc
  • Collaborative nature and social skill set
  • Interest in robotics, AI, solving hard problems, or improving the future of humanity
  • Passion for building things (and just getting stuff done)
Job Responsibility
Job Responsibility
  • Architecting and engineering core software across one or more of: robot software, backend services, ML services, cloud infrastructure / dev-ops
  • Involvement in new project planning
What we offer
What we offer
  • generous equity
  • fully covered health + dental
  • unlimited PTO
  • Fulltime
Read More
Arrow Right