CrawlJobs Logo

Member of Technical Staff, AI Systems Engineer

Switzerland, Zürich · Job Posted March 21, 2026
Apply Position
Job Link Share

Job Description

We are building next-generation customized AI silicon designed to accelerate AI workloads with unprecedented efficiency. We are looking for an exceptional Systems Engineer to bridge the gap between our custom hardware and modern AI inference frameworks. As a Senior AI Systems Engineer, you will own the software integration layer between our custom AI chip's proprietary SDK and SGLang, a state-of-the-art serving framework for Large Language Models (LLMs) and Vision-Language Models. You will be responsible for ensuring that our silicon can seamlessly run SGLang inference workloads at peak performance, bypassing the traditional CUDA ecosystem entirely.

Job Responsibility

  • Framework Integration: Architect and develop the backend integration to make our custom AI chip a first-class citizen in SGLang
  • Custom Operator Development: Write custom C++ / PyTorch extensions that map SGLang’s primitive operations (e.g., RadixAttention, FlashAttention, matrix multiplications) to our custom chip's proprietary software layer
  • Performance Optimization: Profile and optimize end-to-end LLM inference latency, throughput, and memory utilization (Paged Attention) on our hardware
  • Cross-Functional Collaboration: Work closely with our hardware architecture and compiler teams to provide feedback on our custom software stack and silicon design based on framework-level bottlenecks
  • Testing & Deployment: Build robust testing pipelines to validate model accuracy and performance parity against standard GPU baselines

Requirements

  • BS, MS, or PhD in Computer Science, Computer Engineering, or a related field
  • Software engineering experience focusing on systems programming, ML infrastructure, or AI compilers
  • Expertise in Python: Deep understanding of memory management, concurrent programming
  • Experience with LLM Inference Engines: Hands-on experience modifying or extending frameworks like SGLang, vLLM, DeepSpeed-FastGen, or TensorRT-LLM
  • PyTorch Internals: Strong experience writing PyTorch C++ extensions and custom operators
  • Hardware Interfacing: Proven track record of integrating machine learning workloads with hardware accelerators (GPUs, TPUs, NPUs) using custom SDKs, APIs, or low-level drivers

Nice to have

  • Prior experience working on non-CUDA software ecosystems (e.g., AMD ROCm, AWS Neuron, Google XLA)
  • Familiarity with AI compilers and intermediate representations (MLIR, Apache TVM, OpenAI Triton)
  • Strong understanding of underlying LLM architectures (Transformers, MoE) and state-of-the-art attention algorithms (FlashAttention v2/v3)
  • Previous experience at an AI silicon startup or working on custom accelerators (e.g., Google TPU, AWS Trainium)

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Member of Technical Staff, AI Systems Engineer

8 matching positions

Member of Technical Staff, AI Platform Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to TypeScript, Python, C, C++, C#, Java
  • OR equivalent experience
  • Bachelor’s degree in computer science, or related technical discipline AND 6+ years technical engineering experience building web services with coding in languages including, but not limited to: Python, Golang, Java/Scala, Rust
  • 6+ years' experience in building and releasing production software at the platform level
  • Deep experience with all of the following languages: Golang, Java/Scala, Typescript (React/Next.js)
  • Experience in model pretraining, post-training, evaluation, and inference
  • Experience using Machine Learning frameworks, including experience using, deploying, and scaling language learning models, either personally or professionally
  • Ability to clearly communicate complex technical concepts to both technical and non-technical stakeholders
  • Demonstrated interpersonal skills and ability to work closely with cross-functional teams, including product managers, designers, and other engineers
  • Experience going from zero-to-one as well as working with developed systems
Job Responsibility
Job Responsibility
  • Design, develop, and maintain platform-level software solutions
  • Collaborate with cross-functional teams to integrate AI capabilities into various products
  • Ensure the reliability, scalability, and performance of platform components
  • Stay updated with the latest advancements in AI and engineering
  • Work alongside the technical staff and AI researchers to improve model development flows
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - AI Product Engineer - Web

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, or related technical discipline AND 4+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR equivalent experience.
  • Bachelor’s degree in computer science, or related technical discipline AND 10+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR Master's Degree in Computer Science, or related technical discipline AND 8+ years software engineering experience building web products at scale by coding in languages including, but not limited to, Typescript and React OR equivalent experience.
  • Have 0 to 1 experience with a bias towards shipping and learning, while balancing a high-quality bar.
  • Thrive in a fast-paced, collaborative environment and are comfortable making progress in ambiguity.
  • Enjoy working closely with cross-functional partners and teammates in an inclusive, curious culture.
  • Take a user-centric approach to product development, prioritizing solutions that result in the best user experience and have the technical expertise to pull it off.
Job Responsibility
Job Responsibility
  • Ship delightful, AI powered experiences that will shape how millions of people will interact with AI in the future
  • Collaborate with AI researchers, product managers, and designers to bring a world-class AI companion to the world
  • Design and build efficient and reusable front-end systems that drive complex web applications
  • Plan and deploy front end infrastructure necessary to build, test, and deploy our products
  • Join a small team of world class product engineers with deep frontend expertise who are obsessed with building beautiful and performant products
  • Embody our Culture and Values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Applied AI Engineer

We’re hiring a Applied AI Engineer to join a fast‑moving, high‑ownership team bu...
Location
Location
United States , Mountain View
Salary
Salary:
100600.00 - 199000.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field OR Master's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 1+ year(s) data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) or consulting experience OR Bachelor's Degree in Data Science, Mathematics, Statistics, Econometrics, Economics, Operations Research, Computer Science, or related field AND 2+ years data-science experience (e.g., managing structured and unstructured data, applying statistical techniques and reporting results) OR equivalent experience.
  • 2+ years shipping production-level code, models, or data analysis.
  • 1+ years using AI-assisted coding and analysis techniques.
  • Experience working on small teams and mid-stage startup environments.
  • Experience working on AI products.
  • PhD in engineering, applied math, statistics, or related analytical field.
  • 4+ years shipping production-level code, models, or data analysis.
  • Deep experience building from zero-to-one.
  • Hands on work hillclimbing AI evaluations.
Job Responsibility
Job Responsibility
  • LLM Feature & Agent Development
  • Design and ship LLM‑powered assistant features, including conversational flows, agentic behaviors, retrieval pipelines, and multimodal interactions.
  • Build prompt architectures, system instructions, and orchestration logic that ensure reliability, grounding, and personality consistency.
  • Prototype new capabilities rapidly and iterate based on user signals and evaluation data.
  • Evaluation, Hillclimbing & Quality Systems
  • Build and maintain evaluation frameworks for correctness, safety, grounding, and UX quality.
  • Run hillclimbing loops across prompts, models, and tool‑use strategies to continuously improve assistant performance.
  • Analyze failure modes, design mitigations, and drive systematic improvements across the stack.
  • LLM Tooling & Internal Infrastructure
  • Develop internal tools for prompt experimentation, model comparison telemetry and debugging automated eval pipelines
  • Fulltime
Read More
Arrow Right

Member of Technical Staff - AI Product - Build Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience building CI/CD pipelines and automation systems
  • Strong scripting skills (Python, Bash, or similar)
  • Sufficient in git version control, git-ops, Docker, Kubernetes
  • Familiar with common web toolchains like npm/pnpm, vite
  • Familiar with GitHub Actions and/or Azure DevOps
  • Strong debugging and problem-solving skills
  • Experience supporting large codebases or monorepos
  • Experience optimizing large build pipelines
Job Responsibility
Job Responsibility
  • Build Systems: Design and maintain scalable build systems and tooling
  • Optimize build performance, caching, and dependency management
  • Support cross-platform builds
  • Ensure deterministic and reproducible builds
  • CI/CD Pipeline: Build and maintain CI pipelines for automated builds and testing
  • Improve pipeline reliability and reduce build/test latency
  • Integrate build pipelines with release and deployment systems
  • Release & Artifact Management: Design artifact packaging and versioning systems
  • Support release automation and rollback strategies
  • Manage build metadata and release metrics
  • Fulltime
Read More
Arrow Right

Member Of Technical Staff - Software Engineer, Health AI

At Microsoft AI, our health team is on a mission to help millions of users bette...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Higher Degree in Computer Science, or related technical discipline AND strong software engineering experience with coding in languages/frameworks including, but not limited to, C#, C, C++, Java, Python, Rust, Typescript, Swift, Kotlin
  • Demonstrated expertise building products at scale, with domain expertise in one or more of distributed systems, cloud infrastructure, web, mobile, GenAI
  • Experience collaborating in cross functional teams, working through ambiguity to deliver high quality products
  • Have 0 to 1 experience with a bias towards shipping and learning, while balancing a high-quality bar
  • Proven ability to collaborate and contribute to a positive, inclusive work environment, fostering knowledge sharing and growth within the team
Job Responsibility
Job Responsibility
  • Collaborate with AI researchers, product managers, and designers to bring a world-class AI health companion to the world
  • Own the end-to-end development of features, from ideation and specification through to deployment and iteration
  • Design, build, and optimize production-grade code, delivering robust features within a much larger existing architecture
  • Work independently across a wide range of our stack, shipping delightful user experiences
  • Ensure resilience, maintainability, and security above all else
  • Build the hiring pipelines, onboarding frameworks, or software development best practices as needed to scale an engineering team around you
  • Guide peers, contributing to a culture of technical excellence and continuous improvement
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, Software Co-Design AI HPC Systems

Our team’s mission is to architect, co-design, and productionize next-generation...
Location
Location
United States , Mountain View
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Strong background in one or more of the following areas: AI accelerator or GPU architectures
  • Distributed systems and large-scale AI training/inference
  • High-performance computing (HPC) and collective communications
  • ML systems, runtimes, or compilers
  • Performance modeling, benchmarking, and systems analysis
  • Hardware–software co-design for AI workloads
  • Proficiency in systems-level programming (e.g., C/C++, CUDA, Python) and performance-critical software development.
  • Proven ability to work across organizational boundaries and influence technical decisions involving multiple stakeholders.
Job Responsibility
Job Responsibility
  • Lead the co-design of AI systems across hardware and software boundaries, spanning accelerators, interconnects, memory systems, storage, runtimes, and distributed training/inference frameworks.
  • Drive architectural decisions by analyzing real workloads, identifying bottlenecks across compute, communication, and data movement, and translating findings into actionable system and hardware requirements.
  • Co-design and optimize parallelism strategies, execution models, and distributed algorithms to improve scalability, utilization, reliability, and cost efficiency of large-scale AI systems.
  • Develop and evaluate what-if performance models to project system behavior under future workloads, model architectures, and hardware generations, providing early guidance to hardware and platform roadmaps.
  • Partner with compiler, kernel, and runtime teams to unlock the full performance of current and next-generation accelerators, including custom kernels, scheduling strategies, and memory optimizations.
  • Influence and guide AI hardware design at system and silicon levels, including accelerator microarchitecture, interconnect topology, memory hierarchy, and system integration trade-offs.
  • Lead cross-functional efforts to prototype, validate, and productionize high-impact co-design ideas, working across infrastructure, hardware, and product teams.
  • Mentor senior engineers and researchers, set technical direction, and raise the overall bar for systems rigor, performance engineering, and co-design thinking across the organization.
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Safety Post-Training

As a Member of Technical Staff, AI Safety Post-Training, you will work to develo...
Location
Location
United States , Mountain View
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Experience prompting and working with large language models
  • Experience writing production-quality Python code
  • Demonstrated interest in Responsible AI
Job Responsibility
Job Responsibility
  • Leverage expertise in AI safety to uncover potential risks and develop novel mitigation strategies, including alignment techniques, constitutional AI approaches, RLHF, and robustness improvements for large language models
  • Create and implement comprehensive evaluation frameworks and red-teaming methodologies to assess model safety across diverse scenarios, edge cases, and potential failure modes
  • Build automated safety testing systems, generalize safety solutions into repeatable frameworks, and write efficient code for safety model pipelines and intervention systems
  • Maintain a user-oriented perspective by understanding safety needs from user perspectives, validating safety approaches through user research, and serving as a trusted advisor on AI safety matters
  • Track advances in AI safety research, identify relevant state-of-the-art techniques, and adapt safety algorithms to drive innovation in production systems serving millions of users
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right

Member of Technical Staff, AI Training Infrastructure

As a Training Infrastructure Engineer, you'll design, build, and optimize the in...
Location
Location
United States , San Mateo
Salary
Salary:
175000.00 - 220000.00 USD / Year
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Computer Engineering, or related field, or equivalent practical experience
  • 3+ years of experience with distributed systems and ML infrastructure
  • Experience with PyTorch
  • Proficiency in cloud platforms (AWS, GCP, Azure)
  • Experience with containerization, orchestration (Kubernetes, Docker)
  • Knowledge of distributed training techniques (data parallelism, model parallelism, FSDP)
Job Responsibility
Job Responsibility
  • Design and implement scalable infrastructure for large-scale model training workloads
  • Develop and maintain distributed training pipelines for LLMs and multimodal models
  • Optimize training performance across multiple GPUs, nodes, and data centers
  • Implement monitoring, logging, and debugging tools for training operations
  • Architect and maintain data storage solutions for large-scale training datasets
  • Automate infrastructure provisioning, scaling, and orchestration for model training
  • Collaborate with researchers to implement and optimize training methodologies
  • Analyze and improve efficiency, scalability, and cost-effectiveness of training systems
  • Troubleshoot complex performance issues in distributed training environments
What we offer
What we offer
  • meaningful equity in a fast-growing startup
  • comprehensive benefits package
  • Fulltime
Read More
Arrow Right