Senior Software Engineer, AI Runtime

Apollo GraphQL

Location:
United States

Contract Type:
Not provided

Salary:

157000.00 - 198900.00 USD / Year

Job Description:

We’re seeking a Senior Software Engineer to help power the future of agentic AI workflows. You’ll take our MCP Server to the next level, turning it into an enterprise-grade service that lets diverse tools and systems be exposed effortlessly to AI agents. Looking ahead, you’ll also help architect the MCP Gateway—a new layer that will route requests across tools, enforce policies, and provide the runtime foundation for scalable multi-agent systems. Along the way, you’ll tackle challenges in scalability, performance, and developer experience to ensure our platform feels seamless, powerful, and enterprise-ready.

Job Responsibility:

  • Scale an enterprise AI/MCP Server and Gateway that powers multi-agent workflows across Apollo, including routing, orchestration, and integration boundaries
  • Implement robust server infrastructure to ensure reliability, performance, and security at scale
  • Build and maintain tools for agent discovery, communication, and coordination
  • Define deployment strategies and runtime optimizations to maximize efficiency and minimize operational overhead
  • Develop frameworks and patterns that enable seamless multi-agent collaboration and AI-driven orchestration
  • Integrate observability, logging, and monitoring for full visibility into server and agent behavior
  • Explore and implement AI-enhanced developer workflows to optimize orchestration and agent interactions
  • Collaborate with teams within our org to ensure the MCP Server meets evolving product and developer needs
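The routing responsibility above — exposing tools to agents and directing each request to the right handler, failing gracefully on unknown tools — can be sketched in Rust, the posting's required language. This is a minimal illustrative sketch only; `ToolRouter`, `register`, and `route` are hypothetical names, not Apollo's actual MCP Server or Gateway API.

```rust
use std::collections::HashMap;

// Hypothetical sketch of agent-to-tool routing; not Apollo's real MCP API.
struct ToolRouter {
    handlers: HashMap<String, fn(&str) -> String>,
}

impl ToolRouter {
    fn new() -> Self {
        ToolRouter { handlers: HashMap::new() }
    }

    // Expose a tool to agents under a name.
    fn register(&mut self, name: &str, handler: fn(&str) -> String) {
        self.handlers.insert(name.to_string(), handler);
    }

    // Route an agent's request to the named tool; unknown tools return an
    // error rather than panicking, since a gateway must fail gracefully.
    fn route(&self, tool: &str, input: &str) -> Result<String, String> {
        match self.handlers.get(tool) {
            Some(handler) => Ok(handler(input)),
            None => Err(format!("unknown tool: {tool}")),
        }
    }
}

fn main() {
    let mut router = ToolRouter::new();
    router.register("echo", |s| s.to_string());
    router.register("upper", |s| s.to_uppercase());

    assert_eq!(router.route("echo", "hi"), Ok("hi".to_string()));
    assert_eq!(router.route("upper", "hi"), Ok("HI".to_string()));
    assert!(router.route("missing", "hi").is_err());
}
```

A production gateway would layer policy enforcement, observability, and cross-service dispatch on top of this kind of lookup, but the register/route split is the essential shape.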

Requirements:

  • Expertise in agent-to-tool orchestration, routing, and coordination in scalable, fault-tolerant systems
  • Deep expertise in Rust programming language
  • Strong background in distributed systems, server architecture, and high-performance backend development
  • Proven experience with protocol design, message routing, and server-side orchestration frameworks that enable scalable, reliable multi-tool agent workflows
  • Experience building and maintaining robust runtime infrastructure that supports AI-driven workflows and enables reliable agent-to-tool interactions
  • Hands-on experience with observability, monitoring, and debugging frameworks for complex systems
  • Passion for clean, maintainable code, high system reliability, and scalable architecture
  • Experience in strategic system design, making architectural trade-offs, and planning for long-term scalability and maintainability
  • Strong technical leadership and mentorship, including guiding junior engineers and driving engineering best practices across teams
  • Ability to influence cross-team architecture decisions and align engineering efforts with product and business objectives
  • Production ownership experience: leading incident response, debugging, and performance optimization in high-impact backend systems

Nice to have:

  • Exposure to AI/ML-enabled developer tooling or autonomous system orchestration
  • Familiarity with cloud-native architectures, containerization, or orchestration frameworks
  • Experience with performance optimization and cost-efficient scaling of high-throughput distributed systems

Additional Information:

Job Posted:
December 06, 2025

Work Type:
Remote work

Similar Jobs for Senior Software Engineer, AI Runtime

Senior Solution Engineer

JFrog is expanding in APAC, and we are looking for a strong Senior Solution Engi...
Location:
Singapore

Salary:
Not provided

JFrog

Expiration Date:
Until further notice

Requirements:
  • Experience in pre-sales, solutions engineering, DevOps consulting, or platform engineering
  • Hands-on knowledge of Docker, Kubernetes, Git, Jenkins/GitHub/GitLab, cloud-native architectures
  • Strong communication and customer-facing skills
  • Based in Singapore and open to travel across SEA and Korea
Job Responsibility:
  • Lead technical discovery, demos, and POCs for customers in SEA + Korea
  • Architect CI/CD, DevSecOps, and software supply chain solutions using the JFrog Platform
  • Work closely with sales, product, R&D, and channel partners
  • Represent JFrog at regional events, workshops, and customer sessions
  • Support enterprise adoption of Artifactory, Xray, Curation, Advanced Security, AI Catalog, Runtime, and more

Senior AI Engineer - MSC AI Innovation

MSC AI Innovation is an AI-first team that incubates, builds, and accelerates so...
Location:
Israel, Tel Aviv, Herzliya

Salary:
Not provided

Microsoft Corporation

Expiration Date:
Until further notice

Requirements:
  • 8+ years professional software development
  • 4+ years of software engineering experience in the AI space (e.g., building and shipping AI/ML or GenAI features in production)
  • Proven experience with building AI agents
  • Hands-on experience with evaluation methodologies and integrating quality standards/guardrails into delivery
  • Proficiency in Python and/or C#, with experience using REST APIs and SDKs
  • Deep understanding of AI system design, including ML fundamentals, Generative AI concepts, and cloud-native architectures
Job Responsibility:
  • Design and build AI agents that plan, use tools/APIs, manage state/memory, and reliably complete multi-step workflows
  • Own AI features from design through production, including deployment, monitoring, and live‑site reliability, with an eval-first development lifecycle: define success criteria, build evaluation datasets and automated harnesses, and run human-in-the-loop reviews where needed
  • Develop and maintain prompt, retrieval, and memory strategies (system prompts, few-shot examples, tool schemas, retrieval context) with proper versioning and evaluation coverage
  • Debug AI behavior using prompt analysis, data inspection, and model/tool-call traces, and translate failure patterns into targeted improvements
  • Establish and track AI quality metrics (e.g., accuracy, groundedness, relevance, hallucination rate) and integrate them into CI/CD release gates
  • Optimize runtime performance and economics (token usage, inference cost, latency, caching, model selection/routing, batching) and implement monitoring and continuous improvement loops (online signals, drift detection, structured user feedback)
  • Partner with product, design, and domain stakeholders to define use cases, acceptance criteria, and rollout plans for AI features
  • Live site responsibility

Work Type:
Fulltime

Director, Digital Ecosystem Applications

This position is responsible for the Software Platforms group at the Innovation ...
Location:
United States, Belmont

Salary:
240000.00 - 285000.00 USD / Year

Volkswagen AG

Expiration Date:
Until further notice

Requirements:
  • 10+ years with 2+ years in a technical leadership role
  • CS, EE, M.S. Engineering (or equivalent) REQUIRED
  • M.S. Engineering (or equivalent) or PhD PREFERRED
  • Analytical and conceptual thinking – using logic and reason, creative and strategic
  • Communication skills – interpersonal, presentation and written
  • Managing interdisciplinary teams on individual projects
  • Integration – joining people, processes or systems
  • Influencing and negotiation skills
  • Problem solving
  • Resource management
Job Responsibility:
  • Define the technical mission, architecture strategy, and long‑term platform vision for the In‑Vehicle Computing & Digital Ecosystem Applications team, spanning Android Automotive OS (AAOS), in‑vehicle compute platforms, Software‑Defined Vehicle (SDV) architecture, and AI‑driven cockpit intelligence
  • Provide technical leadership across the full software stack, including Android Framework, System Services, HAL layers, middleware, connectivity stacks, media/audio frameworks, HMI toolchains, and cloud‑connected AI runtimes within an SDV‑aligned architecture
  • Lead and mentor engineering teams in platform bring‑up, system integration, performance optimization, and development of AI‑agentic features, multimodal interaction models, and next‑generation speech technologies
  • Manage multi‑year budgets for platform development, AI integration, SDV‑aligned compute evolution, SoC evaluations, cloud services, and prototype programs
  • Deliver executive‑level technical reporting on architecture decisions, platform readiness, SDV integration milestones, AI progress, risks, and strategic recommendations
  • Drive strategic planning for ICC’s infotainment and cockpit portfolio, including AAOS evolution, hybrid cloud/edge AI pipelines, intelligent mobile agent technologies, and SDV‑centric software and compute roadmaps
  • Align technical roadmaps with global VW Group Innovation teams across infotainment, connectivity, AI/ML, vehicle architecture, cloud services, and SDV platform strategy, ensuring cross‑platform consistency and shared component reuse
  • Build strategic relationships with SoC vendors, Tier‑1 suppliers, cloud providers, and AI technology partners to influence cockpit compute and SDV platform evolution
  • Maintain partnerships with Silicon Valley companies specializing in AI runtimes, LLMs, speech, multimodal interaction, and automotive‑grade SDV‑compatible software frameworks
  • Collaborate with academic and research institutions on AI‑agentic systems, embedded ML, HMI, and in‑vehicle compute architectures aligned with SDV principles
What we offer:
  • Eligibility for annual performance bonus
  • Healthcare benefits
  • 401(k), with company match
  • Defined contribution retirement program
  • Tuition reimbursement
  • Company lease car program
  • Paid time off

Work Type:
Fulltime

Senior Software Engineer, Managed AI - AI model LifeCycle

The Senior Software Engineer for the Model LifeCycle team will contribute to bui...
Location:
United States, San Francisco

Salary:
172425.00 - 209000.00 USD / Year

Crusoe

Expiration Date:
Until further notice

Requirements:
  • Bachelor's degree in Computer Science, Engineering, or a related field
  • Experience delivering production-ready features
  • Familiarity with essential cloud-based services (e.g., compute, storage, networking)
  • Familiarity with Generative AI (Large Language Models, Multimodal)
  • Experience with AI infrastructure components (training, inference)
  • 4-5+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
Job Responsibility:
  • Implement and maintain systems for fine-tuning large foundation models (SFT, PEFT, LoRA, adapters), including multi-node orchestration, checkpointing, failure recovery, and cost-efficient scaling
  • Implement and maintain end-to-end training pipelines for Large Language Models
  • Implement components for distillation and reinforcement learning pipelines (e.g., preference optimization, policy optimization, reward modeling)
  • Develop and maintain core agent execution infrastructure
  • Implement features for dataset, model, and experiment management, focusing on versioning, lineage, evaluation, and reproducible fine-tuning
  • Work closely with Senior Engineers and Principal Engineers, as well as product and platform teams, to implement system abstractions and APIs
  • Contribute to technical discussions on training runtimes, scheduling, storage, and model lifecycle management
  • Engage with the open-source LLM ecosystem
What we offer:
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement

Work Type:
Fulltime

Senior Software Engineer, Managed AI - AI Platform

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'...
Location:
United States, San Francisco, CA; Sunnyvale, CA

Salary:
172425.00 - 209000.00 USD / Year

Crusoe

Expiration Date:
Until further notice

Requirements:
  • Advanced degree in Computer Science/Engineering
  • 4-5+ years of industry experience with demonstrated history of consistent success leading a varied portfolio of initiatives across your function
  • Experience with distributed systems, cloud services (compute, storage, networking, database), and delivering early-stage projects quickly
  • Experience with Generative AI (LLMs, Multimodal) and familiar with AI infrastructure (training, inference, ETL pipelines)
  • Proficient with container runtimes (e.g., Kubernetes), microservices, REST APIs, gRPC, and the full software development lifecycle including CI/CD
Job Responsibility:
  • Lead the design and implementation of core AI services, including: resilient fault-tolerant queues for efficient task distribution, model catalogs for managing and versioning AI models, and scheduling mechanisms optimized for cost and performance
  • Architect and scale infrastructure to handle millions of API requests per second
  • Implement robust monitoring and alerting to ensure system health and 24/7 availability
  • Collaborate closely with product management, business strategy, and other engineering teams to define the AI platform roadmap
  • Influence the long-term vision and architectural decisions of the platform
  • Contribute to open-source AI frameworks and actively participate in the AI community
  • Prototype and rapidly iterate on emerging technologies and new features
What we offer:
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement

Work Type:
Fulltime

Senior Software Engineer - Performance Tooling

The Artificial Intelligence (AI) Frameworks team at Microsoft develops AI softwa...
Location:
United States, Redmond

Salary:
119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date:
Until further notice

Requirements:
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C++, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. This includes passing the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C++, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C++, or Python OR equivalent experience
  • 4+ years’ practical experience working on high performance applications and performance debugging and optimization on CPUs/GPUs
  • Experience in DNN/LLM inference and experience in one or more DL frameworks such as PyTorch, Tensorflow, or ONNX Runtime and familiarity with CUDA, ROCm, Triton
  • Technical background and solid foundation in software engineering principles, computer architecture, GPU architecture, hardware neural net acceleration
  • Experience in end-to-end performance analysis and optimization of state of the art LLMs and HPC applications, including proficiency using GPU profiling tools
  • Cross-team collaboration skills and the desire to collaborate in a team of researchers and developers
  • Ability to independently lead projects
Job Responsibility:
  • Work across multiple layers of the AI software stack (abstractions, programming models, compilers, runtimes, libraries, and APIs) to enable large-scale model training and inference
  • Benchmark OpenAI and other LLMs for performance on GPUs and Microsoft hardware
  • Debug, profile, and optimize performance for training/inference workloads on Central Processing Units (CPUs)/Graphics Processing Units (GPUs)
  • Monitor performance regressions and drive continuous improvements to reduce time-to-deploy and hardware footprint
  • Collaborate across teams of researchers and engineers to deliver scalable, production-ready AI performance improvements

Work Type:
Fulltime

Senior Software Engineer, CoreAI Workload Engines

The CoreAI Workloads team builds the foundational inference engines and APIs tha...
Location:
United States, Redmond

Salary:
119800.00 - 304200.00 USD / Year

Microsoft Corporation

Expiration Date:
Until further notice

Requirements:
  • Bachelor's Degree in Computer Science or related technical field and 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, Python, or equivalent experience.
  • Proven ability to design and operate large-scale, production inference services with high reliability and performance requirements, and to ship performance improvements safely via disciplined experimentation.
  • Strong skills in performance analysis: benchmarking, profiling, diagnosing regressions, and turning results into concrete engine/runtime changes.
  • Strong problem-solving skills and the ability to debug complex, cross layer systems issues.
  • Demonstrated technical leadership, including mentoring engineers, driving cross-team architectural alignment, and leveraging AI tools and AI-assisted workflows to accelerate engineering velocity and quality.
  • Hands-on experience with Kubernetes (building and operating services on k8s), including debugging production issues and designing platform abstractions (e.g., custom resources/controllers) and scheduling-aware deployments (e.g., node affinity, taints/tolerations, resource requests/limits).
  • Strong collaboration and communication skills, with the ability to work across organizational boundaries.
Job Responsibility:
  • Optimize inference engines for OpenAI and open-source models by implementing and shipping performance/efficiency improvements across runtime, scheduling, and serving paths (latency, throughput, utilization, availability, and cost).
  • Run experiments end-to-end: formulate hypotheses, implement engine changes (including Python/PyTorch integration points where relevant), analyze results, and ship improvements behind guardrails.
  • Build and use experimentation capabilities for large-scale AI inference (experiment lifecycle, tracking, metric modeling, comparability standards, automated analysis) so the team can iterate quickly and safely.
  • Own serving availability and efficiency for Azure OpenAI Service workloads through tiered experimentation, lean segmentation, and multi-modal utilization across heterogeneous fleets—turning findings into shipped engine improvements.
  • Design and evolve inference serving architectures to improve utilization and latency using techniques such as disaggregated serving, multi-token prediction, KV offload/retrieval, and quantization—validated via staged rollouts and production guardrails.
  • Extend AI infrastructure abstractions to support elastic, heterogeneous inference engines reliably at scale (e.g., dynamic scaling across model families, modalities, and workload classes while maintaining isolation and SLOs).
  • Tune and scale inference engines across NVIDIA GPU generations (A100, H100, H200) for state-of-the-art OpenAI models, focusing on serving efficiency, utilization, and reliability (not hardware bring-up).
  • Partner with networking and storage teams to leverage high-performance interconnects (e.g., RDMA/InfiniBand-class fabrics such as RoCE over IB) for distributed inference, without owning low-level kernel/driver enablement.
  • Drive end-to-end features from design through production: observability, diagnostics, performance regression detection, and operational excellence for inference serving.
  • Influence platform architecture and technical direction across teams through design reviews, clear metrics, and technical leadership focused on experimentation velocity and production reliability.
What we offer:
  • Benefits and other compensation

Work Type:
Fulltime

Senior System Development Engineer – AI Technologies

Our customers’ system requirements are usually highly complex. Bringing together...
Location:
United States, Austin

Salary:
123000.00 - 170000.00 USD / Year

Dell

Expiration Date:
Until further notice

Requirements:
  • Bachelor’s or Master’s degree in Computer Engineering, Computer Science, Electrical Engineering, or related field
  • 5+ years of experience in system engineering, platform development, or hardware–software validation
  • Strong understanding of x86 system architecture, CPU/GPU/accelerator internals, memory systems, and I/O subsystems
Job Responsibility:
  • Lead bring‑up, configuration, and validation of system platforms supporting AI workloads (servers, GPU racks, accelerators, networking fabrics)
  • Work with BIOS/UEFI, BMC, firmware, drivers, and kernel subsystems to ensure system readiness for large‑scale AI deployments
  • Perform hardware–software co-validation of CPUs, GPUs, DPUs, NICs, accelerators, and memory subsystems under AI‑heavy workloads
  • Validate PCIe fabric behavior, NUMA topology, and data‑path efficiency for model training and inference
  • Diagnose complex issues across BIOS, firmware, OS, driver stack, container runtime, orchestration layer, and AI frameworks
  • Analyze system logs, kernel traces, hardware event telemetry, GPU health signals, and fabric diagnostics
  • Conduct root‑cause analysis of performance bottlenecks, training failures, model divergence, and hardware stability issues
  • Collaborate with silicon, firmware, OS, and AI software teams to resolve issues rapidly
  • Deploy and manage AI clusters: GPU servers, accelerators, high‑speed networking (InfiniBand, RoCE), and storage systems
  • Validate cluster readiness for distributed training, including bandwidth, latency, topology checks, and gradient‑sync performance
What we offer:
  • Comprehensive Healthcare Programs
  • Award Winning Financial Wellness Tools and Resources
  • Generous Leave of Absence for New Parents and Caregivers
  • Industry Leading Wellness Platform
  • Employee Assistance Program

Work Type:
Fulltime