CrawlJobs Logo
Briefcase Icon
Category Icon

Manager - AI Observability Jobs (Hybrid work)

4 Job Offers

Filters
Observational Research Manager
Save Icon
Join Amgen's Center for Observational Research (CfOR) in Seoul. Lead innovative studies using real-world data to advance drug development and patient outcomes. We seek a PhD pharmacoepidemiologist with a strong publication record and excellent communication skills. Enjoy a collaborative, global e...
Location Icon
Location
South Korea , Seoul
Salary Icon
Salary
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Senior Product Manager, Observability & Monitoring
Save Icon
Lead the Observability & Monitoring product for a leading enterprise data platform. This senior role in Denver or Oakland requires B2B SaaS experience and technical fluency in logs, APIs, and monitoring pipelines. You'll define the roadmap, integrate AI-native features, and ensure reliability for...
Location Icon
Location
United States , Denver; Oakland
Salary Icon
Salary
184800.00 - 231000.00 USD / Year
fivetran.com Logo
Fivetran
Expiration Date
Until further notice
Senior Product Manager, Observability & Monitoring
Save Icon
Lead the Observability & Monitoring product for a leading enterprise data platform. This high-impact role requires deep technical expertise in SaaS, logs, and APIs to deliver a world-class customer experience. You'll shape the roadmap using AI-native thinking in a collaborative, fast-paced enviro...
Location Icon
Location
United States , Oakland; Denver
Salary Icon
Salary
211200.00 - 264000.00 USD / Year
fivetran.com Logo
Fivetran
Expiration Date
Until further notice
New
Observability Event Management Engineer
Save Icon
Join Citi's IDEAS Observability team as an Event Management Engineer in Irving. Architect and develop cutting-edge solutions, processing events with AI/ML, ticketing, and automation. Requires 6+ years in event management systems (e.g., BigPanda), Linux/Windows, and Agile. Enjoy comprehensive bene...
Location Icon
Location
United States , Irving
Salary Icon
Salary
125760.00 - 188640.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Explore a career at the forefront of modern technology with Manager - AI Observability jobs. This pivotal leadership role sits at the intersection of artificial intelligence, software engineering, and IT operations, dedicated to ensuring the health, performance, and reliability of complex AI systems in production. As businesses increasingly rely on AI and machine learning models to drive decision-making and automate processes, the need for professionals who can manage and oversee the observability of these systems has never been greater. A Manager in AI Observability is responsible for building and leading a team of engineers to create a transparent, trustworthy, and efficient AI operational environment. Professionals in these roles typically oversee the strategy and implementation of a comprehensive observability framework. This framework is designed to provide deep insights into the behavior of AI models and the infrastructure they run on. Common responsibilities include defining the technical roadmap for monitoring, logging, tracing, and alerting systems specifically tailored for AI workloads. They manage the collection and analysis of key data points, such as model latency, throughput, prediction accuracy (data drift and concept drift), and resource utilization. A core part of the job involves translating this telemetry data into actionable intelligence, enabling proactive issue detection, rapid root cause analysis, and ensuring models perform as intended after deployment. This leadership position also involves cross-functional collaboration, working closely with data scientists, ML engineers, and product teams to establish Service Level Objectives (SLOs) and uphold a high standard of operational excellence. The typical skills and requirements for Manager - AI Observability jobs are a blend of technical depth and leadership acumen. A strong background in software engineering, DevOps, or Site Reliability Engineering (SRE) is fundamental, often coupled with experience in cloud platforms like AWS, GCP, or Azure. Candidates are expected to have hands-on knowledge of observability tools for metrics, logs, and traces (e.g., Prometheus, Grafana, ELK Stack, Jaeger) and understand how to apply them to machine learning pipelines. A solid grasp of MLOps principles and the machine learning lifecycle is crucial. Beyond technical expertise, successful managers possess exceptional leadership skills to mentor and grow a team, strong strategic thinking to align observability initiatives with business goals, and superb communication abilities to articulate complex technical concepts to non-technical stakeholders. They are results-oriented, adept at project management, and thrive in a dynamic environment where ensuring the reliability of AI is paramount. For those passionate about building resilient intelligent systems, Manager - AI Observability jobs offer a challenging and highly rewarding career path.

Filters

×
Countries
Category
Location
Work Mode
Salary