CrawlJobs Logo

Senior Product Manager, Investigation & Troubleshooting

United States 180000.00 - 230000.00 USD / Year · Job Posted February 20, 2026

Job offer has expired

Job Link Share

Job Description

As Senior Product Manager, Investigation & Troubleshooting, you will own the diagnostic experience for Temporal users when things are not working as expected. You'll define how developers debug workflows, trace execution across the distributed system, and integrate with their observability stack. This is high-leverage work: reducing mean-time-to-resolution directly impacts customer trust and developer productivity.

Job Responsibility

  • Own the roadmap for workflow debugging and diagnostic tooling, including trace exploration, execution replay, and root cause analysis
  • Drive distributed tracing strategy, enabling customers to correlate Temporal workflows with their broader application traces
  • Define and evolve APM integrations (Datadog, Honeycomb, Dynatrace, etc.) to meet customers where their observability stack already lives
  • Lead AI-assisted diagnostics initiatives for agentic workflows, helping developers understand complex execution patterns
  • Partner with SDK teams to surface actionable debugging context from workers and activities
  • Build the "why did my workflow fail?" experience across UI, CLI, and programmatic access
  • Collaborate with customers running mission-critical workloads to understand their incident response and debugging workflows

Requirements

  • Deep expertise in observability systems with practical experience in distributed tracing, log correlation, and debugging tools
  • Background in APM/observability platforms or developer tools where diagnostic experience is core to the product
  • Familiarity with OpenTelemetry, distributed systems debugging, and the challenges of correlating events across service boundaries
  • Proven track record delivering developer-facing diagnostic tooling that reduces time-to-resolution
  • A developer-empathetic, customer-focused product management style with strong technical communication skills

Nice to have

Experience with AI/ML applications in observability or debugging contexts

What we offer

  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,800 / Year Professional Enrichment (Career Development & Professional Memberships)
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $74 / Month Reimbursement for Internet
  • Calm App Subscription for Mental Health & Wellness
  • Eligibility to participate in Temporal's equity plan

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Product Manager, Investigation & Troubleshooting

8 matching positions

Value Stream Manager

The role is responsible for the operations performance (Safety, Quality, Deliver...
Location
Location
Ireland , Stamullen, Co. Meath
Salary
Salary:
Not provided
pci.com Logo
PCI Pharma Services
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Degree/diploma qualified in a science/engineering/manufacturing discipline. May substitute experience in lieu of education
  • Min 5 years’ manufacturing experience in a GMP or Regulated Environment
  • Min 2 years’ experience of people-management
  • Strong knowledge of production & quality systems
  • Experience in Lean Manufacturing, including experience in Lean tools to develop continuous improvements
  • Ability to manage multiple tasks and set priorities
  • Problem Solving and Troubleshooting skills. The successful candidate should display a dynamic and proactive approach to manage production issues
  • Self-motivated with a results driven approach
  • Flexibility to work across different shifts on request in line with business needs
  • Adaptable and ability to work collaboratively
Job Responsibility
Job Responsibility
  • Ensure compliance with relevant safety and cGMP regulations and adherence to Company policies across all production activities, fostering a culture of safety, compliance and efficiency
  • Lead the establishment and ongoing enhancement of best practices and operational solutions, driving Operational Excellence and service quality across the value stream
  • Manage all operational aspects of their area of responsibility and align materials, quality, engineering, production and customer service functions to establish reliable processes that improve safety, quality, schedule and cost performance. Specifically, work with the Planning, Warehouse & QA teams to ensure production kits are available OTIF
  • Support the QMS function in dealing with customer complaints, ensuring thorough investigations are conducted and effective CAPAs are put in place
  • Be the Production point of contacts for the NPI team and respective clients in the running of trials and PVs for new products on existing and new packing lines
  • Ensure the production area is maintained in an audit ready state for both customer and regulatory inspections and act as SME for all visits
  • Ensure clear and accurate communication of current production status to all relevant departments through the appropriate meetings / updates
  • Spearhead continuous improvement and innovation initiatives within their areas of responsibility to reduce cycle time, increase throughput and minimize changeover losses to increase internal capacity
  • Establish and utilize metrics to drive improvement activity and ensure that decisions are based on facts and measurement trends
  • Interface with customer as required in a professional manner to ensure customer satisfaction
  • Fulltime
Read More
Arrow Right

Production Manager

To support the Entertainment HOD in the overall success and consistent developme...
Location
Location
United Kingdom , Bognor Regis
Salary
Salary:
Not provided
butlins.com Logo
Butlin's
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong people skills and knowledge of live production environments
  • Previous demonstratable experience in a leadership role within entertainments and performance
  • Previous experience of managing an operation where speed is important whilst still maintaining high levels of guest experience and high quality
  • Demonstratable experience of creating and leading development programmes & improving performance standards
  • Ability to effectively lead, motivate and engage team, even in times of high demand and under pressure of a live entertainment environment
  • Ability to communicate effectively at all levels
  • Highly detail conscious to ensure the highest of standards are implemented, with good organisational skills and problem-solving skills
Job Responsibility
Job Responsibility
  • Lead, develop and manage the production teams, being accountable for all ‘on stage’ and ‘off stage’ product that involves direct reports
  • Support recruitment, rehearsal planning, onboarding and training of production team and Wardrobe
  • Consistently evaluate the quality of entertainment product and its delivery in all areas
  • Create rotas to ensure appropriate coverage across all venues and show times
  • Develop and implement a robust understudy programme
  • Regular communication and team meetings with the Cast Captains and Wardrobe team
  • Responsible to uphold the Team performance cycle of catch ups
  • Manage performance, absence, conduct and development within the team
  • Promote a collaborative and inclusive team culture
  • Ensure all productions comply with H&S legislation and internal policies
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

As part of our Global Operations Team, this position is responsible for systems ...
Location
Location
United States , San Diego
Salary
Salary:
53.00 - 56.00 USD / Hour
gomillenniumsoft.com Logo
MillenniumSoft Inc
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or relevant field (highly preferred)
  • 5+ years’ experience as a DevOps Engineer or equivalent software-engineering role
  • Proven work experience in DevOps or relevant experience
  • Experience with site reliability services and application performance monitoring
  • Experience in incident management workflow
  • Exercise computing skills to deploy upgrades and fixes in a Windows .Net environment. (Azure highly desirable)
  • Develop internal tools and scripts to support product engineers
  • Troubleshoot production issues and coordinate with the support staff and product engineers to streamline code deployments
  • Maintain continuous integration and delivery pipelines that produce rapid, low-risk releases and improved velocity
  • Collaborate with team members to improve the company’s engineering tools, systems and procedures and data security
Job Responsibility
Job Responsibility
  • Manage information technology and computer systems running Windows, Linux and virtualization in a private data center
  • Design, develop, implement and coordinate systems, policies and procedures
  • Ensure security of information, reliability and scalability of services
  • Possess familiarity of software-automation production systems (Octopus and Terraform)
  • Exhibit expertise in software development methodologies
  • Identify problematic areas and implement strategic solutions in time
  • Assist with response to platform issues, retrospectives, and future enhancements
  • Can work independently and as part of a team with the ability to lead projects from plan, design, and release
  • Able to effectively communicate to all levels of management and technical teams
  • On-call rotation 24x7 support for production and non-production environments
  • Fulltime
Read More
Arrow Right

Nxe Fls Production Engineer Cww D2

ASML is the world’s leading provider of lithography systems for the semiconducto...
Location
Location
United States , Wilton
Salary
Salary:
Not provided
asml.com Logo
ASML
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Mechanical, Electrical or other relevant engineering domain
  • Technical experience preferred
  • Must display a solid understanding of complicated manufacturing assemblies
  • Working knowledge and experience with integrated Hardware/Software testers
  • Working knowledge and experience with CMM tooling is a plus
  • Working knowledge with complicated multi-disciplinary (Electro-Mechanical) assemblies is a plus
  • Working knowledge of Unix/Linux/Python/ or MATLAB software languages is a plus
  • Working knowledge of statistical process control and FMEA is a plus
  • Working knowledge of NX, Teamcenter, SAP and Twinscan software is a plus
  • Working knowledge with vacuum systems is a plus
Job Responsibility
Job Responsibility
  • Provide first-line troubleshooting support to ensure uninterrupted production output
  • Spends 75% of time in the Cleanroom supporting production processes and leading troubleshooting actions
  • Provide real-time troubleshooting support for production issues to minimize downtime and optimize output
  • Document all disturbances and resolutions accurately for future learning
  • Escalate issues to senior FLS when resolution cannot be achieved independently
  • Capture learnings from troubleshooting efforts and use them to write & enhance OCAPs
  • Troubleshoot mechanical assembly, glass to metal bonding, and tooling stand issues
  • Create ETPs, task lists, and DOs related to escalation management, tooling maintenance, and troubleshooting
  • Advancing problem resolution through detailed investigation of recurring issues
  • Support and maintain SPC tracking for tooling and products
What we offer
What we offer
  • D2 Shift: Every Thursday, Friday, Saturday and every other Wednesday, 6AM - 6PM
  • Fulltime
Read More
Arrow Right

First Line Support Production Engineer

The First Line Support Production Engineer is responsible for providing first-li...
Location
Location
United States , Wilton
Salary
Salary:
Not provided
asml.com Logo
ASML
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Mechanical, Electrical or Mechatronics engineering with 2+ years of experience in a production engineering environment
  • Masters Degree in Mechanical, electrical or Mechatronics engineering and prior internship experience within a production environment
  • Prior experience troubleshooting mechanical, and or electro-mechanical parts
  • Strong troubleshooting and analytical skills
  • Ability to pick up structured problem-solving methodologies
  • Exposure to SPC and data-driven process control
  • Ability to document technical issues and resolutions clearly
  • Effective collaboration with cross-functional teams
  • Exposure in escalation management and technical documentation
  • Familiarity with quality processes
Job Responsibility
Job Responsibility
  • Provide first-line troubleshooting support to ensure uninterrupted production output
  • Provide real-time troubleshooting support for production issues to minimize downtime and optimize output
  • Document all disturbances and resolutions accurately for future learning
  • Escalate issues to senior FLS when resolution cannot be achieved independently
  • Capture learnings from troubleshooting efforts and use them to write & enhance OCAPs (Out of Control Action Plans)
  • Execute ‘As New’ based on input and documentation from D&E/Module PE to ensure product quality meets defined standards
  • Perform ‘cherry picks’ to select and evaluate specific parts for quality and process validation
  • Create ETPs (Engineering Test Plans), task lists, and DOs (Disturbance Orders) related to escalation management, tooling maintenance, and troubleshooting
  • Advancing problem resolution through detailed investigation of recurring issues
  • Support and maintain SPC tracking for tooling and products, ensuring data-driven process control
  • Fulltime
Read More
Arrow Right

Senior Applications Specialist

Location
Location
Canada , Mississauga
Salary
Salary:
Not provided
advancedtechsearch.com Logo
Advanced Technology Search Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • A Degree in Electrical Engineering, Computer Science, or related technical discipline or equivalent experience
  • At least 3 years of experience in advanced systems engineering support, focusing on complex technical problem resolution
  • Proven generalized understanding of computer networking (LAN, WAN, NAT, DNS, Basic Firewalls, etc.)
  • Hands-on experience with Linux and/or Windows (CMD, Bash, PS, regedit, etc.)
  • Demonstrated ability to diagnose sophisticated technical issues and implement effective solutions
  • Ability to work collaboratively with cross-functional teams
  • Willing to adapt to evolving technologies and industry standards
Job Responsibility
Job Responsibility
  • Analyze complex technical issues and system integrations to identify root causes and develop effective solutions
  • Conduct systematic analysis to diagnose customer system issues and implement effective technical solutions
  • Travel to customer’ sites in Canada and US for advanced troubleshooting and customer support
  • Collaborate with designers, developers, and stakeholders and well as technical support team to endure seamless product integration and customer satisfaction
  • Manage the deployment and configuration of integrated systems, ensuring optimal performance and reliability
  • Develop detailed Product Support Documents, and train internal technical support as appropriate
  • Develop comprehensive technical manuals and field installation guides to support customers during product installation, commissioning, and troubleshooting
  • Investigate and review recurring product issues to drive product improvements
  • Equip and support the team with in-depth product knowledge and configuration strategies
  • Provide post-sales customer support, including consultation on product configuration, installation, and usage
  • Fulltime
Read More
Arrow Right

Process Development Senior Engineer

Process Development Senior Engineer for Amgen Singapore Manufacturing. Amgen is ...
Location
Location
Singapore , Tuas
Salary
Salary:
Not provided
amgen.com Logo
Amgen
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Doctorate degree
  • Master’s degree and 2 years of directly related experience
  • Bachelor’s degree and 4 years of directly related experience
  • Diploma and 8 years of directly related experience
  • 6+ years of relevant work experience in the commercial manufacturing environment within the biotechnology or pharmaceutical industries
  • At least 4 years’ experience with regulated environments (i.e. cGMP) required
  • In depth cell culture and/or purification process knowledge, including single-use technologies and harvest technologies
  • Good understanding of process characterization and process scale up to resolve technical issues observed during transfer / manufacturing at large scale.
  • Experienced in providing floor support, troubleshooting unit operations, and resolving and documenting investigations to support cGMP production
  • Experienced in Technology Transfer of new process and technology to commercial site.
Job Responsibility
Job Responsibility
  • Assume a Subject Matter Expert role within ASM Process Development for cell culture and/or purification commercial process support.
  • Act as Drug Substance Team Leader within the global Product Development Team (PDT)
  • providing stewardship of product lifecycle and process improvement.
  • Lead complicated investigations independently, providing concise communications to teams and leadership
  • Integrate trends, data and information into plans, deliverables and recommendations
  • Manage key projects to deliver site goals while meeting quality, schedule, and cost objectives.
  • lead productivity projects, process optimization, and product life cycle management.
  • Collaborate with cross-functional teams and network drug substance teams to resolve process challenges by applying advanced technical principles and concepts for troubleshooting.
  • Apply best practices to leverage data for more proactive and predictive approaches.
  • Support regulatory filings, audits and inspection, and other CMC activities (e.g author and review regulatory submissions and responses to questions as required).
What we offer
What we offer
  • Vast opportunities to learn and move up and across our global organization
  • Diverse and inclusive community of belonging, where teammates are empowered to bring ideas to the table and act
  • Generous Total Rewards Plan comprising health, finance and wealth, work/life balance, and career benefits
Read More
Arrow Right

Senior Software Engineer - Infrastructure Reliability

We are seeking a Senior Software Engineer to join our Security Product team, foc...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
jfrog.com Logo
JFrog
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in software engineering, with at least 3+ years focused on debugging and solving infrastructure-level problems in distributed systems
  • Strong proficiency in Go
  • familiarity with Python and Helm is a plus
  • Deep hands-on experience with RabbitMQ or similar message brokers (Kafka, ActiveMQ) - including queue management, clustering, monitoring, and production troubleshooting
  • Solid working knowledge of Kubernetes (pod lifecycle, resource management, networking, debugging CrashLoopBackOff / OOMKilled scenarios) and Docker
  • Experience investigating production incidents and conducting post-incident reviews with clear root cause analysis and follow-through
  • Strong understanding of Linux systems, networking fundamentals, and cloud infrastructure (AWS, Azure, or GCP)
  • Ability to read and interpret logs, thread dumps, heap dumps, and system metrics to isolate root causes under time pressure
  • Excellent analytical and problem-solving skills with a methodical approach to debugging
  • Strong written and verbal communication skills - ability to produce clear incident reports, root cause analyses, and playbooks, and to communicate effectively across engineering, SRE, and customer-facing teams
Job Responsibility
Job Responsibility
  • Investigate system outages and production failures across customer environments (SaaS and self-hosted), spanning RabbitMQ, Kubernetes, Docker, Postgres, and cloud infrastructure (AWS, Azure, GCP)
  • Identify recurring failure patterns and systemic weaknesses from incident data, and drive them to resolution - whether by writing Go code yourself (resilience features, infrastructure fixes, observability) or by collaborating with service owners to prioritize and address reliability gaps
  • Lead and participate in post-incident reviews - document root causes, corrective actions, and follow through to ensure issues are properly resolved
  • Collaborate with production engineering and SRE teams to develop and maintain operational playbooks and runbooks that reduce time-to-resolution
  • Diagnose root causes across the full stack - message queue failures, container lifecycle issues, cloud networking, disk and memory pressure, and deployment topology mismatches
  • Design and implement data migrations and lifecycle management for infrastructure components such as queue management and vhost operations
  • Emit and monitor operational metrics to proactively detect infrastructure degradation and measure service reliability
Read More
Arrow Right