The Senior Production Support Engineer will establish production engineering functions, support incident management, automate tasks, and collaborate with teams to enhance system performance.
Job Responsibilities:
Work with product feature teams to put appropriate monitors in place for production systems and applications, ensuring high availability and performance
Assist responders as needed with incidents and alerts, helping to diagnose and resolve issues in a timely manner
Collaborate with development and operations teams to implement fixes and improvements
Ensure root cause analysis is performed for critical incidents and track implementation of preventive measures
Automate repetitive tasks using scripting languages to improve efficiency and reduce manual work
Maintain documentation of production support procedures, incidents, and resolutions
Own the incident response and production support toolsets and processes, and measure and report on their effectiveness on a regular cadence
Requirements:
Programming fundamentals from a Computer Science degree program or software engineering bootcamp
3+ years of experience in a production support or similar role
Proficiency in using production support tools such as Datadog and PagerDuty
Strong understanding of incident management processes and best practices
Experience with scripting languages (e.g., Python, Bash) to automate tasks and improve efficiency
Familiarity with creating AI prompts and using agentic tools to assist with incident and support tasks
Excellent troubleshooting skills with strong attention to detail
Ability to work in a fast-paced environment and manage multiple priorities effectively
Strong communication skills to collaborate with cross-functional teams and provide clear updates to stakeholders
Adaptability: Ability to thrive in a startup environment