This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
As a Production Quality Management Engineer at Uber, you will play a key role in improving the reliability, safety, and operational excellence of Uber's services. You will work closely with engineering teams to ensure systems meet production standards through tooling, automation, and process improvements that reduce risk and enhance engineering quality. This is a hands-on, highly collaborative role that provides exposure to large-scale systems, real-time incident management, and opportunities to grow both technical and operational expertise.
Job Responsibility:
Support Production Readiness: Participate in incident reviews, postmortems, and quality audits. Help enforce production standards across services
Drive Metrics & Automation: Build or enhance dashboards and tools (Tableau, SQL, etc.) to monitor reliability, track SLAs/SLOs, and identify improvement areas
Collaborate Across Teams: Work closely with Platform, Infrastructure, and Compliance teams to align on reliability practices and implement safeguards
Improve Engineering Workflows: Contribute to runbooks, PRRs (Production Readiness Reviews), lockdown processes, and alerting hygiene to streamline operations
Learn & Grow: Shadow experienced engineers, participate in architecture discussions, and gain exposure to high-severity incident management
Requirements:
Bachelor’s degree in Computer Science, Engineering, or equivalent experience
3+ years of experience in software engineering, production engineering, incident management, or related areas
Hands-on experience with SQL and/or Python for data extraction and analysis
Strong communication and documentation skills with attention to detail
Interest in systems reliability, incident management, and continuous improvement
Experience in production support, DevOps, or quality engineering environments (preferred)
Familiarity with SLAs, SLOs, alerting, and postmortem practices (preferred)
Basic coding experience in one or more of the following: Python or Go (backend), API development, or frontend technologies (e.g., React, JavaScript) (preferred)
Experience using AI-assisted development tools such as Cursor, GitHub Copilot, or Claude (preferred)
Exposure to incident management tools (e.g., Jira, PagerDuty, Linear) and monitoring platforms (e.g., Tableau, Grafana, Prometheus) (preferred)
Ability to work in a fast-paced environment and collaborate across globally distributed teams (preferred)
Nice to have:
Experience in production support, DevOps, or quality engineering environments
Familiarity with SLAs, SLOs, alerting, and postmortem practices
Basic coding experience in one or more of the following: Python or Go (backend), API development, or frontend technologies (e.g., React, JavaScript)
Experience using AI-assisted development tools such as Cursor, GitHub Copilot, or Claude
Exposure to incident management tools (e.g., Jira, PagerDuty, Linear) and monitoring platforms (e.g., Tableau, Grafana, Prometheus)
Ability to work in a fast-paced environment and collaborate across globally distributed teams