Site Reliability Product Owner Job at Boeing (Kent, Washington)

Site Reliability Product Owner

Boeing

Location:
United States, Kent, Washington

Category:
IT - Software Development

Contract Type:
Not provided

Salary:

224100.00 - 273900.00 USD / Year

Job Description:

The Site Reliability Product Owner leads end-to-end release engineering and operationalization for a growing, multi-application software portfolio across multiple missions and effectivities. The role owns release coordination, the bug/fix lifecycle, customer and multi-level leadership approvals, incident command, and post-incident reporting. This hands-on, development-focused role requires strong AWS infrastructure and Python automation skills, practical knowledge of signal-processing algorithm behavior to interpret anomalous system results, and ownership of on-call scheduling with an expectation of ~80% availability while assigned. The Product Owner defines and implements environment-wide monitoring and observations, builds comprehensive monitoring strategies (real-time system health, anomaly detection, and alerting to pre-empt resource exhaustion and performance degradation), and develops environment monitoring dashboards and application monitoring using APM tools with proactive thresholds. Responsible for CI/CD and release quality, the role validates release candidates through operational and enterprise testing, compiles and coordinates release packages, facilitates the transition of development activities into operational environments, and enforces release control (scheduling, versioning, change control) while tracking and verifying fixes. The position also drives continuous improvement: standardizing runbooks, automating deployment and recovery workflows, instrumenting DORA-style KPIs (deployment frequency, lead time, change success rate, MTTR), and partnering with engineering, suppliers, and the customer to reduce downtime, accelerate delivery cadence, and enable future capability growth and proposal support.

Job Responsibility:

Oversee end-to-end release engineering and sustainment for a multi-application portfolio supporting multiple missions and effectivities
Own release control processes: scheduling, versioning, change control, approvals, and authoritative configuration/deployment records
Coordinate and compile release packages; validate release candidates through operational and enterprise testing and facilitate development activities into operational environments
Track, verify, and communicate bug/fix status across the portfolio and obtain customer and multi-level leadership sign‑offs prior to deployments
Define, implement, and maintain environment monitoring and observations across all environments, including real‑time system health, anomaly detection, and alerting to pre‑empt resource exhaustion and performance degradation
Design and maintain environment monitoring dashboards, application monitoring, and APM monitoring tools with proactive thresholds to surface performance issues
Manage on-call scheduling and incident response; serve as incident commander during outages, lead diagnostics and mitigation, and prepare and present executive incident slide decks and after-action reports
Instrument and track release and operational KPIs (deployment frequency, lead time, change success rate, MTTR) and drive continuous improvement to release cadence and reliability
Automate deployment, rollback, and recovery workflows using Python and cloud-native tooling (including serverless patterns) to reduce manual effort and MTTR
Advise on signal‑processing algorithm behavior and cloud operations at scale to interpret anomalous outputs and recommend corrective actions
Coordinate supplier management and cross‑functional team activities to ensure release readiness, quality, and contractual compliance
Maintain and update operational runbooks, playbooks, and run-to-failure/response procedures; train and mentor junior SWE staff as the sustainment team grows
Support research into emerging technologies and contribute technical inputs for proposals, bids, and future architecture planning
Serve as the primary Boeing representative to the customer enterprise for release and sustainment matters, ensuring clear, accurate, and timely stakeholder communications

Requirements:

Bachelor’s Degree in an engineering discipline or 18 years’ directly related work experience or 22 years’ related relevant work experience
20+ years of experience in software engineering, with demonstrated expertise in cloud‑native distributed systems, orchestration, and operationalizing services at scale (including serverless and containerized deployments)
1+ years of experience deploying and managing distributed systems on cloud platforms (e.g., Azure, AWS, GCP)
1+ years of experience with Engineering Releases
1+ years of experience in managing product backlog, writing user stories, and managing releases
1+ years of experience with cloud platforms (e.g., AWS or Azure), infrastructure as code (e.g., Terraform), and automation tools (e.g., Puppet, Ansible, Chef)
1+ years of experience developing and operating microservice, containerized, or serverless applications
1+ years of experience with signal processing or image processing

Nice to have:

1+ years incident management experience, including leading post-incident reviews and preparing executive-level incident reports and slide decks
3+ years of experience in Python development, scripting, and automation
Experience building operational tooling and automation for deployments and incident response

What we offer:

Generous company match to your 401(k)
Industry-leading tuition assistance program pays your institution directly
Fertility, adoption, and surrogacy benefits
Up to $10,000 gift match when you support your favorite nonprofit organizations
Health insurance
Flexible spending accounts
Health savings accounts
Retirement savings plans
Life and disability insurance programs
A number of programs that provide for both paid and unpaid time away from work
Relocation based on candidate eligibility

Additional Information:

Job Posted:
March 04, 2026

Expiration:
March 18, 2026

Employment Type:

Full-time

Work Type:

On-site work
