At Boeing, we innovate and collaborate to make the world a better place. We’re committed to fostering an environment for every teammate that’s welcoming, respectful and inclusive, with great opportunity for professional growth. Find your future with us.
Boeing Vancouver is embarking on an exciting journey to modernize and migrate our systems to the cloud. We are seeking a skilled Site Reliability Engineer to join our Defence & Government Services team. This position will focus on supporting the Boeing Global Services (BGS) business organization. This new SRE role will bridge the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, you will ensure the reliable and efficient operation of Boeing Vancouver’s systems and services within the Defence & Government Services Portfolio. The position will be based out of our Richmond, BC office, offering a flexible hybrid work style that allows for both virtual and in-office work.
As a Site Reliability Engineer at Boeing, you will play a pivotal role in streamlining our development and operations processes to ensure seamless software delivery and infrastructure management. You will collaborate closely with our development, architecture, and analytics teams to build and maintain robust systems, automate deployment pipelines, and optimize the performance, reliability, and scalability of our applications.
Job Responsibilities:
Design, build, and maintain scalable and highly available infrastructure and processes using modern DevOps practices
Deploy and support customer installations, ensuring a smooth setup and integration of our hybrid multi-tenant SaaS solutions into their environments
Provide both reactive and proactive support to customers, addressing issues as they arise and implementing strategies to prevent future incidents
Lead incident response efforts, perform root cause analysis, and implement preventive measures to minimize downtime and service disruptions
Develop and enhance automation tools and scripts to streamline operations, reduce manual intervention, and improve efficiency
Set up and manage monitoring and alerting systems to proactively identify and resolve performance issues
Analyze system capacity and performance metrics to forecast future needs and ensure scalability of services
Collaborate with cross-functional teams to identify and implement new tools, technologies, and processes to enhance DevOps practices
Implement and advocate for security best practices to protect our applications and customer data
Pioneer and support special projects
Serve as a go-to resource for tools development and process improvement, with the ability to write small custom web-based tools for internal use
Create and maintain comprehensive documentation for systems, processes, and incident response procedures
Effectively contribute to building the overall knowledge and expertise of the technical team
Conduct training sessions and provide mentorship to team members and other departments. Focus on teaching others to enable them to take on newer and bigger tasks, fostering a culture of continuous learning and improvement
Be available to support emergencies via a Boeing-provided mobile device
Monitor and improve the availability and reliability of applications
Optimize infrastructure and enhance performance
Develop tools and scripts to automate repetitive tasks, such as deployment, monitoring, and scaling
Quickly address failures or outages and implement solutions to prevent recurrence
Proactively analyze and improve system performance to meet service level targets
Work closely with development and operations teams to ensure seamless integration
Requirements:
7+ years in a software development or advanced technical support role
5+ years of experience in site reliability engineering, DevOps, or a related role
Proven experience in site reliability engineering, DevOps, or a related role, with a track record of successfully implementing and managing infrastructure and deployment pipelines
Candidate must be eligible for authorization under the Canadian Government Controlled Goods Program (CGP) assessment
Must be able to obtain Canadian Secret Level II Security Clearance
Must be legally able to work in Canada
Must not pose a risk to the safeguarding of controlled goods
Must be eligible to handle US export-controlled data
Fluency in English
Nice to have:
Strong proficiency in programming/scripting languages such as Python, Go, or Bash
Extensive experience with cloud platforms (e.g., AWS, Azure) and container orchestration technologies (e.g., Kubernetes, Docker)
Familiarity with monitoring and observability tools (e.g., Prometheus, Grafana, AWS CloudWatch)
Strong understanding of IAM policies, Key Vaults, and security best practices in cloud environments
Strong problem-solving skills and the ability to troubleshoot complex systems
Excellent communication and collaboration skills, with the ability to work effectively in a team environment
Proficiency in managing storage solutions such as Azure Storage, AWS S3, and related services
Familiarity with disaster recovery and business continuity solutions
Strong knowledge of AWS and experience deploying and managing applications within the AWS cloud environment
Proven experience in incident management and root cause analysis
Solid understanding of configuration management tools (e.g., Ansible, Chef, Puppet) and infrastructure-as-code tools such as Terraform, ARM templates, and AWS CloudFormation
Ability to write and integrate with small custom web-based tools for internal use, demonstrating basic web development skills
AWS Cloud Practitioner or AWS Cloud Developer certification
Azure AI Fundamentals certification
Ability to support emergencies via a Boeing mobile device
What we offer:
Competitive base pay and incentive programs
Industry-leading tuition assistance program that pays your institution directly
Resources and opportunities to grow your career
Up to $10,000 match when you support your favorite nonprofit organizations