CrawlJobs Logo

Production Systems & Software Engineering Manager

United States, Los Lunas 173000.00 - 245000.00 USD / Year · Job Posted February 01, 2026
Apply Position
Job Link Share

Job Description

Meta is seeking a Production Systems & Software Engineering Manager to join our Data Center Site Operations (SiteOps) team. This role leads the Systems & Software Engineering team which drives the integration, performance, and alignment of tooling, automation, break/fix triage, and related workflows critical to Site Operations.

Job Responsibility

  • Develop and collaboratively own the roadmap for all tooling, automation, processes and workflows for compute, storage and accelerator delivery from Infra into mass production (MP) deployments. Serve as the central point of contact representing these functions across SiteOps
  • Develop and collaboratively own the processes and workflows required to support Global Operations in maintaining a high SLA for our compute, storage and accelerator platforms
  • Build relationships and collaboration with engineering and cross functional teams across the company. Actively solicit feedback from teams, and use that feedback to improve operational effectiveness as infrastructure scales
  • Lead the team to identify and root cause systemic issues in the fleet and drive resolution. Deliver maximum server fleet up-time and utilization rates, by leveraging data to understand hardware failure conditions and root cause
  • Provide people management, mentorship, coaching, and career development to build an environment fostering commitment to impact
  • Support leadership meetings and facilitate alignment on key issues and opportunities
  • provide timely alerts and data for enabling cross-functional teams to develop requisite corrective actions and forward looking implementations
  • Collaborate with stakeholders, functional owners and subject matter experts to interpret and articulate business and operations needs
  • Travel up to 30% is required

Requirements

  • BS or BA in technical field or commensurate experience
  • 10+ years experience in managing teams in software design, workflows and validation, working with cross functional teams to deliver products to production
  • Experience working across a global organization and building partnerships with cross functional teams inside and outside of the organization
  • Demonstrated success in developing and executing a strategic roadmap that supports organizational scaling
  • Experience in processing and analyzing large sets of data
  • Demonstrated knowledge of server and storage platforms, principles, technologies, protocols, and standards
  • Experience managing multiple concurrent projects and managing tight timelines

Nice to have

  • Large-scale data center environment experience, including tooling and automation deployments
  • Experience in data center system and workflows development and deployments
  • Leadership presence and presentation skills

What we offer

  • bonus
  • equity
  • benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Production Systems & Software Engineering Manager

8 matching positions

Systems Software Engineering Manager

Meta is seeking a highly motivated and experienced Software Engineering Manager ...
Location
Location
United States , Denver, CO +4 locations
Salary
Salary:
219000.00 - 301000.00 USD / Year
meta.com Logo
Meta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Demonstrated experience recruiting and managing technical teams, including performance management and managing engineers
  • 12+ years, or PhD + 8 years, of software engineering work experience, including technical management
  • BS or MS in Computer Science, Engineering, or a related technical discipline or equivalent experience
  • 2+ years managing managers, 5+ years managing technical teams
Job Responsibility
Job Responsibility
  • Lead and manage a team of software engineers to deliver high-quality products and solutions
  • Collaborate with cross-functional teams to drive technical innovation and proven experience
  • Develop and implement technical strategies to achieve business objectives
  • Foster a work environment of continuous learning, growth, and improvement
What we offer
What we offer
  • bonus
  • equity
  • benefits
Read More
Arrow Right

Manager, Systems Engineering, Production

We are currently seeking a Manager, Systems Engineering, Production to join our ...
Location
Location
Canada , Toronto, Calgary, Vancouver
Salary
Salary:
172000.00 - 258000.00 CAD / Year
clio.com Logo
Clio
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Demonstrated success in people leadership in systems engineering, particularly in large-scale SaaS products
  • Competency in at least one software development language
  • Diverse base of knowledge that allows you to help your team solve complex technical problems
  • The ability to describe successful projects you worked on, as well as a collection of lessons learned from failed projects
  • Demonstrated ability to hire the best and brightest engineers in a fast-paced job market - and to coach, develop, and retain engineering talent
  • You are equally energized by both your own technical work as well as contributing to the career growth of your team
  • You have strong opinions that are weakly held, and foster that same attitude in others
  • You believe in providing honest, actionable feedback to your team, and encourage your team to reciprocate
  • You devise roadmaps to guide your team, but aren't beholden to them -- you easily adapt to a constantly changing world
  • Growth mindset when it comes to process improvement and new technologies, especially AI
Job Responsibility
Job Responsibility
  • Defining the vision, and supporting your team in designing and implementing the architecture to power new global capabilities for Clio's customers
  • Authoring, reviewing, and shipping infrastructure-as-code in Clio’s cloud environments
  • Enabling product teams to self-serve issues and reduce the time-to-resolve for production issues by smoothing friction points through automation and creating observability tooling
  • Directly supervise, mentor, and collaborate with your individual contributors, and build cross-functional relationships across the engineering organization
  • Partner with Clio’s product development teams to support them in ensuring all of Clio’s products meet a high bar for security, performance, reliability, availability and change velocity
  • Work closely with the recruiting/people team to actively recruit and hire top talent to support Clio’s ambitious growth plans for our infrastructure and development teams
What we offer
What we offer
  • Competitive, equitable salary with top-tier health benefits, dental, and vision insurance
  • Hybrid work environment, with expectation for local Clions (Vancouver, Calgary, Toronto, and Dublin) to be in office minimum 2 days per week on our Anchor Days
  • Flexible time off policy, with an encouraged 20 days off per year
  • $2000 annual counseling benefit
  • RRSP matching and RESP contribution
  • Clioversary recognition program with special acknowledgement at 3, 5, 7, and 10 years
  • Fulltime
Read More
Arrow Right

Senior Software Engineering Manager - Pharmacy Systems

We’re building a world of health around every individual — shaping a more connec...
Location
Location
United States , Woonsocket
Salary
Salary:
130295.00 - 260590.00 USD / Year
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
June 26, 2026
Flip Icon
Requirements
Requirements
  • 7+ overall years of software development experience either building or supporting / managing Backend development teams
  • 5+ years of backend software development experience with programming languages/tools including: Java, Spring boot
  • 3+ years of leadership experience, directly managing individual contributors
  • 3+ years building of prior hands on experience developing highly available, performant, scalable Backend services
  • Bachelor’s degree or, equivalent experience (HS diploma + 4 years relevant experience)
Job Responsibility
Job Responsibility
  • Partner with application owners, business partners and peer groups regarding long and short-range technical solutions that meets business objectives
  • Analyze and contribute to project and business requirements based on product team milestones and priority
  • Actively participate in Agile Scrum team activities including Sprint Planning, Grooming, Scrum, Reviews and Retrospectives
  • Be a technologist and work with other Engineers in planning, prioritizing and performing assigned tasks within deadlines
  • Will be primarily responsible for Backend Application development & delivery including production deployment, application operationalization, and observability
  • Develop GraphQL APIs using Springboot and other tech stacks (Open source and proprietary)
  • Unit testing using frameworks such as Junit, Mockito
  • Build and deploy services using GitHub Actions as part of CI/CD process in leading Cloud Platforms - AWS, GCP or Azure etc.
  • Continuously checking and monitoring App health and KPIs, support triage of any production issues as and when needed
  • Be an advocate for and implementer of security best practices
What we offer
What we offer
  • Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
  • Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility
  • Fulltime
!
Read More
Arrow Right

Manager, Software Engineering - Design Systems Management

Figma is growing our team of passionate creatives and builders on a mission to m...
Location
Location
United States , San Francisco; New York
Salary
Salary:
250000.00 - 350000.00 USD / Year
figma.com Logo
Figma
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4+ years of engineering management experience, including 2+ years of manager of manager experience
  • 8+ years of product engineering experience
  • Built high-performing and highly engaged engineering teams through hiring and growing people
  • Proven track record of building and shipping beloved and high-quality products
  • A deep passion for design craft and usability
Job Responsibility
Job Responsibility
  • Manage and support a team of experienced engineers to drive design systems usage across Figma’s product suite, with a goal of growing the team in 2026
  • Partner with product and design leadership to set the strategy, priorities, and mission for the team and roadmap
  • Roll up your sleeves to provide technical leadership of the team, including supporting their technical decisions and growth of team members
  • Build a collaborative culture of doing great work together, by investing in direct mentorship, shaping team practices, and helping guide the team towards meaningful work
  • Grow your career in a collaborative and creative engineering community
What we offer
What we offer
  • equity
  • health, dental & vision
  • retirement with company contribution
  • parental leave & reproductive or family planning support
  • mental health & wellness benefits
  • generous PTO
  • company recharge days
  • a learning & development stipend
  • a work from home stipend
  • cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Manager, Software Engineering (User Systems)

Join Simplisafe's User Systems team as a hands-on Manager of Software Engineerin...
Location
Location
United States , Boston
Salary
Salary:
142800.00 - 209500.00 USD / Year
simplisafe.com Logo
SimpliSafe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field
  • 8+ years of professional software engineering experience
  • at least 1-2 years of experience in a formal management or technical lead role managing direct reports
  • deep expertise in developing and deploying complex, high-traffic backend systems and microservices
  • proficiency in at least one major programming language (e.g., JavaScript/TypeScript, Rust, Java, Go, Python, C#) with the adaptability to work with multiple languages
  • strong understanding of distributed systems, relational and NoSQL databases (e.g., MySQL, MongoDB, DynamoDB), caching, and message queues (e.g., Kafka, RabbitMQ)
  • hands-on experience building, deploying, and maintaining cloud-based backend systems (AWS, GCP, or Azure)
  • familiarity with Agile methodologies (Scrum or Kanban) and DevOps principles
Job Responsibility
Job Responsibility
  • Manage and mentor a team of 3-5 backend software engineers, fostering a culture of ownership, continuous improvement, and technical excellence
  • conduct regular one-on-ones, provide coaching, write and deliver performance reviews, and support career development plans for all team members
  • drive the planning, execution, and successful delivery of projects within an Agile/Scrum framework, ensuring on-time delivery and high-quality results
  • partner with product managers, QA, and other engineering teams to define requirements, scope projects, and manage dependencies
  • serve as a technical leader and active individual contributor, spending a significant portion of your time writing high-quality, production-ready code in Typescript/Javascript and Rust
  • lead the design, architecture, and implementation of scalable, high-availability, and fault-tolerant backend services and APIs
  • set and enforce technical standards, conduct rigorous code and design reviews, and ensure the team adheres to best practices in areas such as testing, monitoring, and security
  • oversee the deployment, monitoring, and maintenance of production systems, and participate in an on-call rotation as needed
What we offer
What we offer
  • A mission- and values-driven culture and a safe, inclusive environment where you can build, grow and thrive
  • a comprehensive total rewards package that supports your wellness and provides security for SimpliSafers and their families
  • free SimpliSafe system and professional monitoring for your home
  • employee resource groups (ERGs) that bring people together, give opportunities to network, mentor and develop, and advocate for change
  • participation in our annual bonus program, equity, and other forms of compensation, in addition to a full range of medical, retirement, and lifestyle benefits
  • Fulltime
Read More
Arrow Right

Engineering Manager, Production Engineering

We're looking for a hands-on Engineering Manager to lead our Production Engineer...
Location
Location
United States , San Francisco
Salary
Salary:
209000.00 - 253000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of software or infrastructure engineering experience, with at least 1–2 years in an engineering management or tech lead role
  • Strong SRE or production engineering background — hands-on experience with incident management, SLO frameworks, runbooks, and on-call operations
  • Solid coding ability
  • comfortable writing production-grade code in Go, Python, or similar languages to build tooling and automation
  • Experience working with or embedding into cross-functional product teams, and influencing engineering decisions across organizational boundaries
  • Familiarity with container orchestration and cloud-native infrastructure — Kubernetes, distributed systems, and cloud service architectures
  • Strong communication skills — able to clearly represent technical risk and operational status to both engineering peers and business stakeholders
Job Responsibility
Job Responsibility
  • Leading and growing a team of SREs embedded within Crusoe's AI product areas, setting technical direction and fostering a culture of ownership and continuous improvement
  • Contributing as an IC — reviewing code, building tooling, and driving automation to reduce toil and improve the reliability and scalability of production services
  • Owning SLA/SLO performance, incident response, and on-call health for service offerings
  • leading blameless post-mortems and driving systemic remediation
  • Partnering with embedded product and platform engineering teams to influence infrastructure design, observability strategy, and operational readiness for new and existing services
  • Defining and tracking reliability, performance, and operational maturity metrics across the team
  • translating data into prioritized roadmap investments
  • Serving as a technical escalation point for high-severity production incidents affecting enterprise customers, and collaborating with Cloud Support and Customer Success on resolution and communication
What we offer
What we offer
  • Industry competitive pay
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Fulltime
Read More
Arrow Right

Principal Software Engineering Manager - AI Engineering

The Fabric Data Engineering Experience & Infrastructure team is hiring a Princip...
Location
Location
Canada , Vancouver
Salary
Salary:
142400.00 - 257500.00 CAD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Job Responsibility
Job Responsibility
  • Lead and grow a team: Hire, onboard, coach, and develop engineers
  • set clear expectations
  • create an inclusive culture of accountability, learning, and collaboration.
  • Drive execution and delivery: Guide team planning and prioritization across multiple workstreams
  • manage dependencies, risks, and release readiness
  • ensure predictable delivery from requirements → architecture → implementation → rollout → live-site operations.
  • Shape requirements with partners: Partner with Product Management, Design, Research, and dependent engineering teams to translate ambiguous customer needs into crisp scenario plans and measurable outcomes.
  • Guide architecture and technical strategy: Lead identification of dependencies and development of design documents
  • guide architectural decisions for distributed, cloud-scale systems (Spark/PySpark + Python services) with explicit tradeoffs across performance, reliability, cost, security, privacy, and operability.
  • Raise the engineering quality bar: Establish and reinforce engineering standards (design reviews, coding patterns, test strategy, performance practices, operational readiness)
  • Fulltime
Read More
Arrow Right

Principal Software Engineering Manager - Data Science & Engineering

The MSRC Data Science team is responsible in building data pipelines, data minin...
Location
Location
United States , Redmond
Salary
Salary:
139900.00 - 274800.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Leads team on the disciplined use of, and improving artificial intelligence (AI) tools and practices across the software development lifecycle (SDLC)
  • Guides team on proactively taking responsibility for the content of their AI-generated requirements, design documents, code, and other assets, and assisting other members of the team to do the same
  • Leads team on incorporating Responsible AI practices into the SDLC to ensure appropriate controls over AI-generated assets
  • Coaches team on applying SDLC and engineering health measures (e.g., Accelerate, SPACE framework, Engineering System Success Playbook [ESSP]) to guide improvements to processes and practices, especially those involving AI
  • Leads team on experimenting with AI tools and practices to improve their own capabilities, and providing recommendations on how to adopt them to others
  • Reviews debugging tools, tests, logs, telemetry, and other methods, and acts as an expert for others to proactively verify assumptions while developing code before issues occur across products in production
  • Guides team to perform machine learning/data extraction, transformation, and loading (ETL) pipelines (e.g., data collection, cleaning) based on data prepared
  • Guides the architecture of scalable pipelines and datasets
  • Influences the direction of the team
  • Begins to anticipate potential data pipeline issues and provides solutions
  • Fulltime
Read More
Arrow Right