CrawlJobs Logo

Software Engineer - Capacity Planning

meta.com Logo

Meta

Location Icon

Location:
United States , Bellevue

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

219000.00 - 301000.00 USD / Year

Job Description:

Meta is seeking a principal engineer to join our Infrastructure Capacity Planning team, driving automation across our global supply chain ecosystem. You will architect and build frameworks that unify fragmented data, automate complex processes, and empower business and technical users to make decisions at scale. The team is responsible for building and scaling automation across data, process, and system layers, enabling Meta's infrastructure supply chain to meet rapid growth, frequent change, and increasing complexity. You will drive the design and implementation of automation frameworks, data integration, and governance solutions that span multiple systems, supporting business and technical users in planning, execution, and analytics.

Job Responsibility:

  • Architect and implement automation solutions across the supply chain ecosystem, including data onboarding, validation, transformation, and reporting
  • Lead the development of frameworks and tools for end-to-end process automation, integrating variety of systems and partners, and enabling rapid onboarding and extensibility
  • Design and build modular, reusable components for scenario modeling, simulation, analytics, and optimization
  • Drive governance, data quality, and compliance automation, ensuring robust stewardship, certification, and auditability across all supply chain data assets
  • Collaborate with cross-functional partners (Data Engineering, Data Science, Business Operations, external partners) to deliver unified, business-facing platforms and self-serve interfaces
  • Incorporate GenAI and advanced analytics into supply chain automation, leveraging AI/ML for predictive modeling, anomaly detection, and decision support
  • Set technical direction for integrating and automating workflows across Meta-native and external supply chain systems
  • Define and track success metrics: reduced manual effort, increased automation coverage, improved data quality, faster scenario analysis, and successful onboarding of new models and partners

Requirements:

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 8+ years of programming experience in a relevant language
  • Proven track record of planning multi-year roadmap in which short-term projects ladder to the long-term mission
  • Experience driving large cross-functional/industry-wide engineering efforts
  • Experience utilizing data and analysis to explain technical problems and provide detailed feedback and solutions
  • Experience communicating and working across functions to drive solutions
  • Experience mentoring/influencing executive stakeholders across organizations

Nice to have:

  • Extensive experience architecting and building large-scale automation frameworks (ideally in supply chain, infrastructure, or similar domains)
  • Expertise in distributed systems, data integration, and complex process automation
  • Extensive background in scenario modeling, simulation, and analytics
  • Experience with AI/ML (GenAI) for predictive analytics and optimization
  • Track record of delivering business-facing, self-serve automation tools
  • Experience with compliance, auditability, and data quality frameworks
  • Practical knowledge of how to drive automation and reduce manual processes across the ecosystem
What we offer:
  • bonus
  • equity
  • benefits

Additional Information:

Job Posted:
January 23, 2026

Employment Type:
Fulltime
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Software Engineer - Capacity Planning

Staff Software Engineer, Cloud Capacity

The Cloud Capacity team plays a critical role in ensuring the Temporal Cloud is ...
Location
Location
United States
Salary
Salary:
170000.00 - 250000.00 USD / Year
temporal.io Logo
Temporal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience contributing to large-scale infrastructure efforts spanning cloud compute, storage, and networking systems
  • Strong product and operational intuition around managing cloud costs, utilization tracking, and workload forecasting
  • A track record of designing distributed systems and services in a production cloud environment (preferably AWS, GCP, or Azure)
  • Hands-on experience with container orchestration technologies (e.g., Kubernetes) and the surrounding ecosystem
  • Exceptional collaboration and communication skills
  • Comfortable aligning cross-functional stakeholders on complex infrastructure problems, including executives and finance partners
  • 6+ years of experience building production software using Go, Java, or similar languages
Job Responsibility
Job Responsibility
  • Drive the technical vision and roadmap for Temporal’s Cloud Capacity systems in partnership with engineering and product leadership
  • Design and implement infrastructure to track resource utilization, forecast consumption, and support automated capacity planning at scale
  • Lead development of a resource manager that optimizes infrastructure efficiency based on usage trends, cost insights, and evolving customer needs
  • Collaborate cross-functionally with Product, Cloud Infrastructure, and Finance to inform business-critical decisions around provisioning, pricing, and scaling
  • Guide long-term strategy to support intelligent autoscaling, workload isolation, and predictable performance in a multi-tenant cloud environment
What we offer
What we offer
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $500 / Year Professional Memberships
  • Fulltime
Read More
Arrow Right

Associate Director, Software Engineering & DevSecOps

We are looking for a motivated and passionate Associate Director to help us driv...
Location
Location
United States , Irving, Texas
Salary
Salary:
Not provided
siriusxm.com Logo
SiriusXM
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS in Engineering, Computer Science, Information Systems, or other technically related field. Equivalent experience and/or degrees in other technical fields will be evaluated and considered
  • 8+ years of experience as a software developer with experience in multiple development languages and platforms delivering multiple commercially deployed products to market
  • 5+ years of cumulative software development leadership at a manager/senior manager level
  • Mentoring skills and competencies, with an ability to transfer knowledge to junior and senior members of the team
  • Proven ability to organize and manage priorities across multiple stakeholders while ensuring a sustainable pace of work
  • Proven ability to translate business needs into technology solutions
  • Proven ability to lead and work within geographically distributed engineering teams
  • Excellent communication skills, both written and oral
  • acts with professionalism both in person and when working on the phone with partners
  • Excellent analytical and problem-solving skills
Job Responsibility
Job Responsibility
  • Lead a team of 10+ engineers, utilizing best practices in agile software development, test automation and quality assurance, CICD processes, and Operational discipline
  • Lead architecture, design, code, and implementation review sessions with team
  • Lead/co-lead scrum rituals like stand-ups, sprint planning, retrospectives, and backlog grooming
  • Work closely with your peers, Product Managers, and Product Owners to develop strategic vision for your components, clarify goals, deliver on software roadmaps, and prioritize effectively balancing technical debt vs. new functionality
  • Provide leadership, capacity planning, activity planning and direction to complete team tasks, produce the required deliverables, track/resolve issues, and meet project milestones
  • Establish and implement an overall DevSecOps strategy and roadmap, aligning it with business objectives, and promoting a shift-left approach to security
  • Conduct regular security assessments, identifying and mitigating potential security risks, and coordinate vulnerability testing
  • Monitor and analyze Production incidents (security, performance, outage) and implement incident response and recovery plans
  • Participate in an Incident Management on-call rotation
  • Grow and cultivate a culture of accountability, security awareness, collaboration, innovation, and continuous improvement
  • Fulltime
Read More
Arrow Right

Software Engineer

At Intercom, you will be a product engineer - someone who solves real customer p...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of industry experience in a software engineering role, preferably building a SaaS product
  • Deep knowledge of a high-level programming language (for example, Ruby, Python, Javascript etc.)
  • Experience collaborating directly with product teams and designers, and a proven track record of delivering value to customers or users
  • Experience with Distributed systems
Job Responsibility
Job Responsibility
  • Develop technical plans and contribute to our technical architecture as we scale our products
  • Write Ruby code, which knits together a lot of AWS, infrastructure, platform and SaaS technologies
  • Ship a change to production on your first day and a feature in your first week
  • Build using the best tools in the industry
  • Grow your team’s capacity by mentoring other engineers and interviewing candidates
What we offer
What we offer
  • Competitive salary, annual bonus and equity
  • Regular compensation reviews
  • Generous paid time off above statutory minimum
  • Hybrid working
  • MacBooks are our standard, but we also offer Windows for certain roles when needed
  • Fun events for Intercomrades, friends, and family
Read More
Arrow Right

Senior Software Engineer

At Intercom, you will be a Senior Software Engineer - someone who solves real cu...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
intercom.com Logo
Intercom
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of industry experience in a software engineering role, preferably building a SaaS product
  • Deep knowledge of a high-level programming language (for example, Ruby, Python, Javascript etc.)
  • Experience collaborating directly with product teams and designers, and a proven track record of delivering value to customers or users
  • Experience with Distributed systems
Job Responsibility
Job Responsibility
  • Develop technical plans and contribute to our technical architecture as we scale our products to serve tens of millions of people every day
  • Write Ruby code, which knits together a lot of AWS, infrastructure, platform and SaaS technologies that form the core of Intercom’s backend infrastructure
  • Ship a change to production on your first day and a feature in your first week
  • Build using the best tools in the industry
  • Grow your team’s capacity by mentoring other engineers and interviewing candidates
What we offer
What we offer
  • Competitive salary and equity in a fast-growing start-up
  • We serve lunch every weekday, plus a variety of snack foods and a fully stocked kitchen
  • Regular compensation reviews - we reward great work
  • Pension scheme & match up to 4%
  • Peace of mind with life assurance, as well as comprehensive health and dental insurance for you and your dependents
  • Flexible paid time off policy
  • Paid maternity leave, as well as 6 weeks paternity leave for fathers
  • Cycle-to-Work Scheme
  • MacBooks are our standard, but we also offer Windows for certain roles when needed
  • Fulltime
Read More
Arrow Right

Software Engineer

As a Site Reliability Engineer (SRE) you will actively work to improve the perfo...
Location
Location
United States
Salary
Salary:
116700.00 - 187400.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong scripting experience
  • Serious troubleshooting skills across different levels of the stack
  • Engage in capacity planning, demand forecasting, software performance analysis, and systems tuning
  • Experience configuring and managing enterprise monitoring solutions
  • Understanding of Linux systems
  • Building, automating, and maintaining infrastructure in Amazon Web Services
  • Maintaining a high standard of code quality
Job Responsibility
Job Responsibility
  • Improve the performance and reliability of services
  • Address root causes of incidents and reduce incident rates
  • Deep dive into the services we support and own the problem and the corresponding solution
  • Automate away repetitive work
  • Respond to pings, pages, and alerts to investigate issues in our systems
  • Serve in an on-call weekly rotation to make sure our products meet established SLAs
What we offer
What we offer
  • Health coverage
  • Paid volunteer days
  • Wellness resources
  • Fulltime
Read More
Arrow Right

Software Engineer, Site Reliability

As a Site Reliability Engineer (SRE) at Fireworks AI, you will play a critical r...
Location
Location
United States , San Mateo
Salary
Salary:
Not provided
fireworks.ai Logo
Fireworks AI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, related technical field, or equivalent practical experience
  • 5+ years of experience in Site Reliability Engineering, DevOps, or a similar role focused on large-scale production systems
  • Deep expertise in SRE principles and practices, including SLOs, SLIs, operational automation, incident management, and post-mortems
  • Extensive hands-on experience with public cloud platforms (AWS, GCP, Azure), including compute, networking, storage, and database services
  • Strong experience with containerization technologies (Docker) and orchestration platforms (Kubernetes)
  • Proficiency in designing and implementing robust monitoring, logging, and alerting systems using tools like Prometheus, Grafana, ELK stack, and distributed tracing
  • Solid programming/scripting skills in at least one language (e.g., Python, Go) for automation and tool development
  • In-depth knowledge of Linux operating systems, networking fundamentals, and system debugging
  • Proven ability to troubleshoot complex issues across the entire stack
  • Excellent communication, collaboration, and problem-solving skills
Job Responsibility
Job Responsibility
  • Ensuring System Reliability: Ensure systems are designed and implemented with high availability, scalability, and performance. Focus on fault tolerance, disaster recovery, identifying and removing scaling bottlenecks, and performance optimization across our multi-cloud infrastructure
  • Incident Management & Response: Lead efforts in incident detection, response, and resolution for critical production issues. Drive post-mortems to identify root causes and implement preventative measures to improve system reliability
  • Observability & Monitoring: Develop, implement, and maintain comprehensive monitoring, alerting, logging, and tracing solutions to provide deep insights into system health and performance
  • Automation & Toil Reduction: Identify and automate repetitive operational tasks to reduce toil and improve operational efficiency. Develop tools and scripts to streamline deployments, scaling, and system management
  • Capacity Planning & Performance Tuning: Work proactively on capacity planning to ensure our infrastructure can gracefully handle growth and peak loads. Optimize system performance and resource utilization
  • Reliability Best Practices: Collaborate with software engineers to embed reliability principles (e.g., SLOs, SLIs, error budgets) into the development lifecycle, promoting a culture of operational excellence
  • On-call Rotation: Participate in a periodic on-call rotation to support our production environment and respond to critical alerts
  • Fulltime
Read More
Arrow Right

Software Engineer (DevOps profile)

We are looking for experienced DevOps engineers to join our Data Center team and...
Location
Location
Poland , Gdańsk
Salary
Salary:
Not provided
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience with Python web applications (2+ years) and practical knowledge of Python
  • Design, delivery and operation of large-scale, AWS cloud-native infrastructure solutions (3+ years)
  • Automation and Infrastructure-as-code projects and tooling (Ansible, Terraform, 2+ years)
  • Experience with Kubernetes and Helm (2+ years)
  • Solid expertise in Git
  • Incident response and management in on-call rotation
  • Hands-on experience resolving production problems
  • Design CI/CD pipelines (for example, Bamboo, Jenkins)
  • Excellent scripting skills (Python, Bash)
  • Basic Linux administration
Job Responsibility
Job Responsibility
  • Build software to enhance the availability, performance and stability of the 'Instant Environments' cloud computing service, as well as automating away repetitive work
  • Develop new ideas to improve the efficiency of a Data Center engineer
  • You'll work on non-production and production environments, monitoring, data collection and configuration management, as well as disaster recovery planning, capacity engineering, reliability improvement projects and platform automation
What we offer
What we offer
  • health coverage
  • paid volunteer days
  • wellness resources
  • Fulltime
Read More
Arrow Right

Senior Software Engineer II

Axon’s Real Time Operations (RTO) division builds situational awareness software...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
axon.com Logo
Axon
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years building and operating distributed systems at scale
  • proven track record of owning critical paths and SLOs
  • Deep expertise in control planes, service discovery, orchestration, partitioning/placement, and consistency models (CAP tradeoffs, CRDTs, leader/follower, quorum)
  • Strong coding in Go (also acceptable: Java/Rust)
  • design-first mindset, profiling and performance tuning (allocations, tail latency, lock contention)
  • Cloud-native foundations: Kubernetes, containers, service mesh (Istio/Envoy), gRPC/HTTP/2, backpressure and circuit-breaking patterns
  • Streaming/eventing: Kafka/NATS/Pub-Sub, schema evolution (Protobuf/Avro), idempotency keys, and exactly-once vs at-least-once tradeoffs
  • Security: mTLS, OAuth/OIDC, JWT, x.509, HSM/KMS, structured threat modeling and mitigation
Job Responsibility
Job Responsibility
  • Own control-plane architecture for multi-tenant, planet-scale IoT fleets: device provisioning and lifecycle, device identity & PKI, configuration/state management (twin/shadow), command & control, policy/RBAC enforcement, OTA updates and rollout strategies, and authoritative device state
  • Drive reliability, safety, and security-by-design: zero-trust defaults, mutual TLS, certificate rotation at scale, least-privilege key management (HSM/KMS), robust secrets hygiene, threat modeling, and defense-in-depth for multi-tenancy
  • Lead cross-org technical strategy: set engineering standards (APIs, versioning, deprecation, rollout, testing), create long-range roadmaps, and mentor/level-up senior engineers across cloud and device teams
  • Partner with device teams on transport and protocol choices, schema and API contracts, edge–cloud sync models, staged rollouts, failure injection, and field-safe rollback
  • Establish end-to-end observability (metrics, tracing, structured/audit logs), actionable dashboards, incident response runbooks, and capacity planning with empirical load testing and cost guardrails
What we offer
What we offer
  • Competitive Base Salary
  • Annual Bonus and Restricted Stock Unit Eligibility
  • Comprehensive Pension Plan with Matching Contribution
  • 30 days paid holiday in addition to UK public holidays
  • Enhanced Maternity and Paternity Leave for all employees
  • Private Health Insurance
  • Cash Plan including Dental, Optician and Therapeutic Treatment Plans
  • GymPass Subscription
  • Life assurance (x4 Annual Salary)
  • Group income Protection
  • Fulltime
Read More
Arrow Right