Software Engineer 2 - Capacity Optimization

Microsoft Corporation

Location:
Serbia, Belgrade

Contract Type:
Not provided

Salary:
Not provided

Job Description:

Azure is Microsoft’s central cloud infrastructure that supports public cloud services and many Microsoft-internal cloud scale systems. Cloud computing is a competitive and rapidly expanding industry, and Azure aims to lead across all key areas of its platform and services. Within Azure, the Azure Compute team provides core infrastructure capabilities for hosting virtual machines, containers, and other workloads.

A foundational discipline in cloud computing is capacity management. Effective capacity management ensures that all regions, allocation domains, and hardware platforms have the resources needed to meet customer demand, while also preventing unnecessary spending and reducing cost of goods sold (COGS) and capital expenditures (CAPEX). At Azure’s scale, balancing these priorities across the entire Azure Compute fleet is highly complex, and improvements can prevent allocation issues while enabling significant cost savings.

The Azure Compute Capacity and Efficiency team, also known as AC2E, is responsible for end-to-end capacity and efficiency management across the fleet. The team builds a fully automated, optimized tracking and management system, with the Capacity Management Automation System (CMAS) as a core component. These systems use advanced algorithms and apply artificial intelligence to predict capacity risks and trigger appropriate mitigation actions within the Azure Compute platform. Team members work across engineering, program management, and data science to define business problems, design solutions, and contribute to strategic decisions that influence Azure Compute’s capacity and efficiency.

Job Responsibility:

  • Design new tools and processes to enable better data modeling, analysis, and experimentation for capacity across Azure
  • Understand platform capacity constraints and work with teams across Azure to improve capacity manageability and efficiency
  • Build models, simulations, scalable and automated analytical systems and data mining frameworks to derive profound insights into the Azure Compute platform and its efficiency and capacity
  • Drive improvements to the product design and architecture, leading to increased customer satisfaction
  • Lead and collaborate with experts from across the company to advance capacity management, capacity planning, and efficiency
  • Contribute to the team culture and apply best practices in your day-to-day work

Requirements:

  • BS in Computer Science or equivalent
  • 2+ years of hands-on industry experience in software development, working on cloud infrastructure-related problems, with impact on critical product and business decisions
  • Azure Cloud Services development experience, or related
  • Programming skills, especially in languages used with data technologies (Python, Perl, Java, C#, etc.)
  • Proficiency with relational databases (Kusto, SQL or similar)
  • Good understanding of a modern state-of-the-art cloud platform, and related technologies
  • A proven track record of collaborating across organizational boundaries and delivering great results
  • Comfortable working across the boundary between data science and software engineering

Nice to have:

  • Master's Degree in Computer Science or related field
  • 1+ years software development experience or equivalent experience
  • Experience with globally distributed cloud systems, with a focus on quality and scalability
  • Experience working across the boundary between data science and software development

Additional Information:

Job Posted:
January 05, 2026

Employment Type:
Fulltime
Work Type:
Remote work

Similar Jobs for Software Engineer 2 - Capacity Optimization

Site Reliability Engineer 2

Join us. At PagerDuty, you'll tackle complex problems, collaborate with kind and...
Location:
Portugal, Lisbon
Salary:
Not provided
PagerDuty
Expiration Date:
Until further notice
Requirements:
  • 3+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles
  • Experience with Kubernetes and container orchestration
  • Experience working on cloud-native infrastructure (e.g. AWS, GCP, Azure)
  • Proficiency in at least one programming language (e.g. Python, Ruby, Go, etc.)
  • Experience with Infrastructure as Code (e.g. Terraform, CloudFormation)
Job Responsibility:
  • Deploy, configure, monitor and optimize highly available Kubernetes clusters on AWS/EKS
  • Help maintain the overall health of the platform, including triaging and troubleshooting production issues, monitoring system capacity, and working with other technical teams to ensure adherence to compliance and security best practices
  • Continuously strive to improve the internal developer experience and the software development lifecycle
  • Stay current on technical trends to suggest innovative tools and approaches to interesting problems
  • Participate in a 24/7 on-call rotation
What we offer:
  • Competitive salary
  • Comprehensive benefits package from day one
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime

Site Reliability Engineer

Corporate Tools is looking for a Site Reliability Engineer. You will be a tradit...
Location:
United States
Salary:
175000.00 USD / Year
Corporate Tools
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Software Engineering, or equivalent practical experience
  • 5+ years of experience in software engineering
  • 2+ years of experience in site reliability engineering, DevOps, or infrastructure engineering roles
  • Deep experience with cloud platforms (AWS, Azure, or GCP) and infrastructure as code tools such as Terraform, CloudFormation, or Pulumi
  • Strong proficiency with Kubernetes, Docker, and container orchestration in production environments
  • Hands-on experience with observability and monitoring tools like Prometheus, Grafana, OpenTelemetry, Sentry, or New Relic
  • Proven ability to design and implement highly available, fault-tolerant systems and lead proactive incident response efforts
  • Experience with performance tuning, database optimization, and caching strategies (e.g., PostgreSQL, Redis, Memcached)
  • Demonstrated ability to drive reliability improvements, reduce operational toil, and foster a culture of resilience and continuous improvement
  • Experience leading reliability-focused initiatives such as post-incident reviews, capacity planning, and root cause analysis
Job Responsibility:
  • Stop problems before they start
  • Fix issues quickly and learn from them
  • Help keep systems steady, secure, and running
  • Work closely with DevOps engineers to build out tools and automation
  • Take ownership
What we offer:
  • 100% employer-paid medical, dental and vision for employees
  • Annual review with raise option
  • 22 days Paid Time Off accrued annually, and 4 holidays
  • After 3 years, PTO increases to 29 days
  • Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
  • Paid Parental Leave
  • Up to 6% company matching 401(k) with no vesting period
  • Quarterly allowance
  • Open concept office with friendly coworkers
  • Creative environment where you can make a difference
  • Fulltime

Senior Software Engineer (AI Applications)

Cambium Assessment Inc is seeking a Senior Software Engineer to develop AI appli...
Location:
United States of America
Salary:
120000.00 - 180000.00 USD / Year
EdTech Jobs
Expiration Date:
Until further notice
Requirements:
  • 5+ years in software engineering, including 2+ years in a Senior/Staff-level capacity leading significant technical initiatives
  • Proven, hands-on experience building complex agentic systems, including proficiency with tool calling, advanced prompt optimization, and comprehensive context engineering
  • Deep, practical experience integrating with LLMs, coupled with strong skills in prompt engineering and Retrieval-Augmented Generation (RAG) pipelines
  • Strong expertise in at least two of the following programming languages: Python, JavaScript/TypeScript, or C#
  • Experience deploying and managing systems on at least one of the major cloud platforms (AWS, GCP, or Azure), utilizing containerization and implementing robust observability tooling
  • Proven experience building and deploying robust, high-throughput full stack solutions including backend services/APIs that integrate with AI models and orchestrate complex workflows
  • Familiarity with vector databases and embedding models
  • Demonstrated experience designing secure systems capable of safely handling untrusted code execution or complex user-generated content
  • Excellent communication, collaboration skills, and the ability to autonomously drive technical initiatives from initial prototype through to high-scale production
Job Responsibility:
  • Design, build, and deploy robust Generative AI agents featuring advanced capabilities like persistent memory, shared state management, and complex multi-step reasoning workflows specifically tailored for real-world educational applications
  • Drive the development of cutting-edge, AI-enabled web based educational software systems that redefine user interaction and learning outcomes
  • Seamlessly incorporate Generative AI features into existing platforms, focusing on optimizing performance, ensuring a seamless UX, and maintaining strict safety compliance
  • Expertly leverage Large Language Models (LLMs) and multimodal models to deliver highly intelligent, context-aware user experiences
  • Establish and maintain frameworks for comprehensive agent evaluation, including rigorous testing, critical safety assessments, and detailed performance monitoring
  • Partner closely with other teams to translate complex user needs and learning workflows into scalable, production-ready agentic capabilities
  • Act as a subject matter expert, guiding engineers on advanced agent design patterns, optimal LLM integration strategies, and best practices for building secure, autonomous systems
  • Contribute to a mission-driven environment focused on innovation, measurable impact, and the ethical, responsible development and deployment of AI enabled products
What we offer:
  • Remote First Work Environment
  • Reimbursement to help cover the cost of setting up your home or remote office
  • Fulltime

Supply Chain Capacity Engineer

Meta is seeking a Planning & Capacity Engineer to join the Infrastructure Supply...
Location:
United States, Fremont
Salary:
117000.00 - 181000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Currently has, or is in the process of obtaining a Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience. Degree must be completed prior to joining Meta
  • 4+ years of experience in performance or software engineering and/or optimization pertinent data science
  • 4+ years of experience in designing and implementing models and optimization algorithms
  • 2+ years of experience in coding/scripting languages such as Python, R, Java, C, C++, PHP
  • 2+ years of experience in supply chain management - planning, manufacturing, operations, inventory, etc
  • Experience working with distributed systems at scale
  • Experience in supply chain planning optimization
  • Experience in infrastructure operations and technical infrastructure knowledge
  • Experience working with cross-functional teams
  • Experience managing ambiguity
Job Responsibility:
  • Own infrastructure supply chain planning for Meta (as part of Meta’s overall capacity plan): including servers, network, and data center equipment from components to finished goods
  • Design improvements to software systems to improve supply chain planning efficiency and quality, partner with software engineers to implement and contribute code yourself
  • Develop and analyze business and technical data and scenarios to drive executive decisions around infrastructure supply and spend
  • Contribute to end-to-end supply chain planning processes (as part of Meta’s overall capacity plan), methodologies, and data to deliver an executable and optimized supply plan
  • Manage and resolve critical escalations and exceptions in all areas of the supply chain
  • Critique demand signals curated by upstream teams, and build an anticipatory demand view for ISCE through mathematical modeling and business judgement
  • Influence and interweave technology strategy and transitions into planning models
  • Develop and code models that drive supply flexibility and optionality to best balance delivery of required capacity, cost, and risk
  • Build mathematical models to perform simulation and optimization studies of demand projections, scenario planning, and gap closure analysis while balancing constraints, product mixes and inventory costs
  • Work cross-functionally to define problem statements, collect data, build analytical models and make recommendations to drive change and optimization at the most strategic levels
What we offer:
  • bonus
  • equity
  • benefits

Engineering Leader

We’re seeking an experienced Engineering Leader to join a growing Data Strategy,...
Salary:
Not provided
Capstone IT Staffing
Expiration Date:
Until further notice
Requirements:
  • 7+ years of experience in software or data engineering
  • 2+ years in a technical leadership or team-lead capacity
  • Strong hands-on understanding of modern data platform technologies, including Snowflake, SQL, Python, and ETL frameworks (dbt, Airflow, or similar)
  • Experience in building or maintaining BI/analytics solutions (Power BI, Tableau, Looker, or similar)
  • Proven ability to manage multiple concurrent initiatives, balancing strategic oversight with tactical execution
  • Familiarity with data architecture, data pipelines, and data governance principles in an enterprise context
  • Excellent communication and stakeholder management skills
  • Demonstrated ability to lead through influence and foster a high-performing, collaborative team culture in a remote setting
Job Responsibility:
  • Lead and mentor a small cross-functional engineering team (data engineers, analysts, BI developers) delivering multiple concurrent projects that serve enterprise data and analytics needs
  • Drive end-to-end project delivery — from problem definition and design through execution, testing, and delivery — ensuring high standards of quality, performance, and scalability
  • Engage in deep technical and design discussions, guiding solution approaches and removing blockers across data ingestion, transformation, and visualization layers
  • Oversee and contribute to projects such as:
      • Building and optimizing data pipelines and ingestion frameworks (including integrations with Snowflake)
      • Developing and maintaining BI dashboards and analytics solutions for business and executive stakeholders
      • Sunsetting and modernizing legacy data lakes, ETL pipelines, and reporting systems
      • Supporting ad-hoc data engineering and analytics initiatives from senior leadership or the Data Science organization
  • Collaborate with senior managers and data strategy leaders to align team objectives with overall enterprise data goals
  • Establish and reinforce engineering best practices in code quality, CI/CD, data modeling, documentation, and operational excellence
  • Act as a bridge between technical contributors and business stakeholders — translating complex requirements into scalable, maintainable solutions

Senior Software Engineer, Manager – AWS Developer

We’re looking for a Senior Software Engineer, Manager (AWS Developer) to lead an...
Location:
United States, San Diego
Salary:
Not provided
ResMed
Expiration Date:
Until further notice
Requirements:
  • 5+ years of professional software development experience
  • Substantial work on AWS-based production systems
  • 2+ years in a tech lead and/or engineering management capacity
  • Strong proficiency in Python with deep understanding of object-oriented design, clean code principles, and design patterns
  • Expertise with AWS services and cloud-native architectures (e.g., Lambda, API Gateway, DynamoDB, S3, SQS/SNS, EventBridge, CloudWatch, CloudFront, RDS/Aurora, IAM)
  • Solid experience with infrastructure-as-code (e.g., Terraform, CloudFormation, CDK) and managing multi-environment deployments at scale
  • Strong grasp of RESTful API design, authentication/authorization mechanisms (OAuth2, JWT), and microservices / event-driven architectures
  • Practical experience designing and optimizing data models for both NoSQL (e.g., DynamoDB, MongoDB) and relational databases (e.g., PostgreSQL, MySQL)
  • Proven track record implementing and improving DevOps practices: CI/CD (e.g., GitHub Actions, CodePipeline), Git workflows, Docker, and observability (CloudWatch, Datadog or similar)
  • Deep understanding of testing strategies (unit, integration, contract, and end-to-end) and how to embed them into pipelines and team workflows
Job Responsibility:
  • Lead, mentor, and grow a team of software engineers, providing regular feedback, coaching, and career development
  • Act as a Senior Software Engineer on the team: contribute to architecture, write and review code
  • Own the technical direction and architecture for key services and features on AWS
  • Partner with product, design, and other stakeholders to define roadmaps, break down complex problems, and deliver high-impact solutions
  • Oversee the design, development, testing, and operation of cloud-native systems
  • Establish and enforce high standards for code quality, testing, observability, and documentation
  • Guide and improve CI/CD pipelines, deployment strategies, and operational practices
  • Collaborate with other engineering leaders to shape platform-wide architecture, shared services, and common patterns for AWS usage
  • Drive effective incident management and post-incident reviews
  • Contribute to hiring and onboarding by participating in interviews, defining role expectations
  • Fulltime

Principal Data Infrastructure Engineer

As Microsoft continues to push the boundaries of AI, we are on the lookout for p...
Location:
United States, Redmond
Salary:
139900.00 - 274800.00 USD / Year
Microsoft Corporation
Expiration Date:
Until further notice
Requirements:
  • Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 4+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND 6+ years experience in business analytics, data science, software development, data modeling, or data engineering
  • OR equivalent experience
  • 4+ years in Big Data Infrastructure, DevOps, SRE, or Platform Engineering
  • 3+ years of hands-on experience managing and scaling distributed systems—from bare-metal to cloud-native environments
  • 2+ years deploying containerized applications using Kubernetes and Helm/Kustomize
  • Solid scripting and automation skills using Python, Bash, or PowerShell
  • Proven success in CI/CD pipeline management, release automation, and production troubleshooting
  • Experience working with Databricks for scalable data processing and analytics
  • Familiarity with security practices in infrastructure environments, including IAM, OAuth, and Kerberos administration
Job Responsibility:
  • Architect and maintain scalable, reliable, and observable Big Data Infrastructure for mission-critical AI applications
  • Champion DevOps and SRE best practices—automated deployments, service monitoring, and incident response
  • Build a self-service big data platform that empowers data and platform engineers and researchers
  • Develop robust CI/CD pipelines and automate infrastructure provisioning using Infrastructure as Code tools (Bicep, Terraform, ARM)
  • Collaborate with Data Engineers, Data Scientists, AI Researchers, and Developers to deliver secure, seamless big data workflows
  • Lead technical design reviews and uphold a clean, secure, and well-documented codebase
  • Proactively identify and resolve bottlenecks in data pipelines and infrastructure
  • Optimize system performance across storage, compute, and analytics layers
  • Partner with Security teams to enhance system security (IAM, OAuth, Kerberos)
  • Embody and promote Microsoft’s values: Respect, Integrity, Accountability, and Inclusion
  • Fulltime

Network Modeling and Optimization Engineer

Meta's global network comprised of cutting-edge platforms, is looking for a Netw...
Location:
United States, Menlo Park
Salary:
154000.00 - 217000.00 USD / Year
Meta
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Experience using concepts of operations research, stochastic optimization, machine learning, queuing theory, probability theory to construct models for solving network optimization problems
  • Experience creating formulations using commercial mathematical optimization software such as Xpress, Gurobi, CPLEX, and similar optimization tools
  • 2+ years of experience coding in higher-level languages (e.g., Python, C++, Go, etc.) coupled with experience creating models for optimization
Job Responsibility:
  • Work with various teams to understand Meta's network, user base, performance constraints, and growth requirements
  • Create a modeling framework for various networking problems such as cross-layer optimization under constraints such as latency/availability, demand uncertainty, risk assessment, and data center optimization
  • Analyze data from a large number of data sources to create a network strategy for capacities, locations, and facilities
  • Work with procurement and other teams to devise strategies on hardware and network acquisitions around the globe
  • Own the design, development, testing, and tuning of future capacity and topology models
What we offer:
  • bonus
  • equity
  • benefits