CrawlJobs Logo

Software Engineer 2 - Capacity Optimization

https://www.microsoft.com/ Logo

Microsoft Corporation

Location Icon

Location:
Serbia , Belgrade

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Azure is Microsoft’s central cloud infrastructure that supports public cloud services and many Microsoft-internal cloud scale systems. Cloud computing is a competitive and rapidly expanding industry, and Azure aims to lead across all key areas of its platform and services. Within Azure, the Azure Compute team provides core infrastructure capabilities for hosting virtual machines, containers, and other workloads. A foundational discipline in cloud computing is capacity management. Effective capacity management ensures that all regions, allocation domains, and hardware platforms have the resources needed to meet customer demand, while also preventing unnecessary spending and reducing cost of goods sold (COGS) and capital expenditures (CAPEX). At Azure’s scale, balancing these priorities across the entire Azure Compute fleet is highly complex, and improvements can prevent allocation issues while enabling significant cost savings. The Azure Compute Capacity and Efficiency team, also known as AC2E, is responsible for end-to-end capacity and efficiency management across the fleet. The team builds a fully automated, optimized tracking and management system, with the Capacity Management Automation System (CMAS) as a core component. These systems use advanced algorithms and apply artificial intelligence to predict capacity risks and trigger appropriate mitigation actions within the Azure Compute platform. Team members work across engineering, program management, and data science to define business problems, design solutions, and contribute to strategic decisions that influence Azure Compute’s capacity and efficiency.

Job Responsibility:

  • Design new tools and processes to enable better data modeling, analysis, and experimentation for capacity across Azure
  • Understand platform capacity constraints and work with teams across Azure to improve capacity manageability and efficiency
  • Build models, simulations, scalable and automated analytical systems and data mining frameworks to derive profound insights into the Azure Compute platform and its efficiency and capacity
  • Drive improvements to the product design and architecture, leading to increased customer satisfaction
  • Lead and collaborate with experts from across the company to advance capacity management, capacity planning, and efficiency
  • Contribute to the team culture and apply best practices in your day to day work

Requirements:

  • BS in Computer Science or equivalent
  • 2+ years of software development hands-on industry experience working on cloud infrastructure-related problems, with impact on critical product and business decisions
  • Azure Cloud Services development experience, or related
  • Programming skills (esp. related to data technologies like Python, PERL, Java, C#, etc.)
  • Proficiency with relational databases (Kusto, SQL or similar)
  • Good understanding of a modern state-of-the-art cloud platform, and related technologies
  • A proven track record of collaborating across organizational boundaries and delivering great results
  • Comfortable to work across the boundary between data science and software engineering

Nice to have:

  • Master's Degree in Computer Science or related field
  • 1+ years software development experience or equivalent experience
  • Experience with Globally Distributed cloud systems with focus on quality and scalability
  • Experience with working across data science and software development boundary

Additional Information:

Job Posted:
January 05, 2026

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Software Engineer 2 - Capacity Optimization

Senior Software Engineer - Developer Experience and Automation

Senior Software Engineer for Developer Experience Tooling and Automation who wil...
Location
Location
United States
Salary
Salary:
83430.00 - 203940.00 USD / Year
https://www.cvshealth.com/ Logo
CVS Health
Expiration Date
January 30, 2026
Flip Icon
Requirements
Requirements
  • 5+ years of overall experience in Python
  • Experience in setting up and optimizing efficient data stores (RDBMS/NoSQL) for production
  • 3+ years of overall backend development experience on enterprise-class applications
  • 3+ years partnering with architecture, product, and program management teams to influence product development decisions
  • 3+ years of experience working on projects using mature CI/CD practices, source control such as Git, and automated testing
  • 2+ years of experience working with large public cloud technologies (e.g., GCP, AWS, Azure)
  • Experience with Prompt engineering: Ability to build and craft prompts that evoke desired responses from LLMs
  • Experience in team lead / technical lead capacity that follows a Scrum/Agile development methodology
  • Bachelor's degree or equivalent experience (HS diploma + 4 years relevant experience)
Job Responsibility
Job Responsibility
  • Build shared internal libraries, tools, and processes that enable teams across CVS Health to efficiently build, test, preview, deploy, and operate systems
  • Collaborate with various teams across CVS to influence the technical direction of front-end web development
  • Build APIs, CLI tools, out-of-the-box automation tools using CVS Health approved tools, LLMs and Machine Learning algorithms
  • Build, optimize, fine-tune Generative AI/LLM models to transform experience into solutions and deploy them
  • Work closely with data scientists, ML engineers, software developers, and business stakeholders to translate AI research into practical, deployable solutions
  • Lead the prototyping and experimentation with new generative models, optimizing them for specific use cases
  • Act as a technical leader across all parts of the CVS Health Infrastructure engineering team
  • Develop clear, concise, and clean code in any language (mostly in Python)
  • Collaborate with architecture and engineering teams to standardize how we can enhance the experience
  • Stay aligned with the latest developments in cloud-native and ML ops/engineering
What we offer
What we offer
  • Affordable medical plan options
  • 401(k) plan with matching company contributions
  • Employee stock purchase plan
  • No-cost wellness screenings
  • Tobacco cessation and weight management programs
  • Confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer 2

Join us. At PagerDuty, you'll tackle complex problems, collaborate with kind and...
Location
Location
Portugal , Lisbon
Salary
Salary:
Not provided
https://www.pagerduty.com Logo
PagerDuty
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles
  • Experience with Kubernetes and container orchestration
  • Experience working on cloud-native infrastructure (e.g. AWS, GCP, Azure)
  • Proficiency in at least one programming language (e.g. Python, Ruby, Go, etc.)
  • Experience with Infrastructure as Code, (e.g. Terraform, Cloudformation)
Job Responsibility
Job Responsibility
  • Deploy, configure, monitor and optimize highly available Kubernetes clusters on AWS/EKS
  • Help maintain the overall health of the platform, including triaging and troubleshooting production issues, monitoring system capacity, and working with other technical teams to ensure adherence to compliance and security best practices
  • Continuously strive to improve the internal developer experience and the software development lifecycle
  • Stay current on technical trends to suggest innovative tools and approaches to interesting problems
  • Participate in a 24/7 on-call rotation
What we offer
What we offer
  • Competitive salary
  • Comprehensive benefits package from day one
  • Flexible work arrangements
  • Company equity
  • ESPP (Employee Stock Purchase Program)
  • Retirement or pension plan
  • Generous paid vacation time
  • Paid holidays and sick leave
  • Dutonian Wellness Days & HibernationDuty - companywide paid days off in addition to PTO
  • Paid parental leave: 22 weeks for pregnant parent, 12 weeks for non-pregnant parent
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer

Corporate Tools is looking for a Site Reliability Engineer. You will be a tradit...
Location
Location
United States
Salary
Salary:
175000.00 USD / Year
corporatetools.com Logo
Corporate Tools
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Software Engineering, or equivalent practical experience
  • 5+ years of experience in software engineering
  • 2+ years of experience in site reliability engineering, DevOps, or infrastructure engineering roles
  • Deep experience with cloud platforms (AWS, Azure, or GCP) and infrastructure as code tools such as Terraform, CloudFormation, or Pulumi
  • Strong proficiency with Kubernetes, Docker, and container orchestration in production environments
  • Hands-on experience with observability and monitoring tools like Prometheus, Grafana, OpenTelemetry, Sentry, or New Relic
  • Proven ability to design and implement highly available, fault-tolerant systems and lead proactive incident response efforts
  • Experience with performance tuning, database optimization, and caching strategies (e.g., PostgreSQL, Redis, Memcached)
  • Demonstrated ability to drive reliability improvements, reduce operational toil, and foster a culture of resilience and continuous improvement
  • Experience leading reliability-focused initiatives such as post-incident reviews, capacity planning, and root cause analysis
Job Responsibility
Job Responsibility
  • Stop problems before they start
  • Fix issues quickly and learn from them
  • Help keep systems steady, secure, and running
  • Work closely with DevOps engineers to build out tools and automation
  • Take ownership
What we offer
What we offer
  • 100% employer-paid medical, dental and vision for employees
  • Annual review with raise option
  • 22 days Paid Time Off accrued annually, and 4 holidays
  • After 3 years, PTO increases to 29 days
  • Employees transition to flexible time off after 5 years with the company—not accrued, not capped, take time off when you want
  • Paid Parental Leave
  • Up to 6% company matching 401(k) with no vesting period
  • Quarterly allowance
  • Open concept office with friendly coworkers
  • Creative environment where you can make a difference
  • Fulltime
Read More
Arrow Right
New

Senior Software Engineer (AI Applications)

Cambium Assessment Inc is seeking a Senior Software Engineer to develop AI appli...
Location
Location
United States of America
Salary
Salary:
120000.00 - 180000.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years in software engineering, including 2+ years in a Senior/Staff-level capacity leading significant technical initiatives
  • Proven, hands-on experience building complex agentic systems, including proficiency with tool calling, advanced prompt optimization, and comprehensive context engineering
  • Deep, practical experience integrating with LLMs, coupled with strong skills in prompt engineering and Retrieval-Augmented Generation (RAG) pipelines
  • Strong expertise in at least two of the following programming languages: Python, JavaScript/TypeScript, or C#
  • Experience deploying and managing systems on at least one of the major cloud platforms (AWS, GCP, or Azure), utilizing containerization and implementing robust observability tooling
  • Proven experience building and deploying robust, high-throughput full stack solutions including backend services/APIs that integrate with AI models and orchestrate complex workflows
  • Familiarity with vector databases and embedding models
  • Demonstrated experience designing secure systems capable of safely handling untrusted code execution or complex user-generated content
  • Excellent communication, collaboration skills, and the ability to autonomously drive technical initiatives from initial prototype through to high-scale production
Job Responsibility
Job Responsibility
  • Design, build, and deploy robust Generative AI agents featuring advanced capabilities like persistent memory, shared state management, and complex multi-step reasoning workflows specifically tailored for real-world educational applications
  • Drive the development of cutting-edge, AI-enabled web based educational software systems that redefine user interaction and learning outcomes
  • Seamlessly incorporate Generative AI features into existing platforms, focusing on optimizing performance, ensuring a seamless UX, and maintaining strict safety compliance
  • Expertly leverage Large Language Models (LLMs) and multimodal models to deliver highly intelligent, context-aware user experiences
  • Establish and maintain frameworks for comprehensive agent evaluation, including rigorous testing, critical safety assessments, and detailed performance monitoring
  • Partner closely with other teams to translate complex user needs and learning workflows into scalable, production-ready agentic capabilities
  • Act as a subject matter expert, guiding engineers on advanced agent design patterns, optimal LLM integration strategies, and best practices for building secure, autonomous systems
  • Contribute to a mission-driven environment focused on innovation, measurable impact, and the ethical, responsible development and deployment of AI enabled products
What we offer
What we offer
  • Remote First Work Environment
  • Reimbursement to help cover the cost of setting up your home or remote office
  • Fulltime
Read More
Arrow Right

Sr. Manager- Data Engineering

Adtalem is a data driven organization. The Data Engineering team builds data sol...
Location
Location
United States , Lisle
Salary
Salary:
96404.10 - 169021.85 USD / Year
adtalem.com Logo
Adtalem Global Education
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree Computer Science, Computer Engineering, Software Engineering, or other related technical field
  • Master's Degree Computer Science, Computer Engineering, Software Engineering, or other related technical field
  • 8+ years experience in data engineering solutions such as data platforms, data ingestion, data management, or publication/analytics
  • 3+ years leadership experience building and managing a high performing teams
  • 2+ years experience in Google Cloud with services like BigQuery, Composer, GCS, DataStream, Dataflows
  • Progressively responsible experience starting as Data Engineer and advancement in complexity, and level of responsibility
  • Must be highly analytical having the proven ability to develop and reverse complex engineering solutions
  • Critical thinking and advanced problem-solving skills are core behaviors among the team
  • Excellent oral/written communication and presentation skills
  • Experience in objectively evaluating current processes for opportunities to optimize inter departmental communication, collaboration, and end-to-end process performance
Job Responsibility
Job Responsibility
  • Design and build trusted, reliable and timely datasets, metrics and data pipelines that are critical to the direction of the company
  • Build and lead a high-performing data engineering team in a hands-on technical capacity
  • Be responsible for shaping how we acquire, collect and leverage data
  • Define and manage SLA's for all data sets and processes running in production
  • Work closely with Product Managers, Analysts, Data Scientists to develop and own data-driven systems
  • Develop data democratization layer for self-served reporting
  • Assist development team in troubleshooting, coding, testing, implementation, and documenting solutions
  • Support, grow, mentor and inspire new and existing team members
  • Be a key leader of the Data and Analytics team, working to propel the business towards being more data-driven
  • Performs other duties as assigned
What we offer
What we offer
  • Health, dental, vision, life and disability insurance
  • 401k Retirement Program + 6% employer match
  • Participation in Adtalem’s Flexible Time Off (FTO) Policy
  • 12 Paid Holidays
  • Eligible to participate in an annual incentive program
  • Fulltime
Read More
Arrow Right

Engineering Leader

We’re seeking an experienced Engineering Leader to join a growing Data Strategy,...
Location
Location
Salary
Salary:
Not provided
capstonec.com Logo
Capstone IT Staffing
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in software or data engineering
  • 2+ years in a technical leadership or team-lead capacity
  • Strong hands-on understanding of modern data platform technologies, including Snowflake, SQL, Python, and ETL frameworks (dbt, Airflow, or similar)
  • Experience in building or maintaining BI/analytics solutions (Power BI, Tableau, Looker, or similar)
  • Proven ability to manage multiple concurrent initiatives, balancing strategic oversight with tactical execution
  • Familiarity with data architecture, data pipelines, and data governance principles in an enterprise context
  • Excellent communication and stakeholder management skills
  • Demonstrated ability to lead through influence and foster a high-performing, collaborative team culture in a remote setting
Job Responsibility
Job Responsibility
  • Lead and mentor a small cross-functional engineering team (data engineers, analysts, BI developers) delivering multiple concurrent projects that serve enterprise data and analytics needs
  • Drive end-to-end project delivery — from problem definition and design through execution, testing, and delivery — ensuring high standards of quality, performance, and scalability
  • Engage in deep technical and design discussions, guiding solution approaches and removing blockers across data ingestion, transformation, and visualization layers
  • Oversee and contribute to projects such as: Building and optimizing data pipelines and ingestion frameworks (including integrations with Snowflake)
  • Developing and maintaining BI dashboards and analytics solutions for business and executive stakeholders
  • Sunsetting and modernizing legacy data lakes, ETL pipelines, and reporting systems
  • Supporting ad-hoc data engineering and analytics initiatives from senior leadership or the Data Science organization
  • Collaborate with senior managers and data strategy leaders to align team objectives with overall enterprise data goals
  • Establish and reinforce engineering best practices in code quality, CI/CD, data modeling, documentation, and operational excellence
  • Act as a bridge between technical contributors and business stakeholders — translating complex requirements into scalable, maintainable solutions
Read More
Arrow Right

Sr. Full Stack Software Engineer

We are seeking an experienced and ambitious Sr. Full Stack Software Engineer who...
Location
Location
Canada , Vancouver
Salary
Salary:
162950.00 - 185683.00 CAD / Year
dialpad.com Logo
Dialpad
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional experience in Full-Stack Software Engineering
  • 2+ years in a Senior or Lead capacity
  • Strong experience with Python, APIs, Vue/React, HTML, CSS, JavaScript, TypeScript, GraphQL, GCP, or other cloud infrastructures
  • Practical experience designing, deploying, and optimizing solutions leveraging serverless computing, microservices, and event-driven architectures
  • Proficiency with both SQL and NoSQL databases
  • Experience building reusable and modular components for both frontend and backend
  • Experience mentoring junior engineers
  • Experience with Agile development methodologies
  • Strong debugging and troubleshooting skills
  • Strong communication and collaboration skills
Job Responsibility
Job Responsibility
  • Design, develop, and deploy high-quality features across Dialpad's web and desktop-native applications
  • Write clean, modular, and maintainable code using best practices along with unit & integration tests
  • Participate in code reviews to ensure code quality, maintainability, and scalability
  • Ensure that features are shipped on time and with the highest quality
  • Take on production on-call activities to support and resolve issues arising from QA and customers
  • Participate in a rotating production on-call schedule to quickly diagnose and resolve critical issues
  • Participate in deploying new Dialpad releases
  • Collaborate with cross-functional teams to build and use common components and practices across Dialpad products
  • Mentor junior engineers and help them grow their skills and expertise
What we offer
What we offer
  • Competitive benefits and perks
  • Robust training program
  • Inclusive office environment
  • Certified Great Place to Work culture
  • Fulltime
Read More
Arrow Right
New

Machine Learning Engineering Team Lead

Lead a high-performing team focused on building large-scale distributed training...
Location
Location
Germany , Berlin
Salary
Salary:
Not provided
aignostics.com Logo
Aignostics
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, Mathematics, or a related field
  • 6+ years of software engineering or ML engineering experience, with at least 2 years in a technical leadership or team lead role
  • Proven track record of building and leading high-performing engineering teams
  • Experience guiding projects across the whole Software Development Life Cycle
  • Deep understanding of fundamental Machine Learning concepts and principles, familiarity with advanced model optimization techniques
  • Significant experience with large-scale distributed training systems and frameworks (especially PyTorch and NCCL)
  • Familiarity with GPUs, distributed systems, parallel computing and scaling laws
  • Advanced programming skills in Python, experience in performance-critical languages (C/C++ or CUDA) being a plus
  • Familiarity of MLOps/DevOps best practices including CI/CD, Docker, Kubernetes, and observability, cloud platforms (GCP, AWS or Azure) and infrastructure-as-code
  • Experience with Linux, version control, and container technologies
Job Responsibility
Job Responsibility
  • Build and scale a high-performing team capable of tackling complex distributed ML challenges
  • Own the full employee lifecycle: recruiting, onboarding, performance management, career development, and retention
  • Empower your team members and help them grow in autonomy and technical expertise
  • Mentor engineers at all levels, fostering a culture of continuous learning and psychological safety
  • Create an inclusive environment where diverse perspectives drive innovation
  • Define and execute technical roadmaps aligned with company objectives and product needs
  • Lead resource allocation and capacity planning to balance team workload and business priorities
  • Own FinOps responsibilities: optimize cloud costs, track spending, and ensure efficient resource utilization
  • Ensure operational readiness through monitoring, incident response protocols, and system reliability practices
  • Establish and track KPIs for team performance, system efficiency and health
What we offer
What we offer
  • Learning & Development yearly budget of 1,000€ (plus 2 L&D days)
  • Language classes, and internal development programs
  • Access to leadership development programs and executive coaching
  • Flexible working hours and teleworking policy
  • 30 paid vacation days per year
  • Family & pet friendly and support flexible parental leave options
  • Subsidized membership of your choice among public transport, sports, and well-being
  • Social gatherings, lunches, and off-site events for a fun and inclusive work environment
  • Optional company pension scheme
Read More
Arrow Right