CrawlJobs Logo

Senior Software Engineer - Datacenter Platform

Serbia, Belgrade · Job Posted February 04, 2026
Apply Position
Job Link Share

Job Description

It is a mobile-first, cloud-first world, and we are enabling it. Microsoft Azure is at the core of the Microsoft Cloud, providing the foundational infrastructure for large-scale, distributed, and dynamic computing. Our team within Azure delivers the software platform that powers internal Microsoft services such as Office 365, Bing.com, Xbox Live, Skype, and OneDrive, as well as external customers who rely on us to run mission-critical cloud applications for their businesses. We are seeking a Senior Software Engineer to help evolve, expand, and define our software platform and infrastructure. Areas of focus include core infrastructure services at the lowest levels of the stack, achieving five nines (99.999%) reliability, fault tolerance, distributed service monitoring, operational efficiency across the data center hardware lifecycle, performance metrics collection and analysis, alerting, visualization, device operations, and coordination of node diagnostics and repairs. This is a dynamic and fast-paced environment offering a unique opportunity to work on something highly strategic to Microsoft and impactful across the industry. Few roles in computer science provide the chance to operate at this massive scale. If you are passionate about building robust, highly distributed software systems that form the backbone of the Microsoft Cloud, we would love to connect.

Job Responsibility

  • Design and develop solutions that build and improve cloud services running over distributed system.
  • Provide new features for Microsoft Cloud internal infrastructure software.
  • Keep infrastructure services running and deliver code updates on a regular cadence to improve performance and reliability.
  • Collaborate with appropriate stakeholders to determine user requirements for a scenario.
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.

Requirements

  • Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Nice to have

Master's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Senior Software Engineer - Datacenter Platform

8 matching positions

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

We are looking for a highly skilled engineer with deep expertise in building and...
Location
Location
United States , San Francisco
Salary
Salary:
166000.00 - 201000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of experience in infrastructure or platform engineering, with a focus on observability and monitoring systems
  • Deep expertise with metrics systems (Prometheus, Thanos, Mimir, Cortex), logging pipelines (Fluent Bit, Vector, Loki, ELK/Opensearch), and tracing platforms (Jaeger, Tempo, OpenTelemetry)
  • Strong programming skills in Go or Python for automation, operators, and custom integrations
  • Experience running observability platforms on Kubernetes and operating them at scale across multi-datacenter environments
  • Proven ability to design, optimize, and scale telemetry pipelines handling high cardinality and high throughput data
  • Solid understanding of distributed systems, performance engineering, and debugging complex workloads
  • Strong collaboration skills and the ability to influence engineering teams to adopt observability best practices
Job Responsibility
Job Responsibility
  • Designing and operating scalable observability systems (metrics, logging, tracing) across multi-datacenter Kubernetes environments
  • Architecting end-to-end telemetry pipelines, including ingestion, storage, querying, and visualization
  • Extending monitoring and alerting with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry
  • Building scalable log collection and processing pipelines with Fluent Bit, Vector, Loki, or ELK/Opensearch stacks
  • Implementing distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrating with service meshes, load balancers, and APIs
  • Defining and driving adoption of SLOs, SLIs, and error budgets across services and teams
  • Automating provisioning and scaling of observability infrastructure with Kubernetes, Terraform, and custom tooling (Go, Python)
  • Ensuring reliability and cost efficiency of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure)
  • Embedding security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls
  • Partnering with engineering teams to embed observability into applications, services, and infrastructure
What we offer
What we offer
  • Restricted Stock Units in a fast growing, well-funded technology company
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Network Software Test – Senior Software Engineer

About Arrcus: Arrcus was founded to enhance business efficiency through superior...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
arrcus.com Logo
Arrcus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • BS/MS in Computer Engineering/Computer Science or equivalent degree
  • Ability to write high quality automated test cases using Python
  • 5+ years of hands-on test experience of Networking protocols such as OSPF, BGP, ISIS, MPLS, BFD, MLAG, EVPN, VxLAN, SR-MPLS, SRv6
  • Proficient in the use of traffic generators to develop Data Path and Control Plane Test cases
  • Growing the existing automation framework to support customer user case testing scenarios and cross-feature integrations
  • Working knowledge of Test Harness like Robot framework, Jinja2 templating
  • Expertise in Scale and Performance Testing using simulation for customer networks
  • Using development infrastructure tools, such as Jenkins, Git, JIRA, etc.
  • Familiarity with Docker Containers, VMs expected
  • Knowledge of Network merchant silicon chipsets and Whitebox platforms
Job Responsibility
Job Responsibility
  • Deep understanding of Layer 2/3 protocols like BGP, BGP EVPN, ISIS, SR, MPLS,L3VPN, SRv6, and ability to validate networking functionality and performance through automation
  • Ability to understand and learn Service Provider, Datacenter, Campus/ Enterprise Customer Solutions
  • Influence development team to align with customer expectations with respect to deployment and UX needs
  • Creative problem solving and excellent Troubleshooting skills
  • Ability to handle multiple tasks and complete them on time
  • Good documentation and presentation skills
What we offer
What we offer
  • Generous compensation packages including equity
  • Medical Insurance
  • Parental Leave
  • Sabbatical leave (After 4 years of service)
  • Fulltime
Read More
Arrow Right
New

Senior Software Engineer

Microsoft is a company where passionate innovators come to collaborate, envision...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or Engineering or Mathematics or Physics or IT technical discipline
  • 7+ years of programming experience in C, C#, C++
  • Proficiency in troubleshooting and debugging
  • 4+ years of commercial systems level software development experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Design, implement and maintain services and components that provide secure and resilient platform for SQL control plane and data plane services
  • Develop innovative technology for managing massive-scale operations for large customers tolerating underlying system failures, software and hardware upgrades and reconfiguration, while enabling optimal placement and utilization of Azure clusters and regions
  • Design and implement solutions for cluster expansions at a global scale, analyze telemetry and the behavior of large distributed systems to mine actionable insights
  • Ensure the highest standards of quality and reliability across all services and solutions
  • Contribute to design of service software stack, datacenter design and network topology
  • Release features on time, with high quality, meeting functional, performance, scalability, and compliance requirements
  • Research and adopt modern technology to improve quality of the service, increase customer value or reduce operating cost
  • Participate in on-call rotation for the team
  • Embody our culture and values
Read More
Arrow Right

Senior Software Engineer

The Budget Optimization Engineering team at Microsoft builds the real-time data ...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C#, Java, Go, or Python OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
  • 7+ years of technical experience in software development, service engineering, or systems engineering.
  • 5+ years of experience building and operating large-scale distributed systems, backend services, or data platforms with strict SLA requirements.
  • Apache Kafka — solid understanding of consumers, producers, offset management, partition strategies, performance tuning, and cross-datacenter replication patterns.
  • Kubernetes — production experience writing and deploying Helm charts
  • hands-on with Deployments, StatefulSets, Services, ConfigMaps, Secrets, Jobs, and HPAs
  • comfortable with multi-cluster and multi-datacenter environments.
  • Cloud infrastructure — practical experience with Azure (AKS, ACR, Azure Key Vault, Azure Application Insights, Azure Log Analytics)
  • familiarity with Azure DevOps or equivalent CI/CD platforms.
Job Responsibility
Job Responsibility
  • Design and build highly scalable backend services and data pipelines that support privacy-preserving measurement and analytics scenarios using Java, Python (and C# where applicable).
  • Maintain and improve production services across the optimization platform — including Kafka streaming pipelines, budget controllers, job orchestration (job-broker), and deal monitoring — with a focus on reliability and strict SLA adherence.
  • Drive integrations with external data and measurement partners, designing stable interfaces, schema governance patterns, and robust validation pipelines.
  • Work closely with PMs, data science, privacy, and security teams to translate measurement needs into scalable platform capabilities.
  • Contribute to the full service lifecycle: design, implementation, testing, code review, and deployment.
  • Improve reliability and observability of Kafka consumer/producer pipelines (offset management, retry strategies, delivery guarantees) across cross-datacenter replication flows.
  • Design and implement Kubernetes/Helm deployments for services currently running on legacy orchestration (Maestro, SAND instances, bare Docker), targeting Azure-native cloud infrastructure.
  • Integrate application telemetry (Prometheus/Dropwizard Metrics) with Azure Application Insights and Azure Log Analytics to support production observability and SLA monitoring.
  • Apply practical experience with Azure services — including AKS, ACR, and Azure Key Vault — to support secure, cloud-native deployments.
  • Lead initiatives to make delivery of high-quality software routine and efficient across the full SDLC, from inception and technical design through testing and production operations.
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

Be part of the Datacenter Management Transformation. Cloud Technology dominates ...
Location
Location
United States , Multiple Locations
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • OR equivalent experience.
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:  Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Job Responsibility
Job Responsibility
  • Collaborates with appropriate stakeholders to determine user requirements for a scenario.
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, Enterprise Resilience

At Vanta, our mission is to help businesses earn and prove trust. We believe tha...
Location
Location
United States
Salary
Salary:
207000.00 - 244000.00 USD / Year
vanta.com Logo
Vanta
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience operating services in multiple environments requiring strict compliance including FedRAMP
  • Technical lead in successfully driving large scale reliability initiatives across an entire product engineering organization
  • Played technical leadership roles on Infrastructure or platform teams
  • Experience with infrastructure, AWS services, and scaling platforms in fast-growing environments
  • Cares deeply about empowering other teams to build highly resilient and scalable production services
  • Thoughtful about trade-offs and has good product sense when creating highly available infrastructure/services
  • Open to using AI to amplify their skills and strengthen their work - demonstrating curiosity, a willingness to learn, and sound judgment in applying AI responsibly to improve efficiency and impact
Job Responsibility
Job Responsibility
  • Build and operate the systems that power Vanta’s FedRAMP environments, including automated release, vulnerability remediation, and evidence generation pipelines that meet strict compliance timelines
  • Design and maintain Vanta’s vulnerability management platform, automating detection, remediation, and compliance reporting across both FedRAMP and non-FedRAMP environments
  • Define and evolve Vanta’s production reliability framework, including SLOs, incident response patterns, observability standards, service catalog, metrics dashboards, and the Vanta SLA definition
  • Improve incident response workflows and systems for faster recovery
  • Engineer reliability improvements for CI and deploy workflows, reducing production friction and operational load, while maintaining deployment velocity
  • Collaborate with product teams to embed reliability best practices, guiding operational readiness reviews and helping teams design for resilience
  • Lead design and improvement of datacenter and environment build-outs for future FedRAMP levels and regional expansion
  • Identify and solve complex scalability and performance challenges, particularly related to service reliability and data throughput
  • Work with talented and kind engineers to make a significant impact on our customer base, enabling them to improve their security and prove it
  • Contribute to building Vanta’s engineering culture as we grow
What we offer
What we offer
  • Offers Equity
  • Medical benefits
  • 401(k) plan
  • Other company perk programs
  • Comprehensive medical, dental, and vision coverage, with 100% of employee-only benefit premiums covered for most medical plans
  • 16 weeks fully-paid Parental Leave for all new parents
  • Health & wellness stipend
  • Remote workspace, internet, and cellphone stipend
  • Commuter benefits for team members who report to the SF and NYC office
  • Family planning benefits
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

The Azure Core New Tech team is seeking engineers who are eager to automate how ...
Location
Location
United States , Multiple Locations
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java - OR equivalent experience
  • 2+ year(s) experience where designed, proposed, and managed software features across teams: APIs, schema, etc
  • 1+ year(s) experience with Validation of datacenter hardware, managing multiple types of hardware/firmware OR Networking concepts including specific network protocols and devices. OR Platform development: orchestrator/policy engines/test platforms/core libraries used across multiple teams
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
Job Responsibility
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Fulltime
Read More
Arrow Right

Senior Software Engineer

Do you want to join a world-class engineering team in India and work on hard tec...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or Engineering or Mathematics or Physics or IT technical discipline
  • 8+ years of programming experience in C#, C++, or C
  • Proficiency in troubleshooting and debugging
  • 8+ years of commercial systems level software development experience
  • Experience with large scale distributed systems, multithreading and object-oriented programming
Job Responsibility
Job Responsibility
  • Design, implement and maintain services and components that provide secure and resilient platform for SQL control plane and data plane services
  • Develop innovative technology for managing massive-scale operations for large customers tolerating underlying system failures, software and hardware upgrades and reconfiguration, while enabling optimal placement and utilization of Azure clusters and regions
  • Design and implement solutions for cluster expansions at a global scale, analyze telemetry and the behavior of large distributed systems to mine actionable insights
  • Ensure the highest standards of quality and reliability across all services and solutions
  • Contribute to design of service software stack, datacenter design and network topology
  • Release features on time, with high quality, meeting functional, performance, scalability, and compliance requirements
  • Research and adopt modern technology to improve quality of the service, increase customer value or reduce operating cost
  • Participate in on-call rotation for the team
  • Mentor and grow junior members in the team
  • Partner with Program Management, architects, and leaders to define requirements, scope projects and validate solutions
  • Fulltime
Read More
Arrow Right