Senior Software Engineer

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

We are looking for a highly skilled engineer with deep expertise in building and...

Location

United States , San Francisco

Salary:

166000.00 - 201000.00 USD / Year

Crusoe

Expiration Date

Until further notice

Requirements

7+ years of experience in infrastructure or platform engineering, with a focus on observability and monitoring systems
Deep expertise with metrics systems (Prometheus, Thanos, Mimir, Cortex), logging pipelines (Fluent Bit, Vector, Loki, ELK/Opensearch), and tracing platforms (Jaeger, Tempo, OpenTelemetry)
Strong programming skills in Go or Python for automation, operators, and custom integrations
Experience running observability platforms on Kubernetes and operating them at scale across multi-datacenter environments
Proven ability to design, optimize, and scale telemetry pipelines handling high cardinality and high throughput data
Solid understanding of distributed systems, performance engineering, and debugging complex workloads
Strong collaboration skills and the ability to influence engineering teams to adopt observability best practices

Job Responsibility

Designing and operating scalable observability systems (metrics, logging, tracing) across multi-datacenter Kubernetes environments
Architecting end-to-end telemetry pipelines, including ingestion, storage, querying, and visualization
Extending monitoring and alerting with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry
Building scalable log collection and processing pipelines with Fluent Bit, Vector, Loki, or ELK/Opensearch stacks
Implementing distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrating with service meshes, load balancers, and APIs
Defining and driving adoption of SLOs, SLIs, and error budgets across services and teams
Automating provisioning and scaling of observability infrastructure with Kubernetes, Terraform, and custom tooling (Go, Python)
Ensuring reliability and cost efficiency of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure)
Embedding security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls
Partnering with engineering teams to embed observability into applications, services, and infrastructure

What we offer

Restricted Stock Units in a fast growing, well-funded technology company
Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
Employer contributions to HSA accounts
Paid Parental Leave
Paid life insurance, short-term and long-term disability
Teladoc
401(k) with a 100% match up to 4% of salary
Generous paid time off and holiday schedule
Cell phone reimbursement
Tuition reimbursement

Fulltime

Network Software Test – Senior Software Engineer

About Arrcus: Arrcus was founded to enhance business efficiency through superior...

Location

India , Bangalore

Salary:

Not provided

Arrcus

Expiration Date

Until further notice

Requirements

BS/MS in Computer Engineering/Computer Science or equivalent degree
Ability to write high quality automated test cases using Python
5+ years of hands-on test experience of Networking protocols such as OSPF, BGP, ISIS, MPLS, BFD, MLAG, EVPN, VxLAN, SR-MPLS, SRv6
Proficient in the use of traffic generators to develop Data Path and Control Plane Test cases
Growing the existing automation framework to support customer user case testing scenarios and cross-feature integrations
Working knowledge of Test Harness like Robot framework, Jinja2 templating
Expertise in Scale and Performance Testing using simulation for customer networks
Using development infrastructure tools, such as Jenkins, Git, JIRA, etc.
Familiarity with Docker Containers, VMs expected
Knowledge of Network merchant silicon chipsets and Whitebox platforms

Job Responsibility

Deep understanding of Layer 2/3 protocols like BGP, BGP EVPN, ISIS, SR, MPLS,L3VPN, SRv6, and ability to validate networking functionality and performance through automation
Ability to understand and learn Service Provider, Datacenter, Campus/ Enterprise Customer Solutions
Influence development team to align with customer expectations with respect to deployment and UX needs
Creative problem solving and excellent Troubleshooting skills
Ability to handle multiple tasks and complete them on time
Good documentation and presentation skills

What we offer

Generous compensation packages including equity
Medical Insurance
Parental Leave
Sabbatical leave (After 4 years of service)

Fulltime

New

Microsoft is a company where passionate innovators come to collaborate, envision...

Location

India , Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science or Engineering or Mathematics or Physics or IT technical discipline
7+ years of programming experience in C, C#, C++
Proficiency in troubleshooting and debugging
4+ years of commercial systems level software development experience
Ability to meet Microsoft, customer and/or government security screening requirements
This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter

Job Responsibility

Design, implement and maintain services and components that provide secure and resilient platform for SQL control plane and data plane services
Develop innovative technology for managing massive-scale operations for large customers tolerating underlying system failures, software and hardware upgrades and reconfiguration, while enabling optimal placement and utilization of Azure clusters and regions
Design and implement solutions for cluster expansions at a global scale, analyze telemetry and the behavior of large distributed systems to mine actionable insights
Ensure the highest standards of quality and reliability across all services and solutions
Contribute to design of service software stack, datacenter design and network topology
Release features on time, with high quality, meeting functional, performance, scalability, and compliance requirements
Research and adopt modern technology to improve quality of the service, increase customer value or reduce operating cost
Participate in on-call rotation for the team
Embody our culture and values

Senior Software Engineer

The Budget Optimization Engineering team at Microsoft builds the real-time data ...

Location

United States , Redmond

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C#, Java, Go, or Python OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
7+ years of technical experience in software development, service engineering, or systems engineering.
5+ years of experience building and operating large-scale distributed systems, backend services, or data platforms with strict SLA requirements.
Apache Kafka — solid understanding of consumers, producers, offset management, partition strategies, performance tuning, and cross-datacenter replication patterns.
Kubernetes — production experience writing and deploying Helm charts
hands-on with Deployments, StatefulSets, Services, ConfigMaps, Secrets, Jobs, and HPAs
comfortable with multi-cluster and multi-datacenter environments.
Cloud infrastructure — practical experience with Azure (AKS, ACR, Azure Key Vault, Azure Application Insights, Azure Log Analytics)
familiarity with Azure DevOps or equivalent CI/CD platforms.

Job Responsibility

Design and build highly scalable backend services and data pipelines that support privacy-preserving measurement and analytics scenarios using Java, Python (and C# where applicable).
Maintain and improve production services across the optimization platform — including Kafka streaming pipelines, budget controllers, job orchestration (job-broker), and deal monitoring — with a focus on reliability and strict SLA adherence.
Drive integrations with external data and measurement partners, designing stable interfaces, schema governance patterns, and robust validation pipelines.
Work closely with PMs, data science, privacy, and security teams to translate measurement needs into scalable platform capabilities.
Contribute to the full service lifecycle: design, implementation, testing, code review, and deployment.
Improve reliability and observability of Kafka consumer/producer pipelines (offset management, retry strategies, delivery guarantees) across cross-datacenter replication flows.
Design and implement Kubernetes/Helm deployments for services currently running on legacy orchestration (Maestro, SAND instances, bare Docker), targeting Azure-native cloud infrastructure.
Integrate application telemetry (Prometheus/Dropwizard Metrics) with Azure Application Insights and Azure Log Analytics to support production observability and SLA monitoring.
Apply practical experience with Azure services — including AKS, ACR, and Azure Key Vault — to support secure, cloud-native deployments.
Lead initiatives to make delivery of high-quality software routine and efficient across the full SDLC, from inception and technical design through testing and production operations.

Fulltime

Senior Software Engineer

Be part of the Datacenter Management Transformation. Cloud Technology dominates ...

Location

United States , Multiple Locations

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
OR equivalent experience.
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:  Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Job Responsibility

Collaborates with appropriate stakeholders to determine user requirements for a scenario.
Drives identification of dependencies and the development of design documents for a product, application, service, or platform.
Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI).
Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items.
Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate.
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale.

Fulltime

Senior Software Engineer, Enterprise Resilience

At Vanta, our mission is to help businesses earn and prove trust. We believe tha...

Location

United States

Salary:

207000.00 - 244000.00 USD / Year

Vanta

Expiration Date

Until further notice

Requirements

Experience operating services in multiple environments requiring strict compliance including FedRAMP
Technical lead in successfully driving large scale reliability initiatives across an entire product engineering organization
Played technical leadership roles on Infrastructure or platform teams
Experience with infrastructure, AWS services, and scaling platforms in fast-growing environments
Cares deeply about empowering other teams to build highly resilient and scalable production services
Thoughtful about trade-offs and has good product sense when creating highly available infrastructure/services
Open to using AI to amplify their skills and strengthen their work - demonstrating curiosity, a willingness to learn, and sound judgment in applying AI responsibly to improve efficiency and impact

Job Responsibility

Build and operate the systems that power Vanta’s FedRAMP environments, including automated release, vulnerability remediation, and evidence generation pipelines that meet strict compliance timelines
Design and maintain Vanta’s vulnerability management platform, automating detection, remediation, and compliance reporting across both FedRAMP and non-FedRAMP environments
Define and evolve Vanta’s production reliability framework, including SLOs, incident response patterns, observability standards, service catalog, metrics dashboards, and the Vanta SLA definition
Improve incident response workflows and systems for faster recovery
Engineer reliability improvements for CI and deploy workflows, reducing production friction and operational load, while maintaining deployment velocity
Collaborate with product teams to embed reliability best practices, guiding operational readiness reviews and helping teams design for resilience
Lead design and improvement of datacenter and environment build-outs for future FedRAMP levels and regional expansion
Identify and solve complex scalability and performance challenges, particularly related to service reliability and data throughput
Work with talented and kind engineers to make a significant impact on our customer base, enabling them to improve their security and prove it
Contribute to building Vanta’s engineering culture as we grow

What we offer

Offers Equity
Medical benefits
401(k) plan
Other company perk programs
Comprehensive medical, dental, and vision coverage, with 100% of employee-only benefit premiums covered for most medical plans
16 weeks fully-paid Parental Leave for all new parents
Health & wellness stipend
Remote workspace, internet, and cellphone stipend
Commuter benefits for team members who report to the SF and NYC office
Family planning benefits

Fulltime

Senior Software Engineer

The Azure Core New Tech team is seeking engineers who are eager to automate how ...

Location

United States , Multiple Locations

Salary:

119800.00 - 234700.00 USD / Year

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor's Degree in Computer Science, or related technical discipline AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java - OR equivalent experience
2+ year(s) experience where designed, proposed, and managed software features across teams: APIs, schema, etc
1+ year(s) experience with Validation of datacenter hardware, managing multiple types of hardware/firmware OR Networking concepts including specific network protocols and devices. OR Platform development: orchestrator/policy engines/test platforms/core libraries used across multiple teams
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter

Job Responsibility

Drives identification of dependencies and the development of design documents for a product, application, service, or platform
Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale

Fulltime

Senior Software Engineer

Do you want to join a world-class engineering team in India and work on hard tec...

Location

India , Bangalore

Salary:

Not provided

Microsoft Corporation

Expiration Date

Until further notice

Requirements

Bachelor’s degree in Computer Science or Engineering or Mathematics or Physics or IT technical discipline
8+ years of programming experience in C#, C++, or C
Proficiency in troubleshooting and debugging
8+ years of commercial systems level software development experience
Experience with large scale distributed systems, multithreading and object-oriented programming

Job Responsibility

Design, implement and maintain services and components that provide secure and resilient platform for SQL control plane and data plane services
Develop innovative technology for managing massive-scale operations for large customers tolerating underlying system failures, software and hardware upgrades and reconfiguration, while enabling optimal placement and utilization of Azure clusters and regions
Design and implement solutions for cluster expansions at a global scale, analyze telemetry and the behavior of large distributed systems to mine actionable insights
Ensure the highest standards of quality and reliability across all services and solutions
Contribute to design of service software stack, datacenter design and network topology
Release features on time, with high quality, meeting functional, performance, scalability, and compliance requirements
Research and adopt modern technology to improve quality of the service, increase customer value or reduce operating cost
Participate in on-call rotation for the team
Mentor and grow junior members in the team
Partner with Program Management, architects, and leaders to define requirements, scope projects and validate solutions

Fulltime

Select Country

Senior Software Engineer - Datacenter Platform

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?

Senior Software Engineer - Datacenter Platform

Senior+ Software Engineer - Cloud Availability Platform Engineering (Observability)

Network Software Test – Senior Software Engineer

Senior Software Engineer

Senior Software Engineer

Senior Software Engineer

Senior Software Engineer, Enterprise Resilience

Senior Software Engineer

Senior Software Engineer

Our AI answers in your language