Software Reliability Engineer

United States, Atlanta · Employment contract · 83900.00 - 151200.00 USD / Year · Job Posted May 16, 2026

Job Description

This role improves and protects the software and systems that support IT services by managing scalability, availability, latency, performance, security, and capacity. It supports the Subscription Product Engineering organization, including in-house subscription and customer lifecycle platforms that underpin critical business operations and customer-facing services across production and non-production environments. The work primarily involves designing and maintaining continuous integration and continuous delivery (CI/CD) pipelines and building applications on cloud-native platforms. The role is distinguished by its focus on continuously improving operational support through automation, monitoring, and reliability-focused practices. Success is measured by improved software delivery speed, reliability, operational efficiency, and platform stability, and by a consistent customer experience.

Job Responsibility

  • Apply DevOps automation tools to manage CI/CD pipelines and configuration for production and non-production environments
  • Perform environment management and automated server provisioning to support scalable infrastructure
  • Deliver software changes that improve the availability, scalability, latency, and efficiency of IT services
  • Create and manage dashboards, alerts, logging standards, and health checks to improve service quality, supportability, and visibility across services
  • Contribute to software delivery process improvements including cloud enablement, containerization, and deployment automation
  • Support cloud-native applications, APIs, microservices, and platform operations across production and non-production environments
  • Troubleshoot production incidents, participate in root cause analysis, and support implementation of long-term reliability improvements with assistance from leadership and senior technical team members
  • Partner with Software Engineering, DevOps, and platform teams to improve application resiliency, scalability, and deployment automation under established technical direction
  • Contribute to operational readiness activities, including release validation, capacity planning, disaster recovery support, and environment support, under the guidance of senior leadership
  • Participate in Agile ceremonies, production support activities, and continuous improvement initiatives

Requirements

  • Legally authorized to work in the United States
  • At least 18 years of age
  • Bachelor's Degree plus 2 years of related work experience OR combination of education and experience deemed equivalent (Required)
  • 2-4 years of relevant experience (Preferred)
  • Experience working in an Agile and DevOps environment (Preferred)
  • Experience in one or more of C, C#, Java, Perl, Python, or Go, or scripting experience in Shell and Perl (Preferred)
  • Experience with Continuous Integration/Continuous Delivery tools such as Jenkins or CloudBees, and other automation tools (Preferred)
  • Experience with DevOps tools such as Ansible, Chef, or Puppet; experience with Docker, Kubernetes, or similar is preferable (Preferred)
  • Experience with an APM tool such as AppDynamics and a logging tool such as Splunk (Preferred)
  • Experience working in a cloud environment (public/private) (Preferred)
  • Experience in migrating to cloud or cloud native environments (Preferred)
  • Experience supporting APIs, microservices, distributed applications, or enterprise production platforms (Preferred)
  • Experience with infrastructure automation tools such as Terraform, Ansible, Chef, or Puppet (Preferred)
  • Experience with observability or monitoring tools such as Splunk, AppDynamics, Dynatrace, Grafana, Prometheus, or similar platforms (Preferred)
  • DevOps (Required)
  • Integration (Required)
  • CI/CD pipeline support and deployment automation (Desired)
  • Production and non-production environment support (Desired)
  • Monitoring, observability, or operational support practices (Required)

What we offer

  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical insurance
  • Dental insurance
  • Vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Back-up care
  • Enhanced family support
  • Childcare subsidy
  • Tuition assistance
  • College coaching
  • Short-term disability
  • Long-term disability
  • Voluntary AD&D coverage
  • Voluntary accident coverage
  • Voluntary life insurance
  • Voluntary disability insurance
  • Voluntary long-term care insurance
  • Mobile service & home internet discounts
  • Pet insurance
  • Access to commuter and transit programs

Similar Jobs for Software Reliability Engineer

8 matching positions

Software Reliability Engineer

This is built for engineers who see software through a systems lens rather than ...
Location: United Kingdom, North West
Salary: 45000.00 - 55000.00 GBP / Year
Company: Linux Recruit
Expiration Date: Until further notice
Requirements
  • Strong software development background
  • Comfortable with modern programming languages like Python, JavaScript, or Go
  • Open to picking up new tools as needed
  • Experience with distributed or cloud-native systems
  • Proactive approach to performance and system health
Job Responsibility
  • Designing, building, and maintaining cloud-native systems where low latency, security, and scalability are non-negotiable
  • Engineering solutions that prevent incidents rather than just responding to them
  • Writing production-grade code daily
  • Thinking in terms of SLAs and SLOs
  • Monitoring, observability, and metrics using OpenTelemetry, Grafana, Splunk, and PagerDuty
Employment type: Full-time

Senior Software Engineer / Principal Software Engineer - Copilot CLI

Within GitHub and Microsoft CoreAI, the Copilot CLI team builds GitHub's coding ...
Location: United States, Redmond
Salary: 119800.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screening: Microsoft Cloud Background Check. This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years
Job Responsibility
  • Take ownership of critical product and platform areas of the Copilot CLI and shared agent runtime
  • Set a high technical and quality bar for agentic systems and developer-facing tooling
  • Design and ship performant, reliable terminal experiences that developers depend on for daily work
  • Use data, benchmarks, and direct user feedback to guide iteration and investment
  • Collaborate across org boundaries to enable other teams to build agentic products on top of a shared foundation
  • Influence architecture, technical direction, and engineering standards beyond your immediate team
What we offer
  • Certain roles may be eligible for benefits and other compensation
Employment type: Full-time

Backend Software Engineer / Senior Software Engineer- Kusto

Are you excited by the challenge of redefining how people explore and analyze ma...
Location: Israel, Tel Aviv, Herzliya
Salary: Not provided
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • 4+ years of technical engineering experience with coding in languages including, but not limited to, C#, Python or Java
  • 2+ years building and running services in a cloud environment (Azure, AWS, or GCP)
  • Experience in designing and operating large-scale distributed systems with high availability and reliability
Job Responsibility
  • Design, develop, and improve cloud-native services that are scalable, secure, and easy to operate
  • Drive architectural decisions and lead the development of major components in a distributed, high-SLA system
  • Collaborate with cross-functional teams in ILDC and abroad to deliver end-to-end solutions
  • Conduct code and design reviews and mentor junior engineers to grow technical excellence across the team
  • Help shape the future of real-time analytics in Microsoft Fabric RTI, with customer impact as your north star
Employment type: Full-time

Senior Software Engineer and Software Engineer II

OneDrive and SharePoint are rapidly growing services at the center of Microsoft'...
Location: United States, Redmond
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience related to cloud-scale distributed design and patterns
  • The ability to deliver informed designs and plans ahead of production and execution
  • Knowledge of others' expertise and the ability to involve multiple players (within and outside the organization) in the creation or development of novel products, processes, or research streams
Job Responsibility
  • Design and deliver systems that enable partners and ISVs to migrate from other cloud providers, improve core systems performance and efficiencies, and ensure zero customer impact throughout the change management cycle
  • Deliver systems to meet our business continuity planning goals, provide telemetry for optimizing the service, and drive down our response time for detecting and resolving service issues
  • Create, implement, optimize, debug, refactor, and reuse code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Contribute to the identification of dependencies and the development of design documents for a product area with little oversight
  • Help identify other teams and technologies that will be leveraged, how they will interact, and when one's system may provide support to others
  • Contribute to determining back-end dependencies associated with product, application, service, or platform functionality for product features
  • Understand the downstream effects of solutions and work provided
  • Help identify areas of dependency and overlap with other teams or team members, and drive coordination
  • Remain current in skills by investing time and effort into staying abreast of current developments that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Review work items to deepen knowledge of product features in partnership with appropriate stakeholders (e.g., project managers) and execute project plans, release plans, and work items
Employment type: Full-time

Software Engineer II/Sr. Software Engineer

Join Microsoft’s Core AI team and help shape the future of intelligent software ...
Location: United States, Redmond
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements is required for this role
  • This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter
Job Responsibility
  • Design and ship AI-assisted features in Visual Studio that help developers generate, explain, and refactor code—measured by adoption, reliability, and user satisfaction
  • Bring intelligence into the IDE by integrating GitHub Copilot/MCP tools into core IDE workflows with strong attention to performance, privacy, and safety-by-default
  • Collaborate with partner teams across Microsoft and GitHub to deliver secure, performant solutions and iterate quickly based on real developer feedback
  • Contribute to designs (APIs, data flows, extensibility points) and participate in code/design reviews to maintain quality and scalability for a large codebase
  • Instrument and learn using telemetry, experimentation, and diagnostics to improve latency, reliability, and relevance over time
Employment type: Full-time

Software Engineer II & Senior Software Engineer

Attacks begin on the endpoint—it’s where an intruder first lands, runs malware, ...
Location: United States, Redmond
Salary: 102100.00 - 202200.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer, and/or government security screening requirements
  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years.
Job Responsibility
  • Build and improve detection, OS event monitoring, telemetry, and reporting in the on-device agent
  • Develop native components that detect and respond to real-world endpoint threats
  • Design, deliver, and maintain components with a focus on reliability, performance, and operational health
  • Help ensure the agent remains efficient, safe, and aligned with the macOS security model
  • Contribute to performance, concurrency, and compatibility across macOS versions
  • Produce high-quality telemetry that supports investigation, analytics, and security insights
  • Investigate and resolve issues such as crashes, performance regressions, and compatibility challenges
  • Apply secure coding practices and validate assumptions to build resilient systems
  • Collaborate with engineers and partner teams to deliver integrated security solutions
  • Contribute to design discussions, code reviews, and shared engineering practices
Employment type: Full-time

Software engineer 2 / Senior Software engineer - Azure Data

Microsoft's Azure Data engineering team is leading the transformation of analyti...
Location: India, Bangalore
Salary: Not provided
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience with the Azure stack including Storage, Compute, Networking, Fabric, Purview, Synapse, AKS, DevOps, Data Factory, or Power BI
  • Experience with big data technologies such as Spark, Kafka, Hadoop, or HBase
  • Experience building data lake or data engineering products, tools, or pipelines
  • Familiarity with container-based architectures (Docker, Kubernetes)
  • Ability to debug complex distributed systems on Linux and/or Windows platforms
Job Responsibility
  • Write extensible, maintainable code in C#, Java, Scala, or Python for Fabric Materialized Lake View services and HDInsight components
  • Use AI tools and coding best practices across the development lifecycle
  • Design data refresh, scheduling, and query optimisation features with minimal supervision
  • Review code from teammates for correctness, test coverage, security risks, and adherence to team standards
  • Coach junior engineers through code reviews
  • Debug complex issues in distributed systems running on Azure, Linux, and Windows
  • Run live site operations on a rotational, on-call basis
  • Integrate logging and instrumentation to gather telemetry on system health, performance, reliability, and security
  • Work with product managers, technical leads, and partners across geographies to define customer requirements for Materialized Lake View features
Employment type: Full-time

Software Engineer II & Senior Software Engineer

Security represents the most critical priorities for our customers in a world aw...
Location: United States, Redmond
Salary: 100600.00 - 199000.00 USD / Year
Company: Microsoft Corporation
Expiration Date: Until further notice
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 2+ years technical engineering experience with coding in languages including C, C++, C#, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements
  • Microsoft Cloud Background Check
  • Master's Degree in Computer Science or related technical field AND 3+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 5+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Experience troubleshooting and optimizing automation, reliability, and monitoring for Live Site as part of an on-call rotation owned by the engineering team
  • Experience with distributed systems, messaging systems such as Kafka, and large-scale system design
Job Responsibility
  • Lead the architecture, design and implementation of services for extremely high scale, throughput, durability, and low latency
  • Innovate and make service deployment and maintenance an efficient, well-oiled machine that provides excellent reliability with minimal manual engineer intervention
  • Conduct in-depth triage, troubleshooting, and forensics across all facets of the cloud stack while executing corrective-action and continual service improvement processes
  • Drive Infrastructure security improvements for mission critical high scale workloads
  • Lead the definition of requirements, KPIs, priorities and planning of engineering deliverables
  • Mentor and grow an energetic, diverse, and driven team with a good mix of senior and mid-level engineers
Employment type: Full-time