This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
Robert Half is partnering with an Austin-based client to hire a Site Reliability Engineer. This position offers an exciting opportunity to work with a rapidly growing team, ensuring the reliability, performance, and scalability of critical systems. The ideal candidate will bring technical expertise in Azure infrastructure and scripting languages, along with a proactive approach to automation and problem-solving.
Job Responsibility:
Monitor, maintain, and optimize Azure infrastructure to ensure high performance, availability, and reliability across IaaS, PaaS, and SaaS environments
Define, measure, and continuously improve Service Level Indicators (SLIs) and Service Level Objectives (SLOs) using tools such as Azure Monitor, Application Insights, Log Analytics, and API Management
Design and develop automation tools and scripts using PowerShell, Bash, Python, and C# to eliminate manual processes and improve operational efficiency
Participate in a 24/7 on-call rotation, leading incident response efforts, performing root cause analysis, and implementing long-term preventive and reliability improvements
Partner closely with software engineering, QA, and other technology teams to ensure systems meet reliability, scalability, and performance standards
Support capacity planning, load testing, and performance tuning initiatives within a microservices architecture built on .NET and React
Troubleshoot complex system issues, including integrations with third-party platforms and APIs
Create, maintain, and enhance technical documentation, operational runbooks, and system diagrams to support knowledge sharing and operational excellence
Requirements:
Proven experience working with .NET and C#, with a focus on system reliability, scalability, and performance
Strong hands-on expertise with Azure services, including Azure Monitor, Application Insights, Log Analytics, and API Management
Proficiency in scripting and automation using PowerShell, Bash, and Python
Experience supporting and operating microservices-based architectures, including front-end technologies such as React
Demonstrated ability to effectively manage incidents, participate in on-call rotations, and perform root cause analysis
Experience with capacity planning, load testing, and performance optimization in cloud environments
Strong troubleshooting skills, particularly in supporting API integrations and third-party systems
Excellent written and verbal communication skills, with a strong emphasis on clear documentation and cross-team collaboration
What we offer:
medical, vision, dental, and life and disability insurance