CrawlJobs Logo

Sr Systems Reliability Engineer - Legal Technology

United States, Frisco 98500.00 - 177700.00 USD / Year · Job Posted March 21, 2026
Apply Position
Job Link Share

Job Description

The System Reliability Engineer (SRE) guides and mentors other SREs and improves and protects the software and systems behind T-Mobile's IT services, including scalability, availability, latency, performance, security, and capacity, while enabling faster, higher-quality software delivery. In this role, you will support T-Mobile’s Legal & Emergency Response platforms—mission-critical systems that enable response to urgent law enforcement and emergency situations. These systems operate in high-stakes environments where reliability, speed, and accuracy truly matter. When these systems are up and running, they can help enable real-world outcomes that impact people in critical moments. This is not a maintenance role. You’ll step into a modern, cloud-native platform that is still evolving—giving you the opportunity to shape how it scales, improves, and becomes truly resilient. You’ll work on meaningful problems, own production systems end-to-end, and directly influence how reliability is built into the platform. You’ll also work in an environment that embraces modern engineering practices, including strong adoption of AI-assisted tools, and a culture that values ownership, collaboration, and continuous improvement.

Job Responsibility

  • Apply DevOps automation for CI/CD, configuration management, and environment management (non-prod and prod)
  • Provision and manage environments
  • configure pipelines and infrastructure (VMs/containers)
  • Improve availability, scalability, latency, and efficiency of services, with emphasis on Legal Technology platforms
  • Own reliability and performance of critical applications (LRS, E-Core, LEEP)
  • Participate in on-call rotation (~1 week every 2 months)
  • respond to alerts/incidents
  • Lead incident response, root cause analysis, and post-incident improvements
  • Build and enhance observability (dashboards, alerts), runbooks, and automation
  • Partner with engineering to design for reliability and eliminate recurring issues in distributed systems
  • Drive improvements in delivery and operations (cloud enablement, microservices, containerization, zero-downtime deployments)
  • Mentor SREs and guide reliability practices across the team

Requirements

  • Bachelor’s Degree plus 5 years of related work experience OR Advanced degree with 3 years of related experience
  • 4–7+ years relevant experience (Required)
  • Experience in Agile/DevOps environments (Required)
  • Proficiency in one or more: Java, Python, Go, C/C#, or scripting (Shell/Perl) (Required)
  • Experience with DBMS (Postgres or Oracle) (Required)
  • Experience with CI/CD tools (e.g., Jenkins) and DevOps tools (GitHub/GitLab, Chef/Puppet) (Required)
  • Experience with Docker, Kubernetes (Required)
  • Experience with APM/observability tools (e.g., Splunk, Grafana, AppDynamics) (Required)
  • Experience troubleshooting distributed systems using logs/metrics/traces (Required)
  • DevOps (Required)
  • Integration (Required)
  • Strong troubleshooting in distributed systems
  • Ability to operate in production environments and respond to incidents
  • Ownership mindset with focus on reliability and continuous improvement
  • At least 18 years of age
  • Legally authorized to work in the United States
  • U.S. citizenship (Required)

Nice to have

  • Experience in cloud environments (Preferred)
  • Experience with high-availability or regulated environments (Preferred)
  • Experience leveraging AI-assisted tools (e.g., Copilot, ChatGPT) (Preferred)
  • Cloud Computing (Preferred)

What we offer

  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Employee stock grants
  • Employee stock purchase plan
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Back-up care
  • Enhanced family support
  • Childcare subsidy
  • Tuition assistance
  • College coaching
  • Short- and long-term disability
  • Voluntary AD&D coverage
  • Voluntary accident coverage
  • Voluntary life insurance
  • Voluntary disability insurance
  • Voluntary long-term care insurance
  • Mobile service & home internet discounts
  • Pet insurance
  • Access to commuter and transit programs
  • Annual bonus or periodic sales incentive/bonus

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Sr Systems Reliability Engineer - Legal Technology

8 matching positions

Sr Site Reliability Engineer, Secure Federal Operations

This role is responsible for designing and implementing secure, scalable, and hi...
Location
Location
United States , Herndon
Salary
Salary:
107300.00 - 193500.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Engineering, Information Technology, or related field plus 3 years of related work experience. Or, advanced degree with 1 year of related experience. Or, combination of education and experience deemed equivalent
  • 4+ years of progressive experience in systems architecture, platform engineering, or site reliability engineering
  • Hands-on experience with Azure and AWS cloud platforms
  • Expertise in Active Directory, DNS, 802.1X, and certificate lifecycle management
  • Strong background in Windows and Linux operating systems
  • Proficiency in TCP/IP networking and network security principles
  • Administration of Microsoft 365 (M365) services (Exchange Online, SharePoint, Teams)
  • US citizenship (without dual citizenship)
  • At least 18 years of age and legally authorized to work in the United States
  • Active security clearance or ability to obtain one
Job Responsibility
Job Responsibility
  • Develop and implement system designs to improve software delivery speed and operational efficiency
  • Lead architecture for cross-domain programs ensuring alignment with enterprise standards
  • Deliver solutions that enhance service availability, scalability, latency, and efficiency
  • Design and deploy solutions on Azure and AWS
  • Build and operate cloud-native platforms (Kubernetes, service mesh, ingress, policy engines)
  • Implement Infrastructure as Code (IaC) for automated deployments
  • Administer Active Directory and integrate with cloud identity solutions
  • Configure 802.1X authentication for secure network access
  • Manage digital certificates lifecycle (issuance, renewal, revocation)
  • Manage DNS, TCP/IP networks, and network segmentation
What we offer
What we offer
  • competitive base salary
  • annual stock grant
  • employee stock purchase plan
  • 401(k)
  • free year-round money coaches
  • medical insurance
  • dental insurance
  • vision insurance
  • flexible spending account
  • paid time off
  • Fulltime
Read More
Arrow Right

Sr Engineers, Systems Reliability

At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees...
Location
Location
United States , Frisco
Salary
Salary:
156998.00 - 165000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master’s degree in Computer and information technology, Electrical and Computer Engineering, or related, and 6 years of relevant work experience
  • Bachelor’s degree in Computer and information technology, Electrical and Communication Engineering, or related, and 8 years of relevant work experience
  • Design, develop, and deliver complex GitLab CI/CD pipelines for enterprise billing platforms
  • Build and administer Kubernetes clusters using Conductor for application lifecycle management, packaging with helm and duck templates for infrastructure automation
  • Develop custom tools in Shell, Perl, YAML, Jython and Python (including Boto3) to support zero-downtime deployments and operations
  • Implement Infrastructure as Code with Terraform and AWS CloudFormation to provision infrastructure across AWS, PCF, Google and Azure cloud platforms
  • Develop AWS Lambda function to migrate historical billing information from RDS to S3
  • Support and administer Skava-based ecommerce platforms, Java/J2EE and REST API’s including deployment, scaling, and operational troubleshooting in production
  • Provision and manage relational and NoSQL databases, including PostgreSQL, MySQL, Oracle, and MongoDB (Atlas) and develop, optimize SQL scripts for billing workflows and for generating monthly consumer and business reports
  • Develop scripts and controls to enforce access management using Azure AD and prevent public exposure of secrets using GitGuardian, T-Vault and CyberArk ensuring compliance with cybersecurity standards
Job Responsibility
Job Responsibility
  • Perform environment management, automated server provisioning, pipeline configuration (VMs)
  • Deliver software to improve the availability, scalability, latency, and efficiency of T-Mobile’s services
  • Craft, manage, and use dashboard for continuous monitoring and health check of applications, and the underlying infrastructure, improve the quality of services using the monitoring feedback for production environment
  • Contribute to future improvement of software delivery processes and operations, e.g., cloud enablement, use of microservices with containerization
  • Relationship and People Management: Mentors/guides other Systems Reliability Engineers, Software Engineers and vendor resources as needed
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Annual bonus or periodic sales incentive or bonus based on role
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off and up to 12 paid holidays
  • Paid parental and family leave
  • Fulltime
Read More
Arrow Right
New

Sr Engineer, Enterprise AI

The Senior Engineer, Enterprise AI helps design, build, and scale AI-powered app...
Location
Location
United States , Bellevue; Frisco
Salary
Salary:
133100.00 - 240100.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4–7 years of experience in software engineering, AI/ML engineering, platform engineering, or enterprise application development
  • Experience building and deploying scalable software applications or platforms in enterprise environments
  • Experience developing AI-enabled applications, intelligent automation workflows, or LLM-powered solutions using modern AI frameworks and tools
  • Experience designing or integrating distributed systems, APIs, enterprise platforms, or cloud-native applications
  • Experience collaborating cross-functionally with engineering, product, architecture, and business teams to deliver production solutions
  • Bachelor's Degree Bachelor’s degree plus 3 years of related work experience OR advanced degree with 1 year of related work experience OR combination of education and experience deemed equivalent
  • At least 18 years of age
  • Legally authorized to work in the United States
Job Responsibility
Job Responsibility
  • Design, develop, deploy, and support enterprise AI applications, autonomous agents, and intelligent workflow solutions that improve employee productivity and operational efficiency
  • Build and optimize Retrieval-Augmented Generation (RAG) pipelines, semantic search capabilities, and AI orchestration workflows integrated with enterprise platforms and data sources
  • Develop scalable integrations and AI-enabled services connecting systems such as Salesforce, ServiceNow, Snowflake, Databricks, GitLab, Atlassian, and other enterprise platforms
  • Partner with engineering, product, architecture, and business stakeholders to deliver secure, reliable, and scalable AI-driven solutions in production environments
  • Apply strong software engineering principles across system design, coding, CI/CD, observability, testing, troubleshooting, and operational support
  • Utilize modern AI development tools and coding assistants to rapidly prototype, iterate, and improve AI-enabled applications and workflows
  • Contribute to AI governance, security, reliability, and compliance standards for enterprise AI deployments
  • Evaluate emerging AI technologies, frameworks, and methodologies to continuously improve platform capabilities and engineering effectiveness
  • Participate in architectural discussions, technical reviews, and engineering best practices to help raise the overall technical maturity of the team
  • Support a highly iterative development environment where rapid experimentation, continuous learning, and fast adaptation are critical to success
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Free year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Up to 12 paid holidays
  • Paid parental and family leave
  • Fulltime
Read More
Arrow Right

Sr. Engineer, Software - EPM

This role is responsible for designing, building, and supporting scalable Oracle...
Location
Location
United States , Bellevue; Frisco
Salary
Salary:
113600.00 - 205000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science, Information Systems, Software Engineering, or related field, or equivalent experience
  • 5+ years of software engineering or EPM Platform experience (Oracle, OneStream)
  • 3+ years of hands-on Oracle EPBCS / Planning Cloud experience
  • Proven experience owning end-to-end solution design for enterprise EPM implementations
  • Demonstrated experience with enterprise metadata management solutions (i.e. Oracle EDMCS)
  • Strong experience developing business rules and calculations within Oracle EPM platforms
  • Experience supporting integrations using OIC, Data Management, Data Exchange, or related technologies
  • Strong analytical, problem-solving, and communication skills
  • Experience collaborating with FP&A, or enterprise business stakeholders
  • Ability to facilitate requirements workshops with FP&A stakeholders and translate outputs into technical design documents
Job Responsibility
Job Responsibility
  • Design, develop, and enhance Oracle EPBCS (Planning Cloud) supporting enterprise planning models across revenue, customers, capex, etc.
  • Lead end-to-end solution design from requirements through deployment, including technical documentation and peer review
  • Build scalable planning applications supporting forecasting, workforce planning, and connected planning capabilities
  • Develop driver-based planning models, scenario analysis, and rolling forecast capabilities
  • Translate FP&A business requirements into scalable, enterprise-grade planning solutions
  • Develop and maintain business rules and calculations using Groovy scripting and native Oracle EPM capabilities
  • Support application performance optimization, cube tuning, metadata management, and usability improvements
  • Contribute to scalable application design, reusable components, and engineering best practices
  • Participate in new capability development and platform expansion initiatives
  • Design and support integrations using Oracle Integration Cloud (OIC), Data Management, and Data Exchange
What we offer
What we offer
  • Medical insurance
  • dental insurance
  • vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • paid holidays
  • paid parental leave
  • Fulltime
Read More
Arrow Right

Sr Engineer, Software - T-Cloud & Enterprise Vault

The Sr Software Engineer - T-Cloud & Enterprise Vault works with a team of other...
Location
Location
United States , Atlanta; Overland Park; Bothell
Salary
Salary:
113600.00 - 205000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree Computer Science or Engineering
  • 4-7 years Technical engineering experience
  • Communication
  • Customer Service
  • Analytics
  • Technical Writing
  • Kubernetes, Ansible, Terraform, Gitlab, scripting knowledge
  • At least 18 years of age
  • Legally authorized to work in the United States
Job Responsibility
Job Responsibility
  • Drives engineering projects by developing software solutions
  • conducting tests and inspections
  • preparing reports and calculations
  • Expected to supervise base and associate level engineers as needed
  • Understands system protocols, how systems operate and data flows
  • Expected to independently develop a full software stack
  • Interact with system engineers to define system requirement and/or necessary requirements for automation
  • Utilizes fluent knowledge and skill in emerging DevOps-centric automation tools and technologies for CICD, configuration management, etc. for non-prod environments
  • Contributes to designs to implement new ideas which utilize new frameworks to improve an existing or new system/process/service
  • Review existing designs and processes to highlight more efficient ways to complete existing workload more effectively through industry perspectives
What we offer
What we offer
  • competitive base salary and compensation package
  • annual stock grant
  • employee stock purchase plan
  • 401(k)
  • access to free, year-round money coaches
  • medical, dental and vision insurance
  • flexible spending account
  • employee stock grants
  • employee stock purchase plan
  • paid time off
  • Fulltime
Read More
Arrow Right

Sr. Service Engineer

Take ownership of Microsoft 365 service operations in sovereign cloud environmen...
Location
Location
United States , Multiple Locations
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Mechanical Engineering, Electrical Engineering, Aerospace Engineering, Data Science, Cybersecurity, or related field AND 4+ years technical experience in software engineering, network engineering, service engineering, systems engineering, or industrial controls OR equivalent experience
  • Hands-on experience supporting complex IT environments, with a strong understanding of system and service management challenges
  • Experience operating in large distributed or air-gapped environments, with a focus on reliability, security, and compliance
  • Ability to build consensus and influence across teams to achieve common goals
  • Recent experience with Azure or equivalent hyperscale cloud technologies
  • Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role, including an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph
  • This position requires verification of U.S. citizenship due to citizenship-based legal restrictions
Job Responsibility
Job Responsibility
  • Responds to incidents during regular on-call rotations, including complex incidents with major customer or business impact, by identifying the level of impact, troubleshooting, contributing to difficult decisions based on business impact, deploying appropriate fixes to resolve root cause(s), and implementing automations for prevention of recurring incidents through coordinating resources required for incident resolution, which may include product teams, owners, leadership, other engineering teams, and/or subject matter experts
  • Creates, monitors, and takes action on telemetry data and influences telemetry analytics to better identify patterns that reveal errors and unexpected problems that are affecting the system's availability, reliability, performance, and/or efficiency
  • Independently implements reliable, scalable, and high-performance solutions across teams
  • Leverages advanced technical expertise, judgment, and decision making to coordinate multiple work streams and resources in crisis situations to drive mitigation plan and resolve, reduce, or mitigate the impact of a crisis
  • Collaborates within and across teams by proactively and systematically sharing information with an appropriate level of detail for their audience
  • Shares insights and best practices that can be applied to improve development and operations across related sets of the systems, services, platforms, and/or products
  • Monitors and maintains security by addressing security vulnerabilities through patches, reconfigurations, and/or settings updates
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer

This role is essential for designing, implementing, and deploying scalable softw...
Location
Location
United States , Atlanta; Overland Park; Frisco
Salary
Salary:
Not provided
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree plus 5 years of related work experience OR Advanced degree with 3 years of related experience
  • Acceptable areas of study include Computer Science, Software Engineering, Information Management or equivalent experience in field
  • 4-7 years Technical engineering experience
  • Communication
  • Customer Service
  • Analytics
  • Technical Writing
  • Analytical Thinking
  • Collaboration
  • Mentorship
Job Responsibility
Job Responsibility
  • Develop software solutions and conduct tests to drive engineering projects and ensure quality deliverables
  • Design, develop, test, and deploy scalable backend services, APIs, and microservices supporting subscription management and customer lifecycle workflows
  • Contribute to design innovations that improve systems, processes, or services using new frameworks and industry best practices
  • Collaborate with technical teams, product managers, architects, QA engineers, DevOps, and managed service partners to deliver reliable, secure, and scalable software solutions
  • Participate in system design discussions, code reviews, and technical planning activities to improve software quality, platform reliability, and scalability
  • Troubleshoot software issues, analyze root causes, and implement sustainable solutions to improve system performance and stability
  • Support technology strategy by evaluating and applying current technologies that align with business goals
  • Create clear documentation for software code, system designs, APIs, operational processes, and business requirements to support knowledge sharing
  • Mentor others through knowledge sharing, code review participation, and training sessions aligned to engineering best practices
  • Also responsible for other duties/projects as assigned by business management as needed
What we offer
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off and up to 12 paid holidays
  • Paid parental and family leave
  • Family building benefits
  • Fulltime
Read More
Arrow Right

Sr Software Engineer, Agentic AI

This role is responsible for designing, developing, and deploying scalable softw...
Location
Location
United States , Atlanta; Bellevue
Salary
Salary:
113600.00 - 205000.00 USD / Year
https://www.t-mobile.com Logo
T-Mobile
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree plus 5 years of related work experience OR Advanced degree with 3 years of related experience
  • Acceptable areas of study include Computer Science, Software Engineering, Information Management or equivalent experience in field
  • 4-7 years Technical engineering experience
  • Strong communication, collaboration, and customer-focused problem-solving skills
  • Strong analytical, troubleshooting, and technical documentation abilities
  • Experience developing scalable software applications and AI-enabled services using Python, Java, or C++
  • Experience with cloud-native distributed systems, APIs, microservices, and real-time integration platforms
  • Familiarity with LLMs, conversational AI, agentic AI workflows, and AI orchestration frameworks
  • Experience building AI-driven automation, RAG solutions, and enterprise AI integrations
  • Understanding of scalability, reliability, observability, and secure software engineering best practices for production AI systems
Job Responsibility
Job Responsibility
  • Design, develop, test, and deploy scalable software and Agentic AI solutions to support enterprise automation, intelligent workflows, and customer engagement platforms
  • Build and enhance AI-enabled applications, backend services, APIs, and integration components using modern software engineering and cloud-native best practices
  • Develop and implement multi-step AI workflows, orchestration logic, and agent-based systems leveraging LLMs, RAG architectures, and AI automation frameworks
  • Contribute to the design of microservices and distributed systems supporting real-time voice and text-based customer interactions at scale
  • Collaborate with cross-functional engineering, AI, platform, and product teams to deliver secure, reliable, and high-performing AI-driven solutions
  • Evaluate emerging AI technologies, frameworks, and engineering practices to support innovation and align with business and technology strategy
  • Implement AI reliability, monitoring, observability, security, and governance best practices, including guardrails and human-in-the-loop workflows
  • Create and maintain technical documentation for software solutions, AI workflows, APIs, system architecture, and operational processes
  • Mentor team members through technical guidance, code reviews, knowledge sharing, and adoption of AI engineering best practices
  • Support continuous improvement initiatives, operational excellence, and other engineering projects as assigned by business leadership
What we offer
What we offer
  • Medical, dental and vision insurance
  • flexible spending account
  • 401(k)
  • employee stock grants
  • employee stock purchase plan
  • paid time off and up to 12 paid holidays
  • paid parental and family leave
  • family building benefits
  • back-up care
  • enhanced family support
  • Fulltime
Read More
Arrow Right