This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At T-Mobile, we invest in YOU! Our Total Rewards Package ensures that employees get the same big love we give our customers. All team members receive a competitive base salary and compensation package - this is Total Rewards. Employees enjoy multiple wealth-building opportunities through our annual stock grant, employee stock purchase plan, 401(k), and access to free, year-round money coaches. That’s how we’re UNSTOPPABLE for our employees! T-Mobile is America’s supercharged Un-carrier, delivering an advanced 4G LTE and transformative nationwide 5G network that will offer reliable connectivity for all. Sr Engineers, Systems Reliability is located in Frisco, TX and will utilize proficient knowledge and skill in emerging DevOps-centric automation tools and technologies for CICD, configuration management, etc. for production environments.
Job Responsibility:
Perform environment management, automated server provisioning, pipeline configuration (VMs)
Deliver software to improve the availability, scalability, latency, and efficiency of T-Mobile’s services
Craft, manage, and use dashboard for continuous monitoring and health check of applications, and the underlying infrastructure, improve the quality of services using the monitoring feedback for production environment
Contribute to future improvement of software delivery processes and operations, e.g., cloud enablement, use of microservices with containerization
Relationship and People Management: Mentors/guides other Systems Reliability Engineers, Software Engineers and vendor resources as needed
Requirements:
Master’s degree in Computer and information technology, Electrical and Computer Engineering, or related, and 6 years of relevant work experience
Bachelor’s degree in Computer and information technology, Electrical and Communication Engineering, or related, and 8 years of relevant work experience
Design, develop, and deliver complex GitLab CI/CD pipelines for enterprise billing platforms
Build and administer Kubernetes clusters using Conductor for application lifecycle management, packaging with helm and duck templates for infrastructure automation
Develop custom tools in Shell, Perl, YAML, Jython and Python (including Boto3) to support zero-downtime deployments and operations
Implement Infrastructure as Code with Terraform and AWS CloudFormation to provision infrastructure across AWS, PCF, Google and Azure cloud platforms
Develop AWS Lambda function to migrate historical billing information from RDS to S3
Support and administer Skava-based ecommerce platforms, Java/J2EE and REST API’s including deployment, scaling, and operational troubleshooting in production
Provision and manage relational and NoSQL databases, including PostgreSQL, MySQL, Oracle, and MongoDB (Atlas) and develop, optimize SQL scripts for billing workflows and for generating monthly consumer and business reports
Develop scripts and controls to enforce access management using Azure AD and prevent public exposure of secrets using GitGuardian, T-Vault and CyberArk ensuring compliance with cybersecurity standards
Automate Windows system administration and deployment processes using PowerShell, create and maintain Power BI reports and dashboards
Expert-level experience in implementing and managing observability platforms like Splunk, AppDynamics, and Grafana, with a focus on developing real-time dashboards and actionable alerts for microservice health, API latency, and system fault detection
At least 18 years of age
Legally authorized to work in the United States
What we offer:
Competitive base salary and compensation package
Annual stock grant
Employee stock purchase plan
401(k)
Access to free, year-round money coaches
Annual bonus or periodic sales incentive or bonus based on role