This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The System Architecture Engineer's role is to develop and evolve technical network and service architectures and design strategies to maintain, enhance, and evolve the T-Mobile Technology portfolio. This role is responsible to improves and protect the software, infrastructure, and network systems that power T-Mobile’s IT and customer-facing services. This role ensures scalability, availability, performance, security, and reliability across applications and networks, with a strong focus on proactively identifying and preventing network issues before they impact customers. The engineer will play a critical role in outage bridges, leveraging KPIs, telemetry, and AI-driven analytics to pinpoint problems with speed and accuracy. From designing & maintaining CI/CD pipelines to operating large-scale network infrastructure (Cisco, Juniper, Check Point, F5, Infoblox, A10), the SNRE enables great customer experiences through automation, observability, and innovation.
Job Responsibility:
Develop and evolve technical network and service architectures and design strategies
Improve and protect the software, infrastructure, and network systems that power T-Mobile’s IT and customer-facing services
Ensure scalability, availability, performance, security, and reliability across applications and networks
Proactively identify and prevent network issues before they impact customers
Play a critical role in outage bridges, leveraging KPIs, telemetry, and AI-driven analytics to pinpoint problems
Create new designs, architectures, and standards for delivering software and network services
Improve scalability, latency, and efficiency of T-Mobile’s applications and network services
Contribute to cloud enablement, containerization, and microservices reliability
Manage improvement work, PoCs, and future automation projects
Diagnose and resolve complex issues in routers, firewalls, load balancers, DNS, and global traffic managers
Proactively forecast and address capacity and performance bottlenecks
Provide network expertise on outage bridges
Define standards and playbooks for fault prevention and outage handling
Build and operationalize telemetry pipelines
Develop AI/ML-driven models to detect anomalies and predict failures
Create and maintain dashboards and health checks for real-time visibility
Use Terraform and IaC to deploy and manage infrastructure and applications in AWS cloud
Integrate reliability practices into CI/CD pipelines and DevOps workflows
Define, measure, and optimize performance KPIs, SLIs, and SLOs across hybrid environments
Lead engineering projects and team members
Engage with vendor(s) or industry fellows to innovate procedures and processes
Create, present, design, and implement new ideas
Proactively investigate architecture, design strategies and standards options
Contribute to industry or vendor forums
Analyze industry trends and competitive moves
Write advanced documentation, architecture, capabilities, limitations, and advantages for technologies
Monitor and influence relevant industry technologies and standards
Present highly technical concepts
Continuously learn, create content, and facilitate training
Informally coach and contribute to the development of others
Influence and recommend technology and policy decisions
Research and create new technology options to drive business transformation
Requirements:
Master’s/Advanced degree in Computer Science, Engineering, or related field. Equivalent experience considered
7–10 years in system, network, or reliability engineering roles
Deep expertise in network infrastructure (Cisco, Juniper, Check Point, F5, A10, Infoblox, BIND, DNS)