This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Platform Engineer is essential for designing and optimizing infrastructure that supports internal services and platforms within the organization. It involves building resilient and scalable systems to ensure robust infrastructure and enterprise-wide governance. The role focuses on implementing continuous integration and continuous deployment (CI/CD) pipelines and enhancing data platform interoperability. Success is measured by system reliability, efficient software deployment, and seamless technology migrations and integrations. The work impacts the organization by enabling adaptability and operational excellence in a dynamic technological environment.
Job Responsibility:
Designs and develops resilient and scalable infrastructure systems
Own day-to-day operations of hybrid infrastructure supporting T-Mobile’s platform services
Ensure uptime, performance, and security across on-prem, AWS, and Azure environments
Troubleshoot complex infrastructure, configuration, and deployment issues impacting platform reliability
Lead patching, updates, and configuration management with minimal oversight
Participate in on-call rotation and drive improvements in incident response and postmortem practices
Optimizes existing infrastructure to enhance functionality and interoperability
Ensure system reliability, performance, and security through proactive monitoring, automation, and performance tuning
Troubleshoot complex platform and application integration issues impacting performance or availability
Develop, execute, and tune DML logic — queries, data migrations, transformations, and batch operations — for performance and reliability
Drive incident analysis and reliability reviews to improve operational posture and system resilience
Implements and maintains CI/CD pipelines for efficient software deployment
Support containerization and orchestration technologies, including Docker and Kubernetes, to standardize deployment practices
Design, develop, and maintain automation using Python, Bash, or PowerShell to increase operational efficiency
Implement and manage Infrastructure-as-Code (IaC) using Terraform, Ansible, or equivalent frameworks
Build and maintain CI/CD pipelines for platform infrastructure and environment deployments (GitLab CI/CD, Jenkins)
Establish documentation standards and reusable modules for consistent automation delivery
Collaborates within Agile teams to drive continuous improvement and operational excellence
Contribute to Agile ceremonies, driving infrastructure readiness and delivery excellence
Advocate for DevOps and automation best practices across engineering teams
Participate in on-call rotations and lead incident resolution and root cause analysis
Lead efforts in automation for self-healing, scaling, and performance tuning
Facilitates seamless migrations and integrations across different technologies
Develop and manage observability solutions leveraging Prometheus, Grafana, CloudWatch, Azure Monitor, or ELK
Implement proactive monitoring, log analysis, and metrics-based alerting for early issue detection
Collaborate with SREs to improve mean time to resolution (MTTR) and overall platform reliability
Execute and optimize DML operations within Postgres, Oracle, and Cassandra environments under existing organizational DDL structures
Develop and maintain integrations with Kafka for event-driven data pipelines, message publishing, and asynchronous workloads
Implement caching strategies (Redis, in-memory caches) to reduce query latency and improve application performance
Support database performance tuning, data ingestion, and transformation activities aligned with enterprise governance standards
Partner across engineering, DevOps, and data teams to design scalable solutions and resolve complex issues
Requirements:
Bachelor's Degree plus 2 years of related work experience OR combination of education and experience deemed equivalent
Acceptable areas of study include Computer Science, Data Science or Related Field (Preferred)
2-4+ years - Designing and developing scalable and resilient infrastructure systems
2-4+ years - Implementing and managing CI/CD pipelines for enterprise applications
2-4+ years - Integrating and optimizing data platforms for interoperability across various technologies
Hands-on development experience using Python, Bash, or PowerShell
Experience with observability solutions leveraging Prometheus, Grafana, CloudWatch, Azure Monitor, or ELK
At least 18 years of age
Legally authorized to work in the United States
Nice to have:
Certified Kubernetes Administrator (CKA)
AWS Certified Solutions Architect
Certified Information Systems Security Professional (CISSP)