We are looking for an experienced IT leader to oversee cloud infrastructure, DevOps practices, and production reliability for SaaS platforms in Jacksonville, Florida. This role will work closely with engineering teams to strengthen system performance, support scalable growth, and maintain secure, highly available operations. The ideal candidate brings hands-on technical depth, sound leadership judgment, and a strong focus on automation, service stability, and continuous improvement.
Job Responsibilities
Lead cloud infrastructure and DevOps operations to maintain dependable, scalable, and secure SaaS environments
Partner with software and R&D teams to coordinate releases, platform upgrades, capacity forecasting, and performance improvements
Direct 24/7 production support activities, including system monitoring, incident response, escalation management, and service recovery efforts
Expand automation across deployment and maintenance workflows by promoting CI/CD practices and infrastructure-as-code methods
Implement and uphold operational security measures such as patching, access governance, system hardening, compliance readiness, and audit support
Improve observability by enhancing monitoring, alerting, and reliability processes that help reduce downtime and manual intervention
Guide and mentor technical team members, manage on-call participation, and build a culture centered on accountability and operational excellence
Oversee core infrastructure and platform components, including web, database, caching, messaging, backup, network, and configuration management technologies
Requirements
Bachelor's degree in Computer Science or a related technical discipline
3–5 years of hands-on cloud operations experience, including at least 1 year leading projects or supervising technical teams
Practical experience supporting Kubernetes in production, with the ability to diagnose and resolve common platform issues independently
Strong working knowledge of middleware and infrastructure components such as Nginx, MySQL, Redis, and Kafka, including configuration and performance tuning
Proficiency in Shell or Python scripting for automation, administration, and operational support tasks
Familiarity with infrastructure-as-code and cloud automation tools such as Terraform or CloudFormation, along with CI/CD platforms like Jenkins or GitHub Actions
Experience with enterprise infrastructure technologies, including Active Directory, backup solutions, Cisco environments, computer hardware, and configuration management tools
Fluency in Mandarin Chinese and strong English communication skills for effective collaboration across global and local teams
What we offer
Medical, vision, dental, life, and disability insurance