CrawlJobs Logo

Staff Software Engineer, Cloud Infrastructure

Singapore, Singapore · Job Posted February 21, 2026
Apply Position
Job Link Share

Job Description

The Airwallex Cloud Infrastructure team is a group of highly skilled, innovative, and collaborative professionals. The team brings together experts in Kubernetes, service mesh, cloud platforms, and core services, all working seamlessly to ensure the highest levels of performance, reliability, and security. We are looking for a Staff Software Engineer to join the team to create and maintain a cutting-edge, secure, and scalable infrastructure that supports our cross-border payment solutions. We build and run the cloud systems that keep Airwallex fast, safe, and always available. Our mission? Make sure everything just works—no matter how big we grow.

Job Responsibility

  • Create and maintain a cutting-edge, secure, and scalable infrastructure that supports our cross-border payment solutions
  • Build and run the cloud systems that keep Airwallex fast, safe, and always available
  • Make our systems secure, reliable, and easy for our engineers to use
  • Help us scale—globally
  • Automate everything
  • Work with awesome teammates in engineering, product, and security

Requirements

  • 8+ years working with large, distributed systems
  • Solid understanding of RESTful API design principles and patterns
  • Deep experience with public cloud (GCP, AWS, or Azure), Infrastructure as Code, and Kubernetes
  • Experience with Istio service mesh
  • Ability to write services in Golang(preferred) or Python
  • Bachelor’s degree (or higher) in Computer Science or similar field
  • Strong security skills and hands-on experience with security best practices
  • Strong communication and collaboration skills

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Staff Software Engineer, Cloud Infrastructure

8 matching positions

New

Senior Staff Engineer Software (Cloud Platform, Production & Reliability – Machine Identity Security)

The Production Engineering team is responsible for building, scaling, and operat...
Location
Location
United States , Santa Clara
Salary
Salary:
126000.00 - 203500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Platform Engineering, or Site Reliability Engineering (SRE)
  • Strong experience designing and operating cloud infrastructure on AWS, Azure, or GCP
  • Deep expertise managing and scaling Kubernetes environments (EKS, AKS, or GKE)
  • Strong experience with Infrastructure as Code tools (Terraform, Ansible, or Pulumi)
  • Proven experience designing and maintaining complex CI/CD systems (Jenkins, GitLab CI, ArgoCD, GitHub Actions)
  • Strong programming/scripting skills (Python, Go, or similar) for automation and tooling
  • Experience operating in high-scale, 24/7 production environments with ownership of incident response and reliability
  • Solid understanding of Linux systems and networking fundamentals (DNS, TCP/IP, load balancing, VPC, mTLS)
  • Strong problem-solving skills and ability to work across teams
Job Responsibility
Job Responsibility
  • Design, build, and evolve highly available cloud infrastructure platforms with a focus on scalability, resilience, and reliability
  • Lead improvements across production systems, including performance, availability, and incident response
  • Drive and standardize Infrastructure as Code (IaC) practices to improve consistency and reduce operational overhead
  • Design and optimize CI/CD pipelines to support fast, secure, and reliable software delivery at scale
  • Partner with development teams to improve system reliability, observability, and cloud-native design patterns
  • Define and implement monitoring, alerting, and observability strategies across distributed systems
  • Lead incident response efforts, including root cause analysis and long-term remediation strategies
  • Identify and eliminate operational toil through automation and system improvements
  • Mentor engineers and contribute to raising the bar for production engineering practices
What we offer
What we offer
  • restricted stock units
  • bonus
  • Fulltime
Read More
Arrow Right

Staff Infrastructure Software Engineer, Enterprise AI

Scale GP is building the next generation of enterprise-grade Generative AI produ...
Location
Location
United States , New York; San Francisco
Salary
Salary:
216200.00 - 270250.00 USD / Year
scale.com Logo
Scale
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience in a senior role
  • 5+ years of full-time software engineering experience
  • Deep understanding of modern infrastructure practices, including CI/CD, IaC (e.g., Terraform, Helm Charts), container orchestration (e.g., Kubernetes) and observability platforms (e.g., Datadog, Prometheus, Grafana)
  • Extensive experience with at least one major cloud provider (AWS, Azure, or GCP)
  • Strong knowledge of security and compliance in enterprise environments, with a focus on access management, data isolation, and customer-specific VPC setups
  • Proficiency in Python or JavaScript/TypeScript, and SQL
Job Responsibility
Job Responsibility
  • Define the architectural patterns for our multi-cloud infrastructure to support secure, reliable, and scalable Agentic workflows for enterprise customers
  • Lead the infrastructure roadmap with a strong focus on compliance, privacy, and security standards, including designing change management and data isolation strategies
  • Own the development and maintenance of our best-in-class Agentic observability platform (logging, metrics, tracing, and analytics) to proactively ensure system health and enable rapid incident response
  • Drive developer efficiency by building automated tooling and championing Infrastructure-as-Code (IaC) paradigms throughout the engineering organization
  • Solve the toughest engineering problems related to multi-tenancy, data isolation, and high-performance inference at a massive scale, taking end-to-end ownership across the full product lifecycle
What we offer
What we offer
  • Comprehensive health, dental and vision coverage
  • retirement benefits
  • a learning and development stipend
  • generous PTO
  • equity based compensation
  • additional benefits such as a commuter stipend
  • Fulltime
Read More
Arrow Right

Staff Infrastructure Software Engineer - AI Platform

We are currently seeking a Staff Software Engineer to join the AI Platform team ...
Location
Location
United Kingdom , Edinburgh
Salary
Salary:
Not provided
addepar.com Logo
Addepar
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Extensive experience as a Software/Backend Engineer, with a track record of taking on increasing responsibility
  • Experience across the full product lifecycle: designing, implementing, shipping, scaling, operationalizing, and maintaining technology/SaaS products
  • Exceptional Programming skills and fundamentals in Python/Go/Java, with a proven track record of building large scale production systems
  • Proficient experience with diverse compute environments including microservices (K8s), Databricks and serverless architectures (e.g. AWS Lambda)
  • Demonstrable experience leading initiatives with infrastructure-as-code tools such as Terraform in complex, multi-account environments
  • Proficient experience with comprehensive monitoring and alerting stacks (e.g. Prometheus/Grafana/Sentry/cloud-native tools), with a focus on observability strategy
  • Excellent interpersonal and communication skills to effectively collaborate with multi-functional teams, articulate complex technical concepts, and influence outcomes
Job Responsibility
Job Responsibility
  • Design and build the production runtime for LLM-based agents and products, creating the services and infrastructure that serve autonomous agents
  • Develop deep application-level knowledge to proactively inform and influence requirements, constraints and best practices for implementing composable, complex AI systems
  • Lead the design, implementation, and automation of production infrastructure on a variety of cloud environments (Kubernetes/Databricks), to enable us to ship and scale AI features instantly
  • Evangelize and promote disciplined, best engineering practices to enforce strong production hygiene and culture
  • Initiate and lead collaborations with cross-functional teams to identify and resolve complex application or infrastructure issues, serving as a technical subject matter expert
  • Architect, build, and maintain advanced, automated CI/CD pipelines e.g. using Jenkins, ArgoCD, AWS CodeBuild/Pipeline, GitHub Actions, or similar, establishing best practices for deployment strategies (e.g., blue/green, canary)
  • Develop systems and best practices monitoring, alerting, and troubleshooting of our probabilistic and AI-driven systems and broader software stack
Read More
Arrow Right

Staff+ Software Engineer - Cloud Availability Platform Engineering

Be a part of the AI revolution with sustainable technology at Crusoe. Here, you'...
Location
Location
United States , San Francisco; Sunnyvale
Salary
Salary:
209000.00 - 253000.00 USD / Year
crusoe.ai Logo
Crusoe
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in Computer Science or Software Engineering
  • 10+ years of relevant experience
  • 10+ years of experience building and operating distributed systems at scale
  • Proven experience with building reliable, scalable, efficient, and secure cloud platforms and systems and effectively running them in production environments
  • Fluency in programming languages such as Go, Rust, Java or C++
  • A collaborative approach (platform mindset) to working with development and operations teams
  • Understanding of cloud security best practices and the ability to implement secure configurations
  • Excellent troubleshooting and problem-solving skills
  • Excellent communication skills
  • Embody the Company values
Job Responsibility
Job Responsibility
  • Architect, design, and develop Cloud Infrastructure management systems and platforms
  • Deliver E2E use cases and workflows for a vertically integrated AI-First Crusoe Cloud
  • Build systems and platforms to efficiently plan, monitor, deploy and operate Crusoe Cloud
  • Evaluate and hands-on implement and build platforms, tools, and frameworks
  • Streamline infrastructure planning and management processes and workflows
  • Develop and refine technical designs and architecture
  • Mentor fellow engineers
  • Actively contribute to team growth
  • Collaborate extensively across teams to architect, design, implement physical infrastructure management software systems, availability platforms, and frameworks
  • Champion the reliability, scalability, and security of our systems and platforms
What we offer
What we offer
  • Restricted Stock Units
  • Health insurance package options that include HDHP and PPO, vision, and dental for you and your dependents
  • Employer contributions to HSA accounts
  • Paid Parental Leave
  • Paid life insurance, short-term and long-term disability
  • Teladoc
  • 401(k) with a 100% match up to 4% of salary
  • Generous paid time off and holiday schedule
  • Cell phone reimbursement
  • Tuition reimbursement
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Cloud Capacity

The Cloud Capacity team plays a critical role in ensuring the Temporal Cloud is ...
Location
Location
United States
Salary
Salary:
170000.00 - 250000.00 USD / Year
temporal.io Logo
Temporal
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience contributing to large-scale infrastructure efforts spanning cloud compute, storage, and networking systems
  • Strong product and operational intuition around managing cloud costs, utilization tracking, and workload forecasting
  • A track record of designing distributed systems and services in a production cloud environment (preferably AWS, GCP, or Azure)
  • Hands-on experience with container orchestration technologies (e.g., Kubernetes) and the surrounding ecosystem
  • Exceptional collaboration and communication skills
  • Comfortable aligning cross-functional stakeholders on complex infrastructure problems, including executives and finance partners
  • 6+ years of experience building production software using Go, Java, or similar languages
Job Responsibility
Job Responsibility
  • Drive the technical vision and roadmap for Temporal’s Cloud Capacity systems in partnership with engineering and product leadership
  • Design and implement infrastructure to track resource utilization, forecast consumption, and support automated capacity planning at scale
  • Lead development of a resource manager that optimizes infrastructure efficiency based on usage trends, cost insights, and evolving customer needs
  • Collaborate cross-functionally with Product, Cloud Infrastructure, and Finance to inform business-critical decisions around provisioning, pricing, and scaling
  • Guide long-term strategy to support intelligent autoscaling, workload isolation, and predictable performance in a multi-tenant cloud environment
What we offer
What we offer
  • Unlimited PTO, 12 Holidays + 2 Floating Holidays
  • 100% Premiums Coverage for Medical, Dental, and Vision
  • AD&D, LT & ST Disability, and Life Insurance (Standard & Supplemental Available)
  • Empower 401K Plan
  • Additional Perks for Learning & Development, Lifestyle Spending, In-Home Office Setup, Professional Memberships, WFH Meals, Internet Stipend and more
  • $3,600 / Year Work from Home Meals
  • $1,500 / Year Career Development & Learning
  • $1,200 / Year Lifestyle Spending Account
  • $1,000 / Year In-Home Office Setup (In addition to Temporal issued equipment)
  • $500 / Year Professional Memberships
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Infrastructure

We’re looking for early members of our software engineering infrastructure team....
Location
Location
United States , Boston, NYC
Salary
Salary:
170000.00 - 240000.00 USD / Year
suno.ai Logo
Suno
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years infrastructure experience preferred
  • Experience with cloud services (AWS/GCP), Kubernetes, Docker, and infrastructure as code (Pulumi/Terraform/CDK)
  • Experience scaling infrastructure from 0 to 1
  • Strong understanding of Postgres, distributed relational database, large scale database hosting a plus
  • Strong backend skills to help optimize application and service code
  • Strong understanding of security best practices in building/scaling infrastructure
  • An obsession with engineering excellence, iterating & learning rapidly, and working hard
  • Applicants must be eligible to work in the US
Job Responsibility
Job Responsibility
  • Architect and build services to handle massive consumer traffic, data, and usage
  • Design systems that are performant, secure, scalable, and easy to observe
  • Lead by example on operational and software engineering excellence
What we offer
What we offer
  • Company Equity Package
  • 401(k) with 3% Employer Match & Roth 401(k)
  • Medical, Dental, & Vision Insurance (PPO w/ HSA & FSA options)
  • 11 Paid Holidays + Unlimited PTO & Sick Time
  • 16 Weeks of Paid Parental Leave
  • Creative Education Stipend
  • Generous Commuter Allowance
  • In-Office Lunch (5 days per week)
  • Fulltime
Read More
Arrow Right

Staff Software Engineer, Core Infrastructure

Our Core Infrastructure team in Aarhus is at the forefront of building and scali...
Location
Location
Denmark , Aarhus
Salary
Salary:
Not provided
uber.com Logo
Uber
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience in backend software development with distributed systems, infrastructure, or cloud platforms
  • Strong expertise in Go, Java, or similar backend languages, with a deep understanding of Kubernetes, cloud infrastructure, and high-scale systems
  • Experience leading cross-team or team-wide projects focused on system modernization, performance optimizations, and deployment safety improvements
  • Experience designing and implementing highly available, efficient, and secure cloud-native/kubernetes architectures
  • Deep understanding of safe deployment strategies, workload automation, and resilience engineering
  • Strong experience in scaling autoscaling solutions, ARM adoption, hybrid cloud, or GPU support for ML workloads
  • Ability to lead complex, cross-team engineering projects and build strategic relationships with stakeholders across platform, security, and infrastructure teams
Job Responsibility
Job Responsibility
  • Design and implement backend infrastructure components to support Uber’s growing workloads, including deployment engines, autoscalers, and hybrid cloud environments
  • Lead cross-team projects focused on safe deployment and rollback automation across stateless, stateful, and batch workloads, improving resilience and developer efficiency
  • Improve infrastructure security and compliance, including encryption-at-rest, ransomware mitigation, and cloud security best practices
  • Contribute to and drive modernization efforts within the team and across related teams, including Kubernetes migration, unified workload platforms, and PaaS improvements
  • Optimize Uber’s infrastructure efficiency, focusing on ARM adoption, autoscaling enhancements, and cost-effective compute allocation
  • Proactively mentor other engineers and help define the technical direction for your team, ensuring Uber’s backend infrastructure remains reliable, scalable, and efficient
  • Fulltime
Read More
Arrow Right

Sr. Staff Engineer Software (Strata Cloud Control Plane)

Help build what is next. Our Cloud Management Platform is a public cloud deliver...
Location
Location
United States , Santa Clara
Salary
Salary:
126000.00 - 204500.00 USD / Year
paloaltonetworks.com Logo
Palo Alto Networks
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in building large enterprise applications
  • Proven ability to lead and collaborate with many cross-functional teams with an emphasis on end-to-end delivery
  • Experience in the full software development life cycle, including coding standards, code reviews, source control management, build processes, testing, peer review, and operations
  • Excellent programming skills in GoLang is a must
  • Strong fundamentals in object oriented design and development is a must
  • Experience with Test Driven Development and Continuous Integration is required
  • Experience developing microservice based solutions on public cloud infrastructure is highly desirable
  • Experience building data management solutions using transactional data stores is required
  • MS/BS in Computer Science or equivalent
Job Responsibility
Job Responsibility
  • Conceptualize, Collaborate and Develop highly scalable cloud control plane for managing services at scale in hybrid deployments
  • Hands-on participation in developing next generation configuration management architecture
  • Technical leadership and end-to-end delivery of solutions in collaboration with cross-functional product management, development and quality assurance teams in a fast paced environment
  • Deliberate and build frameworks to improve quality of micro services
  • Work with Devops and Technical Support teams to investigate and resolve critical customer defects
  • Recruit and Mentor new team members
What we offer
What we offer
  • Restricted stock units and a bonus
  • Employee benefits described via hyperlink
  • Fulltime
Read More
Arrow Right