Help us scale: Socket ships daily to an ever-growing group of customers. As our customer base grows, we’re building features and systems at lightning speed, and we need to ensure that our infrastructure and tooling meet that demand. You’ll be a critical part of the team that supports this growth.
Grow the team and culture: As an early member of the team, you will form the defining DNA for the company's culture and our future team. Our ability to build a market-defining product is solely dependent on the culture we foster.
Set up foundational frameworks: You'll join at the genesis of something totally new and come into a fast-paced environment. We value process-driven systems that enable us to work smarter as we scale, and you'll build out systems that will serve as guardrails for the engineering team.
Job Responsibilities:
Partner closely with our engineers to debug production issues, improve performance, and design systems that scale reliably
Own and evolve Socket’s infrastructure, with a focus on reliability, performance, and cost as we scale
Help define and evolve SLIs and SLOs for new and existing systems, turning reliability into something that can be measured and improved
Debug, maintain, and improve our deployment pipeline, including addressing failures in production and driving meaningful improvements over time
Build and maintain observability across our systems (metrics, logs, traces) to support faster detection and resolution of issues
Participate in an on-call rotation and drive incident reviews with an emphasis on concrete follow-ups and system improvements
Requirements:
5+ years of software development experience
1+ year in a DevOps or SRE role
Experience scaling and operating production web applications, preferably in a TypeScript/Node.js environment
Strong knowledge of relational databases, with Postgres preferred
Hands-on experience building and using observability systems (Prometheus/Mimir, OpenTelemetry, Grafana)
Experience with container orchestration (Docker, Kubernetes)
Practical experience managing infrastructure-as-code with Terraform
Experience running systems in a cloud environment, with GCP preferred
Experience building and maintaining CI/CD pipelines (e.g. GitHub Actions)
What we offer:
Market-competitive salary bands
Meaningful equity program
Comprehensive health benefits for you and your family
Flexible time-off, holidays, and winter shutdown to rest & recharge