This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
We’re looking for a Backend Engineer to join Team Events and help us build and enhance foundational event & queuing services that power fantastic products at Zapier. Our primary focus is to build and support our Events system at Zapier as well as other queueing solutions used by our internal teams at Zapier. Our services and components are critical, run at scale, and operate as a foundation to all of Zapier.
Job Responsibility:
Work with AWS services like MSK, SQS, Redis, S3, Lambda and Aurora to build scalable solutions that process billions of events per day
Use Terraform to maintain and build our infrastructure
Build toolkits, libraries, and scripts to ease challenges faced by other teams at Zapier when they wish to emit to and consume from the Events system as well as other queue solutions we are currently working on building
Contribute to data governance practices across Zapier
Influence proper data structure and data hygiene
Refactor or improve existing code as languages, frameworks, or techniques evolve
Help the team pick appropriate tools to solve new problems as they arise
Provide feedback on tools, processes, and documentation in place to help us become a better, more effective organization
Work with your colleagues to develop new skills, through code review, discussions and mentoring
Participate in on-call rotations to ensure the reliability and availability of our systems, providing timely and effective support when issues arise
Requirements:
4+ years with software development in either Python, Go, Typescript
At least 2 years focused on building event / streaming systems at scale
Experience working with event architectures and services based on technologies like Kafka (MSK) and Avro
Supported event-system infrastructure to ensure resiliency and uptime
Participated in the design or maintenance of highly available, cloud-based infrastructure in AWS or another cloud provider
Understand how to leverage infrastructure-as-code tools (Terraform) and have learned best practices for reliability and observability
Strong experience with AWS services, cloud computing technologies, and distributed data stores
Experience with languages like Python or Go to create automated tools
Believe in hands-off deployments and infrastructure as code
Nice to have:
Strong problem-solving and analytical thinking skills, combined with excellent collaboration and communication abilities
A natural curiosity and eagerness to learn and explore new technologies and solutions
SRE experience working with and supporting existing systems to ensure up time and reliability
Experience working with queues in the cloud or SAAS solutions. SQS experience is highly preferred
Knowledge of CI/CD pipelines (e.g., using a tool like GitLab)