Golang SRE / Production Engineer

DoubleZero

Contract Type: Not provided

Salary: Not provided

Job Description:

We’re looking for an SRE who thinks like a systems architect and builds reliability through automation. You should have lived inside large production environments and be allergic to manual processes.

Job Responsibility:

  • Design and build automation-first reliability systems in Go
  • Treat infrastructure as pipelines: provisioning, observability, deployment, failover
  • Build internal tooling, control planes, and reliability infrastructure
  • Own production readiness, alerting, and operational maturity
  • Work across protocol, networking, and hardware boundaries

Requirements:

  • Big tech SRE / Production Engineering background or similar
  • Strong Golang engineering skills in production environments
  • Systems thinker: you design for scale, reliability, and operational simplicity
  • Experience building internal platforms rather than just operating existing ones
  • Comfort with distributed systems, networking, and low-level system behavior

Additional Information:

Job Posted: February 21, 2026
Employment Type: Full-time
Work Type: Remote work

Similar Jobs for Golang SRE / Production Engineer

Senior Backend Engineer - Product & Dev Tooling

Endor Labs is building the Application Security platform for the software develo...
Location: India, Bengaluru
Salary: Not provided
Endor Labs
Expiration Date: Until further notice
Requirements
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • 5+ years of experience in software engineering, with a strong foundation in backend development
  • Proficiency in Golang, especially building APIs and tools in microservices architectures
  • Hands-on experience with observability ecosystems: Prometheus, Grafana, OpenTelemetry, etc.
  • A strong SRE mindset—understanding SLAs/SLOs, incident response, root cause analysis—but with a builder’s approach to creating software and automation
  • Familiarity with distributed systems, metrics pipelines, and scalable monitoring infrastructure
  • Proven ability to design and implement technical solutions from the ground up with minimal supervision
  • A passion for transforming complex data into actionable insights through intuitive dashboards
  • Excellent communication skills and a collaborative spirit
Job Responsibility
  • Build systems and dashboards that enable visibility into the health, performance, and usage of our SaaS platform
  • Automate troubleshooting by leveraging deep knowledge of the product to reduce time to repair and fix production issues
  • Build tooling and APIs in Golang to surface data insights via internal dashboards
  • Partner closely with architects, backend engineers, and product managers to define observability tooling and integrate them seamlessly into our platform
  • Drive instrumentation and metrics collection across distributed services using Prometheus, Grafana, and related technologies
  • Champion reliability, debuggability, and performance across the engineering organization
Employment Type: Full-time

Senior Software Engineer, Infrastructure

The InfraOps team’s primary goal is to enable and empower Kiddom’s engineering b...
Location: United States, New York City
Salary: 160000.00 - 200000.00 USD / Year
Kiddom
Expiration Date: Until further notice
Requirements
  • BS or MS in Computer Science or a related field
  • 5+ years professional software engineering experience
  • Experience with Java, Python, Go, or Clojure in a production environment
  • Experience designing and building REST APIs
  • Exposure to authorization technologies (OAuth)
  • Experience with continuous integration and automation tools and processes
  • Strong knowledge of design patterns and software engineering best practices
  • Excellent problem solving and debugging skills
  • Strong acumen or exposure to DevOps or SRE methodologies
  • Keen sense for SecOps.
Job Responsibility
  • Evangelizing and fostering a healthy DevOps culture here at Kiddom, working with teams to establish best practices and help guide new and existing services.
  • Practicing Infrastructure as Code (IaC) wherever possible, giving us the confidence in repeatable processes that can be automated.
  • Grow our DevOps efforts from small scale to large scale multi-region
  • Share ownership of the entire infrastructure architecture
  • Aim for high availability, high resiliency
  • Support the engineering team with tools to evaluate the performance of their code in production environments, speed up the CI/CD pipeline, and verify features
  • Design and build a scalable, generalized framework for third-party API integrations
  • Leverage existing infrastructure and components to build RESTful web services
  • Build APIs and robust testing environments for internal and external developers
What we offer
  • Competitive salary
  • Meaningful equity
  • Health insurance benefits: medical (various PPO/HMO/HSA plans), dental, vision, disability and life insurance
  • One Medical membership (in participating locations)
  • Flexible vacation time policy (subject to internal approval). Average use is 4 weeks off per year.
  • 10 paid sick days per year (prorated depending on start date)
  • Paid holidays
  • Paid bereavement leave
  • Paid family leave after birth/adoption. Minimum of 16 paid weeks for birthing parents, 10 weeks for caretaker parents. Meant to supplement benefits offered by the state.
  • Commuter and FSA plans
Employment Type: Full-time

Software Engineer Staff

Designs, develops, troubleshoots and debugs software programs for software enhan...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise
Expiration Date: Until further notice
Requirements
  • A minimum of 10 years of professional software development experience
  • Proven expertise in one or more backend programming languages such as Golang (highly preferred), Java, Python, or C/C++
  • Deep understanding of networking protocols, network architectures, network security, and common networking concepts
  • Proven experience in designing, building, and deploying scalable microservices using Docker, Kubernetes, etc.
  • Significant experience in building, deploying, and operating scalable SaaS applications in a Public Cloud (AWS/GCP) environment
  • Strong understanding of distributed systems principles, including concurrency, scalability, fault tolerance, and consistency
  • Experience with various database technologies, including relational (e.g., PostgreSQL, MySQL) and NoSQL (e.g., DynamoDB, Redis) databases
  • Experience designing, building, and consuming RESTful APIs and other integration technologies like WebSocket, Kafka, etc.
  • Experience with network security principles, threat modelling, and secure coding practices is an added advantage
  • Excellent analytical and problem-solving skills
Job Responsibility
  • Technical Leadership: Work with product managers, architects, and other engineers to understand the software requirements, and define corresponding functional and design specifications
  • Software Development: Design, develop, test, deploy, and maintain high-quality, production-grade software, with a strong emphasis on backend systems
  • System Design & Optimization: Design and implement micro-services for high availability, scalability, performance, and security within our SaaS platform
  • Networking Expertise: Apply deep knowledge of networking protocols (e.g., TCP/IP, HTTP/S, DNS, NAT), network security, and cloud networking concepts to build robust and secure solutions
  • SaaS & Cloud Native Development: Design and implement solutions leveraging cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes, Docker)
  • Collaboration: Collaborate effectively with cross-functional teams, including product management, QA, SRE, and the Juniper technical assistance team
  • Code Quality & Best Practices: Champion best practices in software development, including code reviews, testing methodologies, CI/CD, and DevOps principles
  • Problem Solving: Troubleshoot and resolve complex technical issues in a timely and effective manner, often in production environments
  • Innovation & Research: Stay abreast of emerging technologies and industry trends in networking, SaaS, and software engineering
  • Documentation: Create and maintain comprehensive technical documentation for designs, APIs, and operational procedures
What we offer
  • Health & Wellbeing: Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Personal & Professional Development: Specific programs catered to helping you reach any career goals you have
  • Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness
Employment Type: Full-time

Software Engineer Staff

We are seeking a talented and motivated Staff Software Engineer to join our dyna...
Location: India, Bangalore
Salary: Not provided
Hewlett Packard Enterprise
Expiration Date: Until further notice
Requirements
  • A minimum of 10 years of professional software development experience
  • Proven expertise in one or more backend programming languages such as Golang (highly preferred), Java, Python, or C/C++
  • Deep understanding of networking protocols, network architectures, network security, and common networking concepts
  • Proven experience in designing, building, and deploying scalable microservices using Docker, Kubernetes, etc.
  • Significant experience in building, deploying, and operating scalable SaaS applications in a Public Cloud (AWS/GCP) environment
  • Strong understanding of distributed systems principles, including concurrency, scalability, fault tolerance, and consistency
  • Experience with various database technologies, including relational (e.g., PostgreSQL, MySQL) and NoSQL (e.g., DynamoDB, Redis) databases
  • Experience designing, building, and consuming RESTful APIs and other integration technologies like WebSocket, Kafka, etc.
  • Experience with network security principles, threat modelling, and secure coding practices is an added advantage
  • Excellent analytical and problem-solving skills
Job Responsibility
  • Technical Leadership: Work with product managers, architects, and other engineers to understand the software requirements, and define corresponding functional and design specifications
  • Software Development: Design, develop, test, deploy, and maintain high-quality, production-grade software, with a strong emphasis on backend systems
  • System Design & Optimization: Design and implement micro-services for high availability, scalability, performance, and security within our SaaS platform
  • Networking Expertise: Apply deep knowledge of networking protocols (e.g., TCP/IP, HTTP/S, DNS, NAT), network security, and cloud networking concepts to build robust and secure solutions
  • SaaS & Cloud Native Development: Design and implement solutions leveraging cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes, Docker)
  • Collaboration: Collaborate effectively with cross-functional teams, including product management, QA, SRE, and the Juniper technical assistance team
  • Code Quality & Best Practices: Champion best practices in software development, including code reviews, testing methodologies, CI/CD, and DevOps principles
  • Problem Solving: Troubleshoot and resolve complex technical issues in a timely and effective manner, often in production environments
  • Innovation & Research: Stay abreast of emerging technologies and industry trends in networking, SaaS, and software engineering
  • Documentation: Create and maintain comprehensive technical documentation for designs, APIs, and operational procedures
What we offer
  • Health & Wellbeing: Comprehensive suite of benefits that supports physical, financial and emotional wellbeing
  • Personal & Professional Development: Programs catered to helping you reach any career goals
  • Unconditional Inclusion: We are unconditionally inclusive in the way we work and celebrate individual uniqueness
Employment Type: Full-time

Site Reliability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...
Location: United States, Boston
Salary: Not provided
Coralogix
Expiration Date: Until further notice
Requirements
  • At least 5 years of experience as a DevOps Engineer/SRE in production environments
  • In-depth experience with Kubernetes - operating & monitoring are key parts
  • At least 2 years of experience with FedRAMP compliance (High/Moderate levels), vulnerability management, and continuous monitoring, including scanning, patching, and reporting
  • High familiarity with monitoring tools such as Coralogix, Grafana, Prometheus
  • Experience in AWS or other cloud providers
  • Experience with infrastructure as code (Terraform, Crossplane, etc.)
  • Understanding of networking, from networking layers to different networking protocols (HTTP, gRPC, SSL)
  • Some software engineering experience, preferably in Golang
Job Responsibility
  • Work in high scale environments - Coralogix data pipeline processes 55Tb of data each day
  • Adopt cutting edge technologies with end-to-end responsibility
  • Build internal tools to expand our platform capabilities
  • Collaborate with R&D to improve stability & reliability of the system
  • Lead the product roadmap - our product is designed for engineers. Therefore, our engineers promote, enhance, and take a crucial part in influencing the product roadmap
  • Perform operational duties for FedRAMP cloud products, including deployments, on-call support, and incident management
Employment Type: Full-time

Infrastructure Engineer/SRE

As a member of the infrastructure team you are responsible for designing, buildi...
Location: Canada; United States
Salary: Not provided
Cresta
Expiration Date: Until further notice
Requirements
  • 5+ years experience in DevOps, Site Reliability Engineering, Production Engineering, or equivalent field
  • Deep proficiency with coding languages such as Golang or Python
  • Deep familiarity with container-related security best practices
  • Production experience working with Kubernetes, and a deep understanding of the Kubernetes ecosystem, including popular open-source tooling such as cert-manager or external-dns
  • Production experience with Kubernetes templating tools such as Helm or Kustomize
  • Production experience with IaC tools such as Terraform or CloudFormation
  • Production experience working with AWS and services such as IAM, S3, EC2, and EKS
  • Production experience with database software such as PostgreSQL
  • Experience with GitOps tooling such as Flux or Argo
  • Experience with CI/CD such as GitHub Actions
Job Responsibility
  • Partner with engineers to build dev tools that empower developer workflows and deployment infrastructure
  • Ensure reliability of multi-cloud Kubernetes clusters and pipelines
  • Metrics, logging, analytics, and alerting for performance and security across all endpoints and applications
  • Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers
  • Automate operations and engineering
  • Building machine learning infrastructure that enables AI teams to train, test, and deploy on large-scale datasets
What we offer
  • We offer Cresta employees a variety of medical, dental, and vision plans, designed to fit you and your family’s needs
  • Paid parental leave to support you and your family
  • Monthly Health & Wellness allowance
  • Work from home office stipend to help you succeed in a remote environment
  • Lunch reimbursement for in-office employees
  • PTO: 3 weeks in Canada
  • Compensation for this position includes a base salary, equity, and a variety of benefits

Software Engineer, Infrastructure

At Cape, we are not just another cellular service provider; we are the architect...
Location: United States, Washington, DC; New York, NY
Salary: 150000.00 - 230000.00 USD / Year
Cape
Expiration Date: Until further notice
Requirements
  • 4+ years of software engineering or SRE experience
  • Strong familiarity with AWS
  • Fluency in Golang, Rust, Java/Kotlin, Python, or similar language
  • Experience with building, deploying, and using monitoring infrastructure & tools
  • Experience designing, building, and delivering high availability systems and infrastructure
  • Passion for privacy and national security
  • A desire to work on software that has real-world impact
Job Responsibility
  • Be responsible for the full lifecycle development of our privacy-focused telecommunications and deployment infrastructure
  • Build, integrate, and maintain our instrumentation and monitoring infrastructure and tooling for improving the reliability, availability, and performance of our system
  • Help resolve potential problems proactively, before they become issues
  • Build new or integrate with existing telecommunications infrastructure and components
  • Own the technical accreditation and compliance process end-to-end for FedRAMP
  • Shape and influence what great software engineering practices look like
  • Balance short term critical business needs with long term product vision and roadmap
What we offer
  • equity
  • 401K match
  • top-tier health care
  • generous vacation policy
Employment Type: Full-time

Staff Backend Engineer

At Docker, we make app development easier so developers can focus on what matter...
Location: United States
Salary: 195400.00 - 275550.00 USD / Year
Docker
Expiration Date: Until further notice
Requirements
  • 8+ years backend engineering experience with deep expertise in distributed systems and large-scale backend architectures
  • Strong production experience with Golang, including designing and operating large Go-based services in cloud environments
  • Strong production experience with Kubernetes, including operating services at scale
  • Experience designing and running high-scale storage systems (PostgreSQL, DynamoDB, or equivalent) in production
  • Experience building and operating cloud-based services (AWS preferred)
  • Experience with event-driven or streaming systems, such as Kafka, SNS/SQS, or equivalent
  • Strong foundation in software engineering best practices: design documentation, testing strategies, CI/CD, code review, observability
  • Comfortable functioning autonomously in a fully distributed, remote-first team and working effectively in a fast-paced environment
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience
Job Responsibility
  • Architect, build, and operate high-scale distributed systems powering Docker Hub’s registry platform—spanning artifact storage, metadata services, indexing workflows, and performance-critical APIs
  • Lead the design and implementation of backend services with a strong emphasis on scalability, correctness, resilience, and performance
  • Drive major initiatives around multi-region replication, caching strategies, request-path optimization, and core registry reliability
  • Design, optimize, and operate the data and storage layers for both relational and NoSQL databases, as well as object storage and related technologies
  • Develop schemas and data models to support high-throughput, large-volume workloads
  • Own systems end-to-end—from storage-layer behavior to API design, deployment workflows, and production monitoring
  • Improve the performance and reliability of one of the world’s largest repositories of container images
  • Develop and enhance observability through metrics, traces, alerting, and dashboards
  • Lead improvements to deployment and operational tooling (e.g., Argo CD, GitHub Actions)
  • Participate in on-call rotations as part of supporting critical production services
What we offer
  • Freedom & flexibility: fit your work around your life
  • Designated quarterly Whaleness Days plus end of year Whaleness break
  • Home office setup: we want you comfortable while you work
  • 16 weeks of paid Parental leave
  • Technology stipend equivalent to $100 net/month
  • PTO plan that encourages you to take time to do the things you enjoy
  • Training stipend for conferences, courses and classes
  • Equity
Employment Type: Full-time