Cloud and Observability Engineer Job at Coralogix (Gurugram)

Senior Cloud Engineer – Observability & Performance Engineering

We are seeking a highly experienced Cloud Engineer (Observability) to lead the e...

Location

United States , Washington

Salary:

Not provided

Robert Half

Expiration Date

Until further notice

Requirements

Bachelor's degree in Information Technology, Computer Science, Engineering, or a related field
8+ years of experience in infrastructure, platform, cloud, or operations engineering
5+ years of experience focused on: Observability, Site Reliability Engineering (SRE), Performance Engineering, Application Performance Monitoring (APM)
Experience administering and optimizing observability platforms such as: Datadog, Dynatrace, New Relic, Splunk Observability, Grafana/Prometheus
Strong experience with: OpenTelemetry, Distributed tracing, Performance tuning, APM engineering, Cloud-native monitoring
Experience supporting Azure, AWS, and containerized platforms
Proven ability to troubleshoot complex performance and reliability issues
Ability to obtain and maintain Public Trust clearance

Job Responsibility

Observability Platform Engineering
Cloud & Container Monitoring
Performance Engineering & Reliability
Capacity Planning & Operational Excellence

What we offer

medical
vision
dental
life and disability insurance
401(k) plan

Senior Software Engineer - Observability and Reliability

We are growing the engineering team and looking for engineers who have the chops...

Location

United States , San Francisco

Salary:

150000.00 - 220000.00 USD / Year

Sigma Computing

Expiration Date

Until further notice

Requirements

Strong Computer Science fundamentals
5+ years industry experience building and maintaining high-quality software, especially software other engineers use
You apply a product mindset to infrastructure systems and feel accomplished enabling others
Desire to be a great teammate and have fun at work
Strong sense of craftsmanship, and a healthy academic curiosity

Job Responsibility

Build observability tools and platforms, including: metrics, logging, distributed tracing, dashboarding, alerting, application performance management
Build with modern tools and languages like Go, Open Telemetry and Kubernetes
Participate in on-call rotation and ensure uptime of services
Create runtime tools/processes that optimize cloud triaging and limit downtime
Define best practices around making our systems and services measurable
Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies. We expect successful candidates to be coding a majority of their time

What we offer

Equity
Generous health benefits
Flexible time off policy
Paid bonding time for all new parents
Traditional and Roth 401k
Commuter and FSA benefits
Lunch Program
Dog friendly office

Fulltime

Senior Software Engineer - Observability and Reliability

We are growing the engineering team and looking for engineers who have the chops...

Location

United States , New York City

Salary:

150000.00 - 220000.00 USD / Year

Sigma Computing

Expiration Date

Until further notice

Requirements

Strong Computer Science fundamentals
5+ years industry experience building and maintaining high-quality software, especially software other engineers use
You apply a product mindset to infrastructure systems and feel accomplished enabling others
Desire to be a great teammate and have fun at work
Strong sense of craftsmanship, and a healthy academic curiosity

Job Responsibility

Build observability tools and platforms, including: metrics, logging, distributed tracing, dashboarding, alerting, application performance management
Build with modern tools and languages like Go, Open Telemetry and Kubernetes
Participate in on-call rotation and ensure uptime of services
Create runtime tools/processes that optimize cloud triaging and limit downtime
Define best practices around making our systems and services measurable
Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies. We expect successful candidates to be coding a majority of their time

What we offer

Equity
Generous health benefits
Flexible time off policy
Paid bonding time for all new parents
Traditional and Roth 401k
Commuter and FSA benefits
Lunch Program
Dog friendly office

Fulltime

Senior Software Engineer - Observability and Reliability

We are growing the engineering team and looking for engineers who have the chops...

Location

United States , San Francisco

Salary:

170000.00 - 215000.00 USD / Year

Sigma Computing

Expiration Date

Until further notice

Requirements

Strong Computer Science fundamentals
5+ years industry experience building and maintaining high-quality software, especially software other engineers use
You apply a product mindset to infrastructure systems and feel accomplished enabling others
Desire to be a great teammate and have fun at work
Strong sense of craftsmanship, and a healthy academic curiosity

Job Responsibility

Build observability tools and platforms, including: metrics, logging, distributed tracing, dashboarding, alerting, application performance management
Build with modern tools and languages like Go, Open Telemetry and Kubernetes
Participate in on-call rotation and ensure uptime of services
Create runtime tools/processes that optimize cloud triaging and limit downtime
Define best practices around making our systems and services measurable
Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies. We expect successful candidates to be coding a majority of their time

What we offer

Equity
Generous health benefits
Flexible time off policy
Paid bonding time for all new parents
Traditional and Roth 401k
Commuter and FSA benefits
Lunch Program
Dog friendly office

Fulltime

Senior Software Engineer - Observability and Reliability

We are growing the engineering team and looking for engineers who have the chops...

Location

United States , San Francisco

Salary:

150000.00 - 220000.00 USD / Year

Sigma Computing

Expiration Date

Until further notice

Requirements

Strong Computer Science fundamentals
5+ years industry experience building and maintaining high-quality software, especially software other engineers use
You apply a product mindset to infrastructure systems and feel accomplished enabling others
Desire to be a great teammate and have fun at work
Strong sense of craftsmanship, and a healthy academic curiosity

Job Responsibility

Build observability tools and platforms, including: metrics, logging, distributed tracing, dashboarding, alerting, application performance management
Build with modern tools and languages like Go, Open Telemetry and Kubernetes
Participate in on-call rotation and ensure uptime of services
Create runtime tools/processes that optimize cloud triaging and limit downtime
Define best practices around making our systems and services measurable
Collaborate with peers and stakeholders through design and code reviews to ensure best practices amongst available technologies. We expect successful candidates to be coding a majority of their time

What we offer

Equity
Generous health benefits
Flexible time off policy
Paid bonding time for all new parents
Traditional and Roth 401k
Commuter and FSA benefits
Lunch Program
Dog friendly office
Stock options

Fulltime

Principal Engineer I - Cloud Observability

We’re not just building better tech. We’re rewriting how data moves and what the...

Location

India

Salary:

Not provided

Confluent

Expiration Date

Until further notice

Requirements

Minimum of 15+ years of hands-on software development experience with the ability to anticipate future technical needs for the product and craft plans to realize them
Taking ideas to production is something we look for
Ready to roll up your sleeves - code, debug, design - do whatever it takes to ship the product to production
Experience building and operating large-scale systems. Solid understanding of basic systems operations (disk, network, operating systems, etc). Experience running production services in the cloud
Strong fundamentals in distributed systems design and development. Solid fundamentals in concurrent and multi-threading programming
A self starter with the ability to work effectively in teams. Proactively identifying the symptoms of technical issues and reason about their causes is needed. This will be followed by fixing the root causes
Timely shipping of deliverables
being able to trade-off short term technical decisions with the long term. Move fast, build in increments, and iterate. A sense of urgency, a mindset towards achieving results, and excellent prioritization skills
Ability to influence the team, peers and upper management in technology decisions using effective communication and collaborative techniques
Degree in Computer Science, Engineering or equivalent experience. Understanding of various technologies, programming paradigms and frameworks is needed. Ability to be pragmatic and trade off their usage in production is essential

Job Responsibility

You will work with a team of engineers and architects to help evolve Confluent Observability features
Work closely with product management, engineering leadership, and other key stakeholders across various teams in Confluent to build and drive the overall roadmap
Need you to be a strong tech voice outside Confluent Observability within Confluent
Influence the overall domain health and operational hygiene for Confluent Observability
We need a tech champion for the observability capabilities we provide to our customers
You are expected to review designs and code and improve our technical standards
We are looking at you to lead the technology charter for our observability features in Confluent Cloud and in hybrid scenarios with Confluent Platform
Mentor a team of high-performing engineers and leads, helping them to continue in growing their skill set through hands-on experience and mentorship
Be a strong technical leader and representative for engineering teams in India
Provide timely and productive feedback, encourage a growth mindset, and advise team members in setting and working toward personal development goals

What we offer

Remote-First Work
Robust Insurance Benefits
Flexible Time Away
The Best Teammates
Experience Ambassadors
Open and Honest Culture
Well-Being and Growth

Fulltime

Senior Software Engineer - Cloud Infrastructure & Observability

Location

India , Bengaluru

Salary:

Not provided

Roku

Expiration Date

Until further notice

Requirements

15+ years in software engineering with a track record of architecting distributed systems or platforms at scale
Strong hands‑on experience in Golang and one scripting language (e.g., Python or Shell)
Experience operating observability at pb-scale ingestion and hundreds of millions of series
Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
strong proficiency with service mesh technologies (Istio/Envoy), infrastructure‑as‑code (Terraform) and experience in multi‑cloud (AWS, GCP)
Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
Proven experience integrating security as part of infrastructure and platform development
Exceptional cross‑functional communication
effective collaboration with both technical and non‑technical stakeholders

Job Responsibility

Architect and lead Roku’s observability platform across metrics, logs, and traces
evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
Extend and harden open‑source observability systems
overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
Implement features such as pre‑aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
augment and automate CI/CD flows and onboarding
Integrate security into infrastructure and platform services
ensure robust multi‑tenant, multi‑cluster, and multi‑cloud designs
Contribute improvements back to open source and CNCF‑aligned projects

What we offer

Global access to mental health and financial wellness support and resources
healthcare (medical, dental, and vision)
life, accident, disability, commuter, and retirement options (401(k)/pension)
time off in accordance with local leave policies

Fulltime

Senior Software Engineer - Cloud Infrastructure & Observability

We are building a next-generation observability and cloud platform that is high-...

Location

United Kingdom , Cambridge

Salary:

Not provided

Roku

Expiration Date

Until further notice

Requirements

Extensive experience with software engineering with a track record of architecting distributed systems or platforms at scale
Strong hands-on experience in Golang and one scripting language (e.g., Python or Shell)
Experience operating observability at pb-scale ingestion and hundreds of millions of series
Expertise in observability platforms and tooling (Prometheus, Grafana, Loki, Tempo, ELK/OpenSearch, ClickHouse) and standards (OpenTelemetry, OpenMetrics)
Deep experience building systems of scale and operating cloud infrastructure with Kubernetes
strong proficiency with service mesh technologies (Istio/Envoy), infrastructure-as-code (Terraform) and experience in multi-cloud (AWS, GCP)
Demonstrated ability to evolve storage and query architectures for cost, scale, and latency (e.g., TSDB, Parquet, distributed processing)
Proven experience integrating security as part of infrastructure and platform development
Exceptional cross-functional communication
effective collaboration with both technical and non-technical stakeholders

Job Responsibility

Architect and lead Roku’s observability platform across metrics, logs, and traces
evolve data pipelines and storage layers optimized for high throughput, performance, and cost at Roku scale (TSDBs, Parquet, distributed processing)
Extend and harden open-source observability systems
overhaul core components (e.g., storage layers, query paths) to improve performance, reliability, and usability at scale
Implement features such as pre-aggregation, down-sampling, and sampling to reduce load and accelerate queries across the platform
Collaborate across platform, SRE, and product teams to migrate hundreds of workloads to our common platform
augment and automate CI/CD flows and onboarding
Integrate security into infrastructure and platform services
ensure robust multi-tenant, multi-cluster, and multi-cloud designs
Contribute improvements back to open source and CNCF-aligned projects

What we offer

Global access to mental health and financial wellness support and resources
healthcare (medical, dental, and vision)
life, accident, disability, commuter, and retirement options (401(k)/pension)
time off work for vacation and other personal reasons

Fulltime

Select Country

Cloud and Observability Engineer

Job Description

Job Responsibility

Requirements

Nice to have

Looking for more opportunities?