Observability Operations Engineer Job at Technologent (Phoenix)

Staff Observability Operations Engineer

We are currently seeking several experienced and highly skilled Staff Observabil...

Location

United States , Hartford

Salary:

130295.00 - 260590.00 USD / Year

CVS Health

Expiration Date

Until further notice

Requirements

7+ Years of experience in IT operations, with significant responsibilities in system monitoring, performance tuning, and troubleshooting enterprise applications
5+ Years in a Site Reliability Engineering (SRE) role deploying and managing modern observability solutions
5+ Years managing and implementing observability and event management platforms (e.g., AppDynamics, Splunk, Prometheus, Grafana)
Experience developing and administering ServiceNow ITOM event management solutions
Experience deploying and managing service reliability platforms (e.g., xMatters, OpsGenie, PagerDuty)
Experience with and deep knowledge of cloud environments, cloud monitoring platforms, and container orchestration tools (e.g., AWS/CloudTrail, Azure/Monitor, GCP/GCM, Kubernetes, OpenShift)
Proficiency in Python and other scripting languages such as Ansible, PowerShell, Bash for automation and configuration
Hands-on experience deploying, managing, and administering observability platforms
Hands-on experience leading, coordinating, and performing migration of application, platform, and infrastructure observability solutions
Proven ability to troubleshoot and resolve complex technical issues

Job Responsibility

Deploy and implement modern observability solutions
Manage and administer observability and event management platforms
Coordinate and manage release cycles for observability platforms
Troubleshoot and resolve incidents related to observability platforms
Continuously monitor and enhance platform performance
Collaborate with cross-functional stakeholders
Provide training and mentoring to junior engineers
Ensure compliance and security of observability platforms
Maintain documentation of observability platform configurations
Generate and analyze reports on platform performance and capacity

What we offer

Affordable medical plan options
a 401(k) plan (including matching company contributions)
an employee stock purchase plan
No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs
confidential counseling and financial coaching
Paid time off
flexible work schedules
family leave
dependent care resources
colleague assistance programs

Fulltime

Senior Software Engineer, Observability

The Observability team at Airtable ensures that engineers have the tools they ne...

Location

United States , San Francisco; New York; Seattle

Salary:

196000.00 - 270000.00 USD / Year

Airtable

Expiration Date

Until further notice

Requirements

6+ years of software engineering experience
3+ years focused on observability or infrastructure at scale
Demonstrated success implementing and running production-grade logging, metrics, or tracing systems
Proficiency in distributed systems concepts, data streaming pipelines, and container orchestration (Kubernetes)
Deep hands-on knowledge of tools such as Prometheus, Grafana, Datadog, OpenTelemetry, ELK Stack, Loki, or ClickHouse
Comfort with at least one programming language (e.g., Go, Python, Java) to build and maintain observability tooling
Experience mentoring engineers and collaborating across multiple teams
Strong communication skills
Eagerness to own high-impact initiatives
Proven ability to balance short-term fixes with long-term strategic vision

Job Responsibility

Architect and scale core observability systems
Lead the design and evolution of logging, metrics, and tracing pipelines
Evaluate and integrate new technologies (e.g., OpenTelemetry, ClickHouse, ELK stack)
Guide and mentor a growing team of infrastructure engineers
Define and uphold coding standards and operational excellence
Partner with Deploy Infrastructure, Service Orchestration, and Product teams
Align infrastructure decisions with business goals
Own end-to-end reliability for observability tools and establish SLAs, SLOs, and error budgets
Optimize performance and cost of large-scale data pipelines
Shape the observability roadmap

What we offer

Opportunity to receive benefits
Restricted stock units
May include incentive compensation
Comprehensive benefit offerings

Fulltime

Senior Observability Engineer

Coralogix is a modern, full-stack observability platform transforming how busine...

Location

Germany , Berlin

Salary:

Not provided

Coralogix

Expiration Date

Until further notice

Requirements

5+ years of experience in Site Reliability, DevOps, or Platform Engineering with a focus on observability
Proven expertise with at least one major observability platform (e.g., Prometheus, Victoria Metrics, OpenSearch)
Hands-on experience with Kubernetes, including deep knowledge of controllers, operators, and Helm
Experience writing Kubernetes controllers (controller-runtime, KubeBuilder)
Strong programming skills in Go or Python (Rust is a plus)
Experience designing, scaling, and operating observability systems at enterprise scale
Familiarity with at least one major cloud provider (AWS, Azure, or GCP)
Strong understanding of distributed systems, telemetry pipelines, and instrumentation standards (e.g., OpenTelemetry)
Excellent communication skills with the ability to explain complex topics to diverse stakeholders

Job Responsibility

Design, implement, and maintain observability features such as Alerting, SLOs, Reporting, and Synthetic Tests
Manage and scale OpenTelemetry Collectors and other observability agents across Kubernetes environments
Write and maintain Kubernetes Controllers using frameworks like controller-runtime and KubeBuilder
Operate and optimize the internal Coralogix account, ensuring proper usage, cost efficiency, and best practices adoption
Define and enforce observability guidelines and standards across the organization
Partner with engineering teams to embed observability by default into products and services
Control observability-related costs while maximizing performance, visibility, and value
Contribute to upstream projects such as OpenTelemetry, helping shape industry standards
Explore and implement cutting-edge observability technologies, including eBPF-based approaches

Fulltime

Senior Security Operations Engineer II

As a Senior Security Operations Engineer, you’ll play a key role in ensuring the...

Location

United States , Scottsdale

Salary:

Not provided

Axon

Expiration Date

Until further notice

Requirements

7+ years of experience in operations, site reliability, or infrastructure engineering roles
Strong experience securing and managing cloud environments (e.g., AWS, Azure) and containerized workloads
Deep understanding of Linux systems, networking, distributed systems, and their associated security controls
Proficiency in automation, scripting, and security tooling integration to streamline operations and enforcement
Experience with security monitoring, alerting, SIEM platforms, and observability tools
Solid grasp of CI/CD practices with integrated security testing and compliance checks
Experience managing Kubernetes clusters and running containerized workloads in production
Experience with deploying and administrating any of the following: scalable cloud native secrets solutions such as AWS KMS, Azure KeyVault
PKI solutions such as EJBCA, Smallstep, Venafi
or vaulting solutions such as Hashicorp Vault

Job Responsibility

Implementing and improving automated security checks in CI/CD pipelines to prevent vulnerabilities from reaching production
Writing, reviewing, and maintaining security-focused infrastructure-as-code for scalable and compliant deployments
Investigating security incidents, performing root cause analysis, and implementing long-term mitigation strategies
Collaborating with developers to develop new features, services, and infrastructure requirements
Enhancing security observability through improved log collection, metrics, and alerting configurations
Maintaining and improving security runbooks, incident response playbooks, and internal security tooling for operational efficiency
Resolve security/infrastructure incidents by participating in high impact/high visibility incidents as a participant and ideally as an incident commander
Maintain and secure critical infrastructure components such as PKI (Public Key Infrastructure) and IAM ( Identity & Access Management) systems, ensuring reliability, scalability, and compliance with organizational and industry security standards
Build and maintain secure, reliable, and scalable infrastructure that protects core services and sensitive data
Troubleshoot and resolve complex operational and system-level issues across environments

What we offer

Competitive salary and 401k with employer match
Discretionary paid time off
Paid parental leave for all
Medical, Dental, Vision plans
Fitness Programs
Emotional & Mental Wellness support
Learning & Development programs
Snacks in our offices

Fulltime

Senior Security Operations Engineer II

As a Senior Security Operations Engineer, you’ll play a key role in ensuring the...

Location

United States , Scottsdale

Salary:

Not provided

Axon

Expiration Date

Until further notice

Requirements

7+ years of experience in operations, site reliability, or infrastructure engineering roles
Strong experience securing and managing cloud environments (e.g., AWS, Azure) and containerized workloads
Deep understanding of Linux systems, networking, distributed systems, and their associated security controls
Proficiency in automation, scripting, and security tooling integration to streamline operations and enforcement
Experience with security monitoring, alerting, SIEM platforms, and observability tools
Solid grasp of CI/CD practices with integrated security testing and compliance checks
Experience managing Kubernetes clusters and running containerized workloads in production
Experience with deploying and administrating any of the following: scalable cloud native secrets solutions such as AWS KMS, Azure KeyVault
PKI solutions such as EJBCA, Smallstep, Venafi
or vaulting solutions such as Hashicorp Vault

Job Responsibility

Implementing and improving automated security checks in CI/CD pipelines to prevent vulnerabilities from reaching production
Writing, reviewing, and maintaining security-focused infrastructure-as-code for scalable and compliant deployments
Investigating security incidents, performing root cause analysis, and implementing long-term mitigation strategies
Collaborating with developers to develop new features, services, and infrastructure requirements
Enhancing security observability through improved log collection, metrics, and alerting configurations
Maintaining and improving security runbooks, incident response playbooks, and internal security tooling for operational efficiency
Resolve security/infrastructure incidents by participating in high impact/high visibility incidents as a participant and ideally as an incident commander
Maintain and secure critical infrastructure components such as PKI (Public Key Infrastructure) and IAM ( Identity & Access Management) systems, ensuring reliability, scalability, and compliance with organizational and industry security standards
Build and maintain secure, reliable, and scalable infrastructure that protects core services and sensitive data
Troubleshoot and resolve complex operational and system-level issues across environments

What we offer

Competitive salary and 401k with employer match
Discretionary paid time off
Paid parental leave for all
Medical, Dental, Vision plans
Fitness Programs
Emotional & Mental Wellness support
Learning & Development programs
Snacks in our offices

Fulltime

Senior DevOps Engineer (Observability)

You will enable our machine learning team, data engineers, and applications team...

Location

United States , New York

Salary:

180000.00 - 225000.00 USD / Year

EvolutionIQ

Expiration Date

Until further notice

Requirements

7+ years of DevOps experience
Extensive experience designing and running production systems on GCP
Deep exposure and familiarity to networking concepts, Kubernetes clusters, Docker, containerized development, Terraform, Helm, Dagster (DE), and ArgoCD
Experience with production operations and working with product engineering teams
Experience integrating with SIEM and security software, such as vulnerability scanners
You know the critical questions to ask in order to understand a client’s business problem and can show the business impact of your technical solutions
Team player who is solutions-oriented
You have crisp written and verbal communication skills

Job Responsibility

Improve and further our observability stack across GCP infrastructure and applications
Drive consistency and operational excellence across all teams
Enable the data engineering team to use Dagster efficiently
Leverage tools like Terraform, Github Actions, Helm, and ArgoCD to build efficient infrastructure as code pipelines
Ensure industry standard security controls in our cloud environments
Institute culture of reliability in a federated ownership environment

What we offer

Medical, dental, vision, short & long-term disability, life insurance and AD&D, and 401k matching
Additional family, wellness, and pet benefits
Paid time off and sick leave, 100% paid parental leave (16 weeks for primary caregivers and 12 weeks for secondary caregivers)
We offer a flexible schedule for new parents returning to work
Catered lunches, happy hours, pet-friendly spaces, and monthly technology stipend
$1,000/year for each employee for professional development, as well opportunities for tuition reimbursement
An annual bonus plan and company equity plan (RSUs) are also included in our compensation package

Fulltime

Monitoring & Observability Engineer

The Monitoring & Observability Engineer is a senior level position responsible f...

Location

India , Chennai; Pune

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

3-7 years of relevant experience in an Engineering & IT role
At least 2+ years of hands-on working experience in: Strong understanding of UI/UX principles and best practices
Proficient in JavaScript, TypeScript, HTML, CSS, React, and Node.js
Experience with backend technologies and databases (e.g., MongoDB)
Experience with Python Programming
Experience with version control systems (e.g., Git)
Strong problem-solving and analytical skills
Excellent communication and collaboration skills
Create modular and reusable React components to streamline development and maintain consistency across the application
Continuously improve existing applications, addressing bugs, and implementing new features

Job Responsibility

Drive the best-in-class monitoring using a range of tools across all regions of Global Consumer bank
Drive POCs and incubate new features and capabilities
Be forward looking and ensure long term strategic success
Work closely with the monitoring operations teams, production support, performance test teams, operations, application owners and application owners to deliver best-in-class monitoring
Explain complicated performance bottlenecks to stakeholders
Understand complicated application architecture, including Java app servers, Web Servers, Cloud (PCF, AWS, Google), Kubernetes, TIBCO, mainframe
Build advanced dashboards and queries
Be a subject matter expert for the Global Consumer Bank, including conducting brown bags and office hours
Recommend product customization for system integration
Identify problem causality, business impact and root causes

Fulltime

Federal Observability Engineer

You will be part of a larger technical team, working as an Observability Enginee...

Location

United States , HILL AFB

Salary:

105500.00 - 243000.00 USD / Year

Hewlett Packard Enterprise

Expiration Date

Until further notice

Requirements

US Citizenship Required
Secret Clearance Required
DD8750 - Security Plus or higher Security Certification (CISSP, CASP, etc)
Bachelor's degree preferred or Associate degree holder (technical field) with 6-8 years working experience in related fields
Strong understanding of cloud computing platforms (AWS, Azure, GCP)
Experience with containerization technologies (Docker, Kubernetes)
Proficiency in scripting languages (Python, Go, Bash)
Experience with SQL and NoSQL databases
Knowledge of networking protocols (TCP/IP, HTTP)
Proven experience with the OpsRamp platform is a strong plus

Job Responsibility

Designing, implementing, and maintaining observability infrastructure in an OpsRamp environment
Working as part of a larger technical team supporting HPE's PCE environment and Cloud infrastructure for a Federal Customer
Configuring and managing data sources, defining and monitoring key performance indicators (KPIs), and analyzing performance trends
Configuring log collection, aggregation, and analysis within the OpsRamp platform
Creating and managing alerts, defining escalation paths, and integrating with incident management systems
Developing and implementing automated workflows and remediation actions within the OpsRamp platform
Designing and building custom dashboards and reports to provide key insights into system health and performance
Integrating OpsRamp with other monitoring and observability tools as needed
Ensuring data quality and integrity within the OpsRamp platform
Troubleshooting and resolving performance issues, application errors, and other operational problems

What we offer

Health & Wellbeing benefits
Personal & Professional Development programs
Unconditional Inclusion environment
Comprehensive suite of benefits supporting physical, financial and emotional wellbeing

Fulltime

Observability Operations Engineer

Technologent

Location:
United States , Phoenix

Category:
IT - Administration

Contract Type:
Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:
February 19, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Observability Operations Engineer

Staff Observability Operations Engineer

Senior Software Engineer, Observability

Senior Observability Engineer

Senior Security Operations Engineer II

Senior Security Operations Engineer II

Senior DevOps Engineer (Observability)

Monitoring & Observability Engineer

Federal Observability Engineer

Observability Operations Engineer

Technologent

Location:United States , Phoenix

Category:IT - Administration

Contract Type:Not provided

Salary:

Job Description:

Job Responsibility:

Requirements:

Nice to have:

Additional Information:

Job Posted:February 19, 2026

Looking for more opportunities? Search for other job offers that match your skills and interests.

Similar Jobs for Observability Operations Engineer

Staff Observability Operations Engineer

Senior Software Engineer, Observability

Senior Observability Engineer

Senior Security Operations Engineer II

Senior Security Operations Engineer II

Senior DevOps Engineer (Observability)

Monitoring & Observability Engineer

Federal Observability Engineer

Location:
United States , Phoenix

Category:
IT - Administration

Contract Type:
Not provided

Job Posted:
February 19, 2026