CrawlJobs Logo

Staff Observability Operations Engineer

https://www.cvshealth.com/ Logo

CVS Health

Location Icon

Location:
United States, Hartford

Category Icon
Category:
IT - Administration

Job Type Icon

Contract Type:
Employment contract

Salary Icon

Salary:

130295.00 - 260590.00 USD / Year

Job Description:

As a Staff Observability Operations Engineer at CVS Health, you will oversee and optimize the observability platform to ensure seamless and efficient operations. The role involves deploying observability solutions, platform management, system upgrades, troubleshooting incidents, and enhancing platform performance for scalability and complexity.

Job Responsibility:

  • Deploy and implement modern observability solutions to meet organizational needs
  • Manage and administer observability and event management platforms
  • Lead system upgrades, patching, and maintenance activities
  • Coordinate and manage release cycles for observability platforms
  • Troubleshoot and resolve incidents related to observability platforms
  • Continuously monitor and enhance platform performance to support scalability and complexity
  • Collaborate with cross-functional infrastructure, application, and business stakeholders
  • Identify opportunities for process optimization and efficiency gains
  • Ensure high levels of customer satisfaction by effectively managing customer relationships
  • Ensure observability platforms comply with organizational policies and security standards
  • Maintain comprehensive documentation of observability platform configurations
  • Provide training and mentoring to junior engineers, team members, and MSPs.

Requirements:

  • 7+ Years of experience in IT operations, with significant responsibilities in system monitoring, performance tuning, and troubleshooting enterprise applications
  • 5+ Years in a Site Reliability Engineering (SRE) role deploying and managing modern observability solutions
  • 5+ Years managing and implementing observability and event management platforms (e.g., AppDynamics, Splunk, Prometheus, Grafana)
  • Experience developing and administering ServiceNow ITOM event management solutions, ensuring seamless integration with observability tools
  • Experience deploying and managing service reliability platforms (e.g., xMatters, OpsGenie, PagerDuty), configuring incident notifications, incident command workflows, and automating incident remediation workflows
  • Experience with and deep knowledge of cloud environments, cloud monitoring platforms, and container orchestration tools (e.g., AWS/CloudTrail, Azure/Monitor, GCP/GCM, Kubernetes, OpenShift)
  • Proficiency in Python and other scripting languages such as Ansible, PowerShell, and Bash for automation and configuration
  • Experience building and instrumenting dashboards to deliver technical and business process insights leveraging standard observability/BI platforms (e.g., AppDynamics, Grafana, Tableau, PowerBI)
  • Excellent problem-solving skills, with the ability to handle multiple tasks, prioritize effectively, and work under pressure
  • Excellent communication skills, both verbal and written
  • Strong customer service orientation with the ability to manage customer relationships effectively.

Nice to have:

  • ITIL 4 Practitioner: Monitoring and Event Management
  • DevOps Institute Observability Foundation
  • DevOps Institute Site Reliability Engineering Foundation or Practitioner
  • ServiceNow CIS-Event Management Implementer
  • ServiceNow Certified Application Developer
  • xMatters Integrator.
What we offer:
  • Affordable medical plan options
  • 401(k) plan (including matching company contributions)
  • Employee stock purchase plan
  • No-cost programs such as wellness screenings, tobacco cessation, and weight management programs
  • Confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Colleague assistance programs
  • Tuition assistance
  • Retiree medical access.

Additional Information:

Job Posted:
April 26, 2025

Expiration:
June 30, 2025

Employment Type:
Fulltime
Work Type:
Remote work
Job Link Share:
Welcome to CrawlJobs.com
Your Global Job Discovery Platform
At CrawlJobs.com, we simplify finding your next career opportunity by bringing job listings directly to you from all corners of the web. Using cutting-edge AI and web-crawling technologies, we gather and curate job offers from various sources across the globe, ensuring you have access to the most up-to-date job listings in one place.