Observability Lead – Elastic (ELK) Stack

Integra Micro Software Services

Location:
India, Mumbai

Contract Type:
Not provided

Salary:

Not provided

Job Description:

We are seeking a highly experienced and visionary Observability Lead to spearhead our monitoring and infrastructure management initiatives. This senior role requires deep expertise in the Elastic Stack and a comprehensive understanding of modern distributed system telemetry. The successful candidate will drive the strategy and implementation of robust observability solutions, ensuring system reliability, performance, and insightful data visualization.

Job Responsibility:

  • Spearhead our monitoring and infrastructure management initiatives
  • Drive the strategy and implementation of robust observability solutions
  • Ensure system reliability, performance, and insightful data visualization

Requirements:

  • Bachelor’s or Master’s degree in Computer Science, Information Technology (IT), or a closely related technical field
  • 8+ years of professional experience dedicated to observability, system monitoring, or infrastructure management practices
  • 3+ years of direct, hands-on experience specifically managing and engineering solutions using the full Elastic Stack (Elasticsearch, Kibana, Logstash/Beats, Elastic APM, and Fleet/Elastic Agent)
  • Strong, practical understanding of fundamental observability concepts, including the collection and analysis of logs, metrics, traces, and synthetic monitoring
  • Expertise in implementing OpenTelemetry, configuring distributed tracing, and carrying out telemetry instrumentation within complex microservice environments
  • Proven experience working with complementary modern monitoring and containerization tools such as Kubernetes, Docker, Prometheus, and Grafana
  • Demonstrated proficiency in managing YAML-based system configurations
  • Extensive experience in performance optimization, advanced data visualization, and sophisticated dashboarding using Kibana

What we offer:

  • Innovation-focused culture
  • Collaborative environment
  • Professional Development through continuous learning programs, certifications, and mentorship opportunities
  • Work-Life Integration with competitive benefits and policies

Additional Information:

Job Posted:
January 02, 2026

Similar Jobs for Observability Lead – Elastic (ELK) Stack

AWS Cloud Devops Technical Manager

AWS Cloud Devops Technical Manager role at Sopra Steria, a major Tech player in ...
Location:
India, Chennai; Noida
Salary:
Not provided
Sopra Steria
Expiration Date
Until further notice
Requirements
  • Total Experience Expected: 10-14 years
  • Professional Experience in Cloud-native software architecture (Microservices, Patterns, DDD)
  • Working with internal developer platforms such as Backstage
  • AWS Platform (Infrastructure, Services, Administration, Provisioning, Monitoring)
  • Managing Kubernetes (EKS)
  • Operations
  • Networking
  • Autoscaling
  • High Availability
  • ELK Stack
Job Responsibility
  • Lead the Infra team
  • Manage Kubernetes (EKS)
  • Implement and maintain build and release pipelines
  • Cloud-native software architecture
  • Cloud security patterns
  • Application monitoring and healing
What we offer
  • Commitment to fighting against all forms of discrimination
  • Inclusive and respectful work environment
  • All positions open to people with disabilities
  • Fulltime

Senior Database Engineer

We’re looking for a skilled Data Reliability Engineer to join our team for a cli...
Location:
United States
Salary:
Not provided
Zoolatech
Expiration Date
Until further notice
Requirements
  • 5+ years of experience in Data Engineering, Database Reliability, or Infrastructure Operations
  • Strong expertise in PostgreSQL on AWS, including tuning, replication, backups, and HA configurations
  • Experience operating RDBMS databases (PostgreSQL, MySQL, etc.) and Kubernetes technologies is highly desirable
  • Experience provisioning and operating NoSQL databases at scale, such as Elasticsearch, ElastiCache, DynamoDB, Neo4j, MongoDB, and Cassandra
  • Advanced SQL scripting and query optimization skills
  • Experience with data systems monitoring, alerting, and performance tuning
  • Strong programming/scripting in Java, Python, or Shell
  • Proven experience in designing or supporting complex data ecosystems
  • Solid understanding of cloud infrastructure (preferably AWS) and Infrastructure as Code tools (Terraform)
  • Familiarity with event streaming platforms (Kafka) and observability stacks (New Relic, ELK, etc.)
Job Responsibility
  • Own and optimize the reliability, availability, and performance of data infrastructure across production systems
  • Lead the design and implementation of resilient, secure, and observable data systems
  • Collaborate with SRE, Security, and Engineering teams to enforce data infrastructure standards and align on architectural decisions
  • Design and implement automation around provisioning, uptime monitoring, data refresh, integrity, backups, and disaster recovery
  • Support application developers with performance tuning, complex query optimization, and database design reviews
  • Analyze and resolve performance bottlenecks and incidents with a focus on long-term solutions
  • Participate in on-call rotation to support production systems and ensure high availability
  • Actively contribute to improving incident response and observability through metrics, alerting, and runbooks
  • Work with technologies such as Java, Ruby on Rails, PostgreSQL, AWS, Kafka, S3, Elasticsearch
What we offer
  • Paid Vacation
  • Sick Days
  • Floating Holidays
  • Sport/Insurance Compensation
  • English Classes
  • Charity
  • Training Compensation
  • Fulltime

Substation Maintenance Manager

This position is responsible for the maintenance of all medium and high voltage ...
Location:
United States, Andover
Salary:
136800.00 - 205200.00 USD / Year
Enel
Expiration Date
February 01, 2026
Requirements
  • Bachelor's degree in Electrical Engineering or Power Engineering from an ABET-accredited university
  • 8+ years of experience in electrical high voltage maintenance
  • 3+ years of experience managing electrical engineers or technicians, power plant electrical maintenance, and CapEx electrical projects
  • Ability to work independently
  • Ability to be on call or work off hours as needed
  • Detailed knowledge and understanding of budgeting
  • Strong scheduling and planning skills, with the flexibility to adapt plans as conditions change
  • Personnel management skills
  • Change management skills
  • Able to interpret technical and legal documents and agreements
Job Responsibility
  • Overall responsibility for the performance of MV/HV electrical assets, including OpEx and CapEx budgeting and control
  • Overall management of the MV/HV infrastructure onsite and remote operations
  • Provide technical support in resolving codes and standards issues (NEC, NESC, NFPA 70E, ANSI, UL, IEC, CSA)
  • Development of processes and procedures to safely and effectively operate and maintain the EGPNA Wind O&M fleet
  • Provide engineering consulting support to other areas of the organization, including Construction and Power Marketing
  • Manage, with the support of the EGPNA EHS department, an effective EHS program in accordance with all EGPNA policies, applicable laws and regulations and permits
  • Work with the Maintenance Planning team to develop and manage the implementation of a strategy for maintenance activities. Manage third-party contracts, major repairs, and capital improvements to ensure maximum project availability and profitability
  • Establish a long-range regional plan and budget, and report on same to Head of Cross Technical Services and heads of the various technologies
  • Identify methods to increase project efficiency/reliability and implement capital improvement projects to increase project profitability
  • Ensure all maintenance is performed within all regulatory guidelines, with a focus on meeting all applicable NERC Reliability Standards
What we offer
  • Affordable, quality healthcare for you and your family
  • Life insurance and disability benefits
  • Retirement benefits
  • Flexible spending accounts
  • Tuition reimbursement
  • Professional development allowance
  • 401k with match fully vested as of day one. Enel-NA matches 100% of the first 4% that you contribute up to set IRS limits
  • Generous PTO that supports work/life balance including: 4 weeks annually of vacation as well as personal days, volunteer days, your birthday off, paid holidays, and sick time
  • Paid leave programs
  • The opportunity to grow and develop your career with the support and mentorship of senior leaders
  • Fulltime

Credit Portfolio Senior Manager

The Credit Portfolio Senior Manager is a senior management-level position respon...
Location:
Mexico, Ciudad De Mexico
Salary:
Not provided
Citi
Expiration Date
Until further notice
Requirements
  • 6-10 years of experience with analytical or data manipulation tools (e.g., SAS, SQL, R, SPSS)
  • Experience with econometric and statistical modeling or application risk scoring
  • Experience in big data with knowledge of probabilities/distributions, building decision trees and regression models
  • Proficiency in MS Office
  • Proven ability to derive patterns, trends and insights, perform risk/reward trade-off analyses and connect these analytics to business impacts
  • Demonstrated quantitative, presentation and analytic skills
  • Consistently demonstrate clear and concise written and verbal communication
  • Bachelor's degree/University degree or equivalent experience
  • Master's degree preferred
  • Experience in credit risk analysis
Job Responsibility
  • Maintain compliance with Citibank credit policies/practices, regulatory policies and prepare, update, and ensure approval of all internal policy and procedure changes, keeping proper change and approval logs
  • Conduct managerial responsibilities, including coaching/mentoring, performance management and evaluation (including managing the Review the Reviewer process), work assignment management and resource/capacity management, monitoring and escalating as needed
  • Continuously identify process improvements and efficiencies to support business unit goals through dissecting issues and making recommendations
  • Monitor daily queue dashboard and production reports and address production and/or pipeline management issues, serving as an escalation point of contact for these and SLA-related issues with the goal of enhancing customer experience
  • Conduct analyses related to policy (e.g., benefits tracking), bureau and alternate credit information, risk and reward trade-off, etc. by leveraging data and creating ad-hoc analyses/management information systems (MIS)
  • Identify business opportunities, develop credit tests and risk strategies, balancing risk and return, and work with cross-functional teams to ensure flawless execution of policy tests and strategies
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency, as well as effectively supervise the activity of others and create accountability with those who fail to maintain these standards.
  • Fulltime

VP of Engineering

Columbus, Ohio client is seeking an experienced VP of Engineering to join its ex...
Location:
United States, Dublin, Ohio
Salary:
Not provided
Revel IT
Expiration Date
Until further notice
Requirements
  • Extensive experience (10+ years) in senior engineering leadership roles, such as VP of Engineering, Director of Engineering, or similar
  • Deep expertise in cloud-native architecture, particularly the AWS ecosystem: EKS, EC2, Lambda, PostgreSQL, S3, and so on
  • Proven technical depth in the current modern stack: Ruby on Rails, React/TypeScript, and modern CI/CD
  • Experience in managing, maintaining, and modernizing C# / .NET Framework applications and complex installer/deployment tooling (MSI, WiX)
  • Prior experience with security compliance (SOC 2, HIPAA) and implementing preventative security controls (SAST, SCA)
  • Demonstrated ability to drive cultural change and adopt new technologies, such as generative AI
  • 3-5 years of experience managing offshore teams
  • 2-3 years of experience leveraging AI to support the SDLC process to improve team efficiency with working examples
  • Working experience using Cursor, Microsoft Copilot, GitHub or similar AI tools in coding practice with measurable results
Job Responsibility
  • Serve as a strategic technology leader and visionary, driving the engineering organization’s architecture, product roadmap, and operational excellence
  • Define a unifying engineering strategy and roadmap for managing the success of customers, leveraging Compass and Traverse while designing our next-generation offerings with an AI first mindset
  • Establish clear architectural governance and set consistent technical direction across the entire product portfolio
  • Drive the evaluation and standardization of CI/CD practices and modern DevOps pipelines to enable scalable, efficient SaaS delivery
  • Lead the adoption and institutionalization of a GenAI-assisted approach to increase engineering throughput
  • Advance the use of GenAI and emerging technologies within the product to benefit customers and the next generation of innovation
  • Evaluate and refine the engineering organization to ensure an effective structure, strong leadership, and a culture that fosters innovation, continuous skill development, and scalable growth
  • Implement continuous improvement in AWS operations, monitoring, services, cost profiles and core engineering processes
  • Set measurable engineering goals that track output, identify improvement opportunities, and drive accountability
  • Implement continuous improvement in the Secure Development Lifecycle (SDLC) while supporting compliance and industry standard practices
  • Fulltime

Sr Organizational Capability Consultant, M&A Change Management & Communications

The Sr Consultant, Organizational Capability is a senior change effectiveness le...
Location:
United States, Bellevue; Overland Park
Salary:
110900.00 - 200100.00 USD / Year
T-Mobile
Expiration Date
Until further notice
Requirements
  • Bachelor's Degree in Business, Human Resources or related field
  • 7-10 years of experience driving end-to-end organizational effectiveness work, conducting organizational analyses to diagnose organizational issues, leading large-scale operating model and organizational design efforts, and working closely with and influencing executives
  • 5+ years of combined project and program management experience
  • 4-7 years of experience developing creative and complex communication materials
  • Effective and comfortable working with and communicating at all levels of the business (frontline through executives)
  • 5+ years leading high-level, critical initiatives within the Human Resources function in a large, complex corporate environment
  • Thorough knowledge of HR programs, systems, processes and approaches
  • 4-7 years of change management experience, including helping leaders embrace and drive significant strategic change
  • High level of business acumen, organizational experience and knowledge to effectively navigate complex matrix and prioritization related issues
  • Ability to think big picture, converting abstract concepts into actionable initiatives
Job Responsibility
  • Provide expert organizational effectiveness advising to support assigned lines of business, partnering with HR account teams and business leaders
  • Conduct organizational analysis to identify root causes of performance issues and opportunities, partnering with analytics teams to uncover bottlenecks and effectiveness factors. This includes leading research efforts to identify best standards, analyze existing data, and gather additional insights to inform organizational design and change management strategies
  • Responsible for developing and implementing change management strategies and plans that increase employee adoption and usage and minimize resistance. Lead end-to-end change management for M&A or enterprise transformations by developing comprehensive plans that include partner engagement, impact analysis, communications, training, business readiness, and sustainment to ensure successful integration and adoption
  • Lead the design or re-engineering of organizational operation models, architecting organizations and driving team effectiveness solutions that enable successful integration of people, processes, and systems in support of strategic objectives and future-state alignment
  • Collaborate with business leaders to address organizational design and development opportunities, including change management initiatives that enable teams to operate effectively and maintain engagement during organizational transitions
  • Develop high-performing team effectiveness mechanisms and interventions to enhance collaboration, decision-making, and performance, ensuring alignment across teams
  • Create executive-level presentations and advanced data visualizations to communicate insights, recommendations, and progress
  • Measure the success and effectiveness of initiatives, creating metrics to develop adoption, engagement, and overall change impact over time, and determining necessary actions to ensure long-term sustainment and continuous improvement
What we offer
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Annual bonus or periodic sales incentive or bonus
  • Medical, dental and vision insurance
  • Flexible spending account
  • Employee stock grants
  • Fulltime

Production Support Analyst

This hybrid position requires on-site presence in Madison, WI, 10 days per month...
Location:
United States, Madison
Salary:
Not provided
Carex Consulting Group
Expiration Date
Until further notice
Requirements
  • Minimum of 3 years of experience in data reporting, BI support, or technical analysis
  • Experience working with life insurance or financial services data (e.g., policy data, claims, underwriting, actuarial reporting)
  • Proficiency in SQL and business intelligence tools such as Power BI, Tableau, or QuickSight
  • Familiarity with ticketing systems (e.g., JIRA, ServiceNow) and documentation tools (e.g., Confluence)
  • Strong analytical thinking, communication, and problem-solving skills
  • Preferred experience with Crystal Reports, SSIS, SSRS, and ETL/data validation processes
  • Knowledge of life insurance data structures and terminology
  • Familiarity with Agile or ITIL frameworks
  • Bachelor’s degree preferred, or equivalent experience
Job Responsibility
  • Triage and resolve Tier 2 reporting issues related to dashboards, reports, and data extracts across underwriting, claims, and policy teams
  • Conduct root cause analysis across data pipelines, report logic, and business processes
  • Collaborate with BI developers, actuaries, and data engineers to implement fixes and enhancements
  • Review and classify incoming support tickets as bugs, enhancements, or new requests; prioritize and route them based on business impact
  • Partner with stakeholders in sales, operations, compliance, and finance to clarify reporting needs and validate deliverables
  • Monitor report performance and proactively identify and mitigate recurring issues
  • Validate data accuracy and ensure consistency with operational and regulatory definitions
  • Maintain and update support logs, resolution guides, and FAQs tailored to life insurance reporting workflows
  • Share knowledge with Tier 1 support and contribute to onboarding resources for new team members