CrawlJobs Logo

Senior Incident Operations and Optimization Specialist

https://www.citi.com/ Logo

Citi

Location Icon

Location:
India , Chennai

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

The Senior Incident Operations & Optimization Specialist for Mainframe & Batch is a specialized technical leadership role requiring deep expertise in mainframe operations, batch job scheduling, and enterprise-scale processing environments. This position is critical to the success of the Incident Reduction Program, providing delivery of solutions which optimize and automate operations workflows. You will be responsible for building automated incident remediation workflows and achieving measurable incident reduction through intelligent alert optimization, correlation, and automation while preserving the critical observability required for business-critical mainframe applications and batch processing. This role offers the unique opportunity to modernize event management for legacy systems using cutting-edge AIOps platforms and automation technologies.

Job Responsibility:

  • Conduct in-depth analysis of mainframe and batch processing alerts to identify chronic issues, reduce operational noise, and develop strategies to address high-volume incident generators, including recurring job failures
  • Design and implement domain-specific correlation, de-duplication, and suppression rules on AIOps and event management platforms
  • Develop logic that understands mainframe subsystem relationships and cascading batch job dependencies
  • Architect and develop automation playbooks for incident data enrichment, automated job restarts, and self-healing capabilities for common mainframe and batch processing failures
  • Assess monitoring gaps in mainframe and batch environments, proposing enhancements to ensure critical business processes have appropriate alerting coverage and align with enterprise standards
  • Partner closely with mainframe operations, batch scheduling, and application development teams to validate correlation logic, define automation initiatives, and provide expert guidance on modern event management practices
  • Continuously validate the effectiveness of implemented rules and automation
  • Establish feedback loops with operational teams to conduct post-implementation reviews and iterative improvements

Requirements:

  • Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
  • A minimum of 8+ years of hands-on experience in mainframe operations, batch processing, or enterprise workload automation
  • Proven track record in event management, alert tuning, and incident reduction within complex mainframe and batch environments, with quantifiable results
  • Direct, hands-on experience with modern AIOps and event management platforms is required
  • Deep understanding of mainframe architecture, operating systems, and subsystems
  • Expertise in enterprise workload automation, including job design, scheduling, and dependency management
  • Hands-on experience developing robust automation solutions using relevant scripting languages and modern automation frameworks
  • Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms
  • Excellent analytical abilities with a systematic approach to troubleshooting complex batch dependencies and failure propagation scenarios
  • Exceptional communication skills with the ability to bridge mainframe/legacy and modern technology teams, influence collaboration, and present technical concepts to diverse audiences

Nice to have:

  • An advanced degree in a relevant technical field
  • Relevant industry certifications (e.g., Mainframe, Workload Automation, Automation, ITIL)
  • Experience with mainframe modernization initiatives, DevOps, and CI/CD pipelines
  • Familiarity with specialized financial systems
  • Background in large-scale financial services or other regulated environments, including knowledge of disaster recovery and high-availability patterns

Additional Information:

Job Posted:
March 22, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Incident Operations and Optimization Specialist

New

Senior Incident Optimization Specialist - Enterprise Infrastructure

The Senior Incident Optimization Specialist serves as a critical bridge between ...
Location
Location
United States , Irving
Salary
Salary:
125760.00 - 188640.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
March 26, 2026
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
  • A minimum of 8+ years of hands-on experience in IT operations, infrastructure engineering, or system architecture within large-scale enterprise environments
  • Proven experience and demonstrated success in leading event management and incident reduction initiatives with quantifiable results
  • Direct, hands-on experience with modern AIOps and event management platforms is required
  • Deep understanding of enterprise infrastructure including virtualization architectures, container orchestration, microservices, and various storage architectures (block, file, object)
  • Expertise with a broad range of domain-specific monitoring tools for compute, virtualization, storage, and cloud platforms
  • Hands-on experience developing robust automation solutions using scripting languages and modern automation frameworks
  • Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms
  • Excellent analytical abilities with a systematic approach to troubleshooting complex issues and a holistic view of technology systems
  • Exceptional communication skills with the ability to influence and collaborate effectively across diverse, cross-functional teams and present technical concepts to various audiences
Job Responsibility
Job Responsibility
  • Conduct comprehensive analysis of alert and incident patterns to identify top sources of operational noise, determine root causes, and develop data-driven strategies for reduction
  • Design, implement, and optimize rules for event correlation, de-duplication, and suppression on AIOps and event management platforms
  • Develop domain-specific correlation logic leveraging configuration management data and infrastructure topology
  • Architect and develop automation playbooks for incident data enrichment and create self-healing capabilities for common and recurring infrastructure incident scenarios
  • Assess the current observability footprint across all infrastructure domains to identify gaps and propose enhancements that align with enterprise event management standards
  • Partner closely with infrastructure operations, engineering, and platform teams to understand incident drivers, validate correlation logic, and provide expert guidance on event management best practices
  • Continuously validate the effectiveness of implemented rules and automation to ensure no business-impacting alerts are missed
  • Monitor and report on alert quality metrics and lead iterative improvements
What we offer
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • discretionary and formulaic incentive and retention awards
  • Fulltime
!
Read More
Arrow Right
New

Senior Incident Optimization Specialist - Data & Middleware

The Senior Incident Operations & Optimization Specialist for Data & Middleware i...
Location
Location
United States , Irving
Salary
Salary:
125760.00 - 188640.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
March 26, 2026
Flip Icon
Requirements
Requirements
  • A minimum of 8+ years of hands-on experience in database administration, middleware engineering, or enterprise data platform operations
  • Proven experience in event management, alert tuning, and incident reduction for data and middleware services, with measurable results
  • Direct, hands-on experience with modern AIOps and event management platforms is required
  • Deep knowledge of both relational (e.g., Oracle, SQL Server) and NoSQL (e.g., MongoDB) database technologies, including clustering, replication, and performance tuning
  • Expertise in middleware platforms, including messaging technologies (e.g., MQ, Kafka) and application servers (e.g., WebSphere, Tomcat)
  • Hands-on experience developing robust automation solutions using relevant scripting languages (e.g., Python, Shell) and modern automation frameworks
  • Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms
  • Excellent analytical abilities with a systematic approach to troubleshooting complex data platform architectures and correlating infrastructure issues with application impact
  • Exceptional communication skills with the ability to collaborate effectively with DBAs, middleware engineers, and application teams, and to present technical concepts to diverse audiences
  • Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
Job Responsibility
Job Responsibility
  • Analyze and optimize monitoring across all database and middleware platforms to address high-volume, low-value alerts, identify patterns in incident generation, and determine root causes
  • Develop and implement domain-specific correlation, de-duplication, and suppression rules on AIOps and event management platforms
  • Create logic that understands database cluster relationships, messaging dependencies, and application-to-database connections
  • Architect and develop automation playbooks for incident data enrichment and automated remediation of common database and middleware issues, such as connection pool resets or service restarts
  • Identify monitoring gaps across the data and middleware landscape, proposing enhancements to ensure comprehensive health monitoring and address blind spots in transactional flows
  • Partner closely with Database Administration (DBA), middleware engineering, and application teams to validate correlation logic, build consensus on threshold changes, and provide expert guidance on event management best practices
  • Continuously validate the effectiveness of implemented rules and automation, ensuring critical health indicators remain highly visible
  • Lead post-implementation reviews and drive iterative improvements
What we offer
What we offer
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
  • discretionary and formulaic incentive and retention awards
  • Fulltime
!
Read More
Arrow Right

IAM Operations Specialist

IAM for User Operations Specialist is responsible for defining and delivering IA...
Location
Location
India , Bangalore Area
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Engineering (B.Tech) or equivalent
  • Minimum of 9 years of relevant experience
  • Exposure to hands on IT operations ideally in Identity Management
  • SQL Proficiencies
  • Knowledge of ITSM bricks (ServiceNow)
  • Solid transversal management skills, including the ability to work effectively across functions, divisions, and international teams
  • Knowledge of relevant laws, guidelines, or regulations pertaining to IT
  • Profound understanding of technical troubleshooting and problem-solving methodologies
  • Demonstrated strong interpersonal skills
  • Excellent proficiency in both written and spoken English
Job Responsibility
Job Responsibility
  • Define and deliver IAM (Identity and Access Management) for User operational checks
  • Lead and work in a self-sufficient multi-disciplinary team environment
  • Responsible for end to end product operation in collaboration with transnational teams
  • Drive the Identity service delivery, associated Vendor and act as focal point for Identity activities in India
  • Perform continuous checks on users data and apply correction to ensure frictionless JML Experience
  • Collaborate with Service Delivery Manager and IT Operation Specialist to Monitor Operational performance, identify areas for improvement, and implement corrective actions
  • Develop, implement and maintain operation documents, policies, procedures, documentation
  • Deliver Service Excellence by driving continuous Improvements and minimizing operational issues within IAM
  • Weekly connect with Business & functions to provide support
  • Impact Analysis on Incidents and Proactive Remediation planning and approach
What we offer
What we offer
  • Flexible working arrangements to stimulate innovative thinking
  • Fulltime
Read More
Arrow Right

IAM Operations Specialist

IAM for User Operations Specialist is responsible for defining and delivering IA...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Engineering (B.Tech) or equivalent
  • Minimum of 9 years of relevant experience
  • Exposure to hands on IT operations ideally in Identity Management
  • SQL Proficiencies
  • Knowledge of ITSM bricks (ServiceNow)
  • Solid transversal management skills, including the ability to work effectively across functions, divisions, and international teams
  • Knowledge of relevant laws, guidelines, or regulations pertaining to IT
  • Profound understanding of technical troubleshooting and problem-solving methodologies
  • Demonstrated strong interpersonal skills
  • Excellent proficiency in both written and spoken English
Job Responsibility
Job Responsibility
  • Perform continuous checks on users data and apply correction to ensure frictionless JML Experience
  • Collaborate with Service Delivery Manager and IT Operation Specialist to Monitor Operational performance, identify areas for improvement, and implement corrective actions as necessary to address any issues or gaps in service delivery
  • Develop, implement and maintain operation documents, policies, procedures, documentation
  • Deliver Service Excellence by driving continuous Improvements and minimizing operational issues within IAM
  • Weekly connect with Business & functions to provide support
  • Impact Analysis on Incidents and Proactive Remediation planning and approach
  • Proficient in utilizing ITSM tools and Agile methodologies to optimize product delivery and operational efficiency
  • Develop and maintain regular reports on service delivery performance, including SLA adherence, KPI achievement, and customer satisfaction metrics
  • Analyze performance data to identify trends, root causes of issues, and opportunities for improvement, and present findings to senior management and clients as needed
  • Utilize data-driven insights to make informed decisions and drive strategic initiatives aimed at optimizing service delivery processes and outcomes
  • Fulltime
Read More
Arrow Right

Associate Director, US Tax Operations

As Associate Director of US Tax Operations, you will own and lead Deel’s end-to-...
Location
Location
United States
Salary
Salary:
Not provided
deel.com Logo
Deel
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7–10+ years of progressive experience in US tax operations, payroll tax compliance, or payroll operations, including: Multi-state and local tax complexity
  • Federal, state, and local filings and remittances
  • PEO and/or EOR operating models
  • 3–5+ years of people leadership experience, managing managers and/or complex operational teams
  • Deep expertise in US employment tax compliance, controls, audits, reconciliations, and risk management
  • Hands-on experience with enterprise payroll and tax systems, including: Prism
  • MasterTax
  • Comparable payroll and tax engines
  • Proven ability to scale operations in high-growth, multi-product, or enterprise environments
  • Strong operational judgment with the ability to balance compliance, automation, client experience, and speed
Job Responsibility
Job Responsibility
  • Own the end-to-end US tax operating model across: US PEO tax operations
  • US EOR tax operations
  • Managed US employer payroll tax (external clients)
  • Self-service US employer payroll tax (external clients)
  • Deel internal US payroll tax operations
  • Define and execute the US tax operations strategy, including scalability plans, automation roadmap, vendor approach, and risk mitigation
  • Establish and maintain tax governance frameworks, controls, policies, documentation, and escalation models
  • Serve as a senior operational authority for US tax matters across leadership, product, legal, and risk forums
  • Partner with Legal, Risk, and Compliance to interpret regulatory changes impacting PEO, co-employment, and EOR models
  • Ensure accurate and compliant execution of: Federal, state, and local tax withholdings and remittances
What we offer
What we offer
  • Stock grant opportunities dependent on your role, employment status and location
  • Additional perks and benefits based on your employment status and country
  • The flexibility of remote work, including optional WeWork access
  • Fulltime
Read More
Arrow Right

Vp data & insights

Curriculum Associates is seeking a VP of Data & Insights to lead their data ecos...
Location
Location
United States of America , Massachusetts
Salary
Salary:
149500.00 - 275500.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience leading full-stack data organizations including engineering, governance, operations, and analytics
  • Strong command of relational and cloud-based data platforms (Snowflake, Azure Data Services, Databricks, etc.)
  • Deep understanding of data modeling, ETL/ELT frameworks, pipelines, orchestration tools, and API integrations
  • Expertise in BI and visualization tools (Power BI preferred)
  • Thorough knowledge of enterprise data governance, data quality management, metadata, and master data concepts
  • 12–18 years of experience in data and analytics roles, with at least 7–10 years in leadership positions
  • Demonstrated ability to build and scale global teams in a matrixed, fast-moving environment
  • Strong communication and data storytelling skills with the ability to influence senior executives
  • Experience driving enterprise adoption of new tools, platforms, and ways of working
  • Experience supporting diverse business functions such as Product, Sales, Finance, Marketing, Operations, or Customer Experience
Job Responsibility
Job Responsibility
  • Define and execute a comprehensive enterprise-wide data and analytics strategy, aligned to organizational goals and future growth
  • Establish the roadmap for data engineering, governance, quality, operations, and insights—balancing innovation, speed, and reliability
  • Drive modernization of data platforms (e.g., Snowflake, Azure, Databricks) and BI tools (Power BI, Tableau)
  • Lead the design, build, and maintenance of scalable, secure, and reliable data pipelines, data models, and integration frameworks
  • Own the enterprise data architecture and evolving the data stack in partnership with Engineering and Cloud teams
  • Oversee ingestion, transformation, orchestration, and monitoring to ensure timely and accurate data availability
  • Establish a strong data governance operating model including ownership, stewardship, standards, and policies
  • Implement frameworks for data quality measurement, metadata management, cataloguing, lineage, and controlled access
  • Partner with Security, Compliance, Legal, and business leaders to ensure data is used responsibly, ethically, and compliantly
  • Own end-to-end data operations, focusing on system reliability, SLAs, monitoring, performance optimization, and operational excellence
What we offer
What we offer
  • Medical, dental, vision, and basic life insurance coverage for employees (and their families)
  • Company 401k plan with employer match
  • Flexible vacation and sick policy
  • Twelve paid holidays
  • Winter office closure between Christmas and New Year's
  • Fulltime
Read More
Arrow Right

Senior Specialist Service Operations

At SITA, we keep airports moving, airlines flying smoothly, and borders open. Ou...
Location
Location
Egypt , Cairo
Salary
Salary:
Not provided
sita.aero Logo
SITA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in middleware technologies such as: Messaging broker systems: IBM MQ
  • Proxy servers : HA proxy, Max Scale
  • Application servers: WebSphere, JBoss, Tomcat
  • Hands on experience of operating systems (Linux Redhat)
  • Experience with cloud platforms (AWS or Azure) is a plus
  • Familiarity with databases (Oracle, Maria Database, SQL Server) is an asset
  • Scripting skills (e.g., Shell, Python, PowerShell) for automation tasks is plus
  • Familiarity with load balancers (F5, AVI or Elastic) is a plus
  • Strong problem-solving and analytical abilities
  • Excellent communication and teamwork skills
Job Responsibility
Job Responsibility
  • Install, configure, and maintain middleware messaging systems
  • Ensure middleware components are up-to-date with patches and upgrades
  • Respond to incidents reported by users or monitoring systems
  • Provide Service Operations support to internal and external customers in accordance with the terms of the customer contract and Service Level Agreements (SLAs)
  • Log and track issues using ticketing systems (e.g., ServiceNow, Jira)
  • Diagnose and resolve middleware-related issues, such as connectivity problems, performance bottlenecks, or configuration errors
  • Collaborate with development and infrastructure teams to address complex issues
  • Monitor performance using Dynatrace
  • Optimize middleware configurations to improve system efficiency and reliability
  • Participate in change advisory boards (CAB) and ensure all changes are documented and tested
What we offer
What we offer
  • Flex Week: Work from home up to 2 days/week
  • Flex Day: Make your workday suit your life and plans
  • Flex-Location: Take up to 30 days a year to work from any location in the world
  • Employee Wellbeing: Employee Assistance Program (EAP), for you and your dependents 24/7, 365 days/year
  • Champion Health - a personalized platform that supports a range of wellbeing needs
  • Professional Development: Level up your skills with our training platforms, including LinkedIn Learning
  • Competitive Benefits: Competitive benefits that make sense with both your local market and employment status
  • Fulltime
Read More
Arrow Right

Service Operations Specialist

To assure SITA's competitive strength and business growth through the provision ...
Location
Location
India , Bengaluru
Salary
Salary:
Not provided
sita.aero Logo
SITA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Minimum 3 -5 years of proven experience in the network and/or application/system support domain, IT System Administrator and application support role, or in a similar infrastructure-focused role
  • Must have dealt directly with external customers delivering to SLAs
  • A background in hybrid IT environments (on-premises and cloud), with practical knowledge of virtualization platforms (e.g., VMware) and cloud services (e.g., AWS)
  • Strong hands-on experience in managing and troubleshooting servers, network infrastructure, enterprise applications, and client systems in complex IT environments
  • Experience in operation and maintenance of airport IT systems, networking and airline-specific applications is highly preferred
  • A background in Airport IATA standards, airline infrastructure/applications, SBD, E-Gates, and airport passenger/baggage (Pax/Bags) systems would be an added advantage
  • Proficiency in Windows and Linux server environments, including installation, configuration, and administration
  • Strong knowledge of networking concepts and protocols such as TCP/IP, DNS, DHCP, and VPN
  • Strong hardware knowledge such as server, router, switch etc.
  • Knowledge on web server such as Apache, Tomcat
Job Responsibility
Job Responsibility
  • Provide Service Operations support to internal and external customers in accordance with the terms of the customer contract and Service Level Agreements (SLAs)
  • Ensure the correct functioning and maintenance of all internal and external systems and products serviced by Service Operations
  • When required act as the customer SPOC and co-ordinate the scheduling of intervention with Customer's internal resolver groups and the Service Desk ensuring the highest level of customer services and communications are maintained to resolve the fault and incident within the prescribed SLA
  • Carry out incident and problem management support to the highest standards and co-ordinate the resolution with the appropriate resolver groups
  • Ensure shortest restoral times possible initiating the timely escalations to specialized resolver groups inside and outside SITA according to the customer contracts SLAs and monitoring requirements
  • To ensure the Service Operations team adheres to the highest working standards for all incidents and problems by providing guidance support and direct management
  • Proactively detect problems related to service and infrastructure operations and delivery services conduct diagnostics and provide service request ownership to ensure resolution of customer problems
  • Support the senior team members in the management reporting and co-ordination of day-day tasks during absence of the Lead Engineer
  • Adhere to installation guidelines and industry best practices in order to deliver quality service and infrastructure operations
  • Use the appropriate tools and equipment to perform the installation intervention and repairs in accordance with Service Operations and Delivery guidelines and instructions where provided
What we offer
What we offer
  • Flex Week: Work from home up to 2 days/week (depending on your team's needs)
  • Flex Day: Make your workday suit your life and plans
  • Flex-Location: Take up to 30 days a year to work from any location in the world
  • Employee Wellbeing: Employee Assistance Program (EAP), for you and your dependents 24/7, 365 days/year
  • Champion Health - a personalized platform that supports a range of wellbeing needs
  • Professional Development: Level up your skills with our training platforms, including LinkedIn Learning
  • Competitive Benefits: Competitive benefits that make sense with both your local market and employment status
  • Fulltime
Read More
Arrow Right