CrawlJobs Logo

Senior Incident Operations and Optimization Specialist

https://www.citi.com/ Logo

Citi

Location Icon

Location:
India , Chennai

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

The Senior Incident Operations & Optimization Specialist for Mainframe & Batch is a specialized technical leadership role requiring deep expertise in mainframe operations, batch job scheduling, and enterprise-scale processing environments. This position is critical to the success of the Incident Reduction Program, providing delivery of solutions which optimize and automate operations workflows. You will be responsible for building automated incident remediation workflows and achieving measurable incident reduction through intelligent alert optimization, correlation, and automation while preserving the critical observability required for business-critical mainframe applications and batch processing. This role offers the unique opportunity to modernize event management for legacy systems using cutting-edge AIOps platforms and automation technologies.

Job Responsibility:

  • Conduct in-depth analysis of mainframe and batch processing alerts to identify chronic issues, reduce operational noise, and develop strategies to address high-volume incident generators, including recurring job failures
  • Design and implement domain-specific correlation, de-duplication, and suppression rules on AIOps and event management platforms
  • Develop logic that understands mainframe subsystem relationships and cascading batch job dependencies
  • Architect and develop automation playbooks for incident data enrichment, automated job restarts, and self-healing capabilities for common mainframe and batch processing failures
  • Assess monitoring gaps in mainframe and batch environments, proposing enhancements to ensure critical business processes have appropriate alerting coverage and align with enterprise standards
  • Partner closely with mainframe operations, batch scheduling, and application development teams to validate correlation logic, define automation initiatives, and provide expert guidance on modern event management practices
  • Continuously validate the effectiveness of implemented rules and automation
  • Establish feedback loops with operational teams to conduct post-implementation reviews and iterative improvements

Requirements:

  • Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
  • A minimum of 8+ years of hands-on experience in mainframe operations, batch processing, or enterprise workload automation
  • Proven track record in event management, alert tuning, and incident reduction within complex mainframe and batch environments, with quantifiable results
  • Direct, hands-on experience with modern AIOps and event management platforms is required
  • Deep understanding of mainframe architecture, operating systems, and subsystems
  • Expertise in enterprise workload automation, including job design, scheduling, and dependency management
  • Hands-on experience developing robust automation solutions using relevant scripting languages and modern automation frameworks
  • Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms
  • Excellent analytical abilities with a systematic approach to troubleshooting complex batch dependencies and failure propagation scenarios
  • Exceptional communication skills with the ability to bridge mainframe/legacy and modern technology teams, influence collaboration, and present technical concepts to diverse audiences

Nice to have:

  • An advanced degree in a relevant technical field
  • Relevant industry certifications (e.g., Mainframe, Workload Automation, Automation, ITIL)
  • Experience with mainframe modernization initiatives, DevOps, and CI/CD pipelines
  • Familiarity with specialized financial systems
  • Background in large-scale financial services or other regulated environments, including knowledge of disaster recovery and high-availability patterns

Additional Information:

Job Posted:
March 22, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Senior Incident Operations and Optimization Specialist

Senior Incident Optimization & Reliability Specialist - End-User Technology

The Senior Incident Optimization & Reliability Specialist serves as a critical b...
Location
Location
India , Chennai
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Computer Science, Information Technology, Computer Engineering, or a related technical field
  • A minimum of 8+ years of hands-on experience in IT operations, end-user computing, or a related field, with proven experience in incident reduction and operational excellence
  • Demonstrated success in leading event management and incident reduction initiatives with quantifiable results
  • Direct, hands-on experience with modern AIOps and enterprise event management platforms (e.g., BigPanda)
  • Deep understanding of end-user technology ecosystems, including VMWare-hosted cloud desktop infrastructure, Microsoft 365 suite (Teams, Outlook, Office), SharePoint, and collaboration platforms
  • Expertise with a broad range of domain-specific monitoring and observability tools
  • Hands-on experience developing robust automation solutions using scripting languages (e.g., Python, PowerShell) and modern automation frameworks
  • Proficiency in log analysis, pattern recognition, and using query languages for data analysis on log aggregation platforms
  • Excellent analytical abilities with a systematic approach to troubleshooting complex issues
  • Exceptional communication skills with the ability to influence and collaborate effectively across diverse, cross-functional teams
Job Responsibility
Job Responsibility
  • Conduct comprehensive analysis of alert and incident patterns to identify top sources of operational noise, determine root causes, and develop data-driven strategies for reduction
  • Design, implement, and optimize rules for event correlation, de-duplication, and suppression on AIOps and event management platforms
  • Architect and develop automation playbooks for incident data enrichment and create self-healing capabilities to reduce manual intervention (toil)
  • Assess the current observability footprint across all end-user technology domains
  • Champion and apply core SRE practices to systematically improve service reliability
  • Partner closely with end-user services, engineering, and platform teams to understand incident drivers, validate correlation logic, and provide expert guidance
  • Continuously validate the effectiveness of implemented rules and automation to ensure no business-impacting alerts are missed
  • Fulltime
Read More
Arrow Right

Gen AI Engineering and Scaled AI Transformation

Location
Location
Canada , Mississauga
Salary
Salary:
145100.00 - 217700.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of progressive experience in software engineering, ML, or AI platforms, with 5+ years leading senior engineers and architects
  • 3+ years of hands‑on experience deploying LLM‑based systems in production environments at enterprise scale
  • Demonstrated authority across commercial and open‑source LLM ecosystems (e.g., OpenAI, Anthropic, Google, Llama), including model selection, fine‑tuning, and hosting strategies
  • Proven ability to define enterprise-wide GenAI standards, reference architectures, and reusable accelerators
  • Demonstrated leadership in establishing prompt engineering standards and orchestration patterns
  • Experience optimizing latency, throughput, accuracy, and token cost across large‑scale GenAI workloads
  • Bachelor’s degree/University degree or equivalent experience
  • Master’s degree preferred
Job Responsibility
Job Responsibility
  • Acts as a senior technical authority on Large Language Models, including both commercial and open‑source ecosystems (OpenAI, Gemini, Claude, Llama)
  • Leads model selection and deployment strategy, balancing use‑case fit, data sensitivity, cost efficiency, latency, accuracy, and regulatory constraints
  • Guides decisions on hosted vs. private vs. fine‑tuned models, ensuring optimal trade‑offs between performance, control, and operational risk
  • Establishes enterprise standards for LLM lifecycle management, including upgrades, regression validation, and decommissioning
  • Demonstrates hands‑on leadership in building GenAI applications using LangChain, LangGraph, LlamaIndex, and Hugging Face, translating experimentation into production systems
  • Architects agentic and multi‑step workflows, enabling tool‑use, reasoning chains, state management, and orchestration at enterprise scale
  • Sets reusable reference patterns and accelerators for GenAI adoption across application teams
  • Ensures solutions are built with enterprise-grade reliability, explainability, and extensibility
  • Designs and delivers robust RAG architectures that ground GenAI outputs in trusted, auditable enterprise data
  • Leads implementation of vector databases and embedding strategies (pgvector, Pinecone, Weaviate, FAISS), aligned with data access and security models
  • Fulltime
Read More
Arrow Right

IAM Operations Specialist

IAM for User Operations Specialist is responsible for defining and delivering IA...
Location
Location
India , Bangalore Area
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Engineering (B.Tech) or equivalent
  • Minimum of 9 years of relevant experience
  • Exposure to hands on IT operations ideally in Identity Management
  • SQL Proficiencies
  • Knowledge of ITSM bricks (ServiceNow)
  • Solid transversal management skills, including the ability to work effectively across functions, divisions, and international teams
  • Knowledge of relevant laws, guidelines, or regulations pertaining to IT
  • Profound understanding of technical troubleshooting and problem-solving methodologies
  • Demonstrated strong interpersonal skills
  • Excellent proficiency in both written and spoken English
Job Responsibility
Job Responsibility
  • Define and deliver IAM (Identity and Access Management) for User operational checks
  • Lead and work in a self-sufficient multi-disciplinary team environment
  • Responsible for end to end product operation in collaboration with transnational teams
  • Drive the Identity service delivery, associated Vendor and act as focal point for Identity activities in India
  • Perform continuous checks on users data and apply correction to ensure frictionless JML Experience
  • Collaborate with Service Delivery Manager and IT Operation Specialist to Monitor Operational performance, identify areas for improvement, and implement corrective actions
  • Develop, implement and maintain operation documents, policies, procedures, documentation
  • Deliver Service Excellence by driving continuous Improvements and minimizing operational issues within IAM
  • Weekly connect with Business & functions to provide support
  • Impact Analysis on Incidents and Proactive Remediation planning and approach
What we offer
What we offer
  • Flexible working arrangements to stimulate innovative thinking
  • Fulltime
Read More
Arrow Right

IAM Operations Specialist

IAM for User Operations Specialist is responsible for defining and delivering IA...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
airbus.com Logo
Airbus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree in Engineering (B.Tech) or equivalent
  • Minimum of 9 years of relevant experience
  • Exposure to hands on IT operations ideally in Identity Management
  • SQL Proficiencies
  • Knowledge of ITSM bricks (ServiceNow)
  • Solid transversal management skills, including the ability to work effectively across functions, divisions, and international teams
  • Knowledge of relevant laws, guidelines, or regulations pertaining to IT
  • Profound understanding of technical troubleshooting and problem-solving methodologies
  • Demonstrated strong interpersonal skills
  • Excellent proficiency in both written and spoken English
Job Responsibility
Job Responsibility
  • Perform continuous checks on users data and apply correction to ensure frictionless JML Experience
  • Collaborate with Service Delivery Manager and IT Operation Specialist to Monitor Operational performance, identify areas for improvement, and implement corrective actions as necessary to address any issues or gaps in service delivery
  • Develop, implement and maintain operation documents, policies, procedures, documentation
  • Deliver Service Excellence by driving continuous Improvements and minimizing operational issues within IAM
  • Weekly connect with Business & functions to provide support
  • Impact Analysis on Incidents and Proactive Remediation planning and approach
  • Proficient in utilizing ITSM tools and Agile methodologies to optimize product delivery and operational efficiency
  • Develop and maintain regular reports on service delivery performance, including SLA adherence, KPI achievement, and customer satisfaction metrics
  • Analyze performance data to identify trends, root causes of issues, and opportunities for improvement, and present findings to senior management and clients as needed
  • Utilize data-driven insights to make informed decisions and drive strategic initiatives aimed at optimizing service delivery processes and outcomes
  • Fulltime
Read More
Arrow Right

Associate Director, US Tax Operations

As Associate Director of US Tax Operations, you will own and lead Deel’s end-to-...
Location
Location
United States
Salary
Salary:
Not provided
deel.com Logo
Deel
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7–10+ years of progressive experience in US tax operations, payroll tax compliance, or payroll operations, including: Multi-state and local tax complexity
  • Federal, state, and local filings and remittances
  • PEO and/or EOR operating models
  • 3–5+ years of people leadership experience, managing managers and/or complex operational teams
  • Deep expertise in US employment tax compliance, controls, audits, reconciliations, and risk management
  • Hands-on experience with enterprise payroll and tax systems, including: Prism
  • MasterTax
  • Comparable payroll and tax engines
  • Proven ability to scale operations in high-growth, multi-product, or enterprise environments
  • Strong operational judgment with the ability to balance compliance, automation, client experience, and speed
Job Responsibility
Job Responsibility
  • Own the end-to-end US tax operating model across: US PEO tax operations
  • US EOR tax operations
  • Managed US employer payroll tax (external clients)
  • Self-service US employer payroll tax (external clients)
  • Deel internal US payroll tax operations
  • Define and execute the US tax operations strategy, including scalability plans, automation roadmap, vendor approach, and risk mitigation
  • Establish and maintain tax governance frameworks, controls, policies, documentation, and escalation models
  • Serve as a senior operational authority for US tax matters across leadership, product, legal, and risk forums
  • Partner with Legal, Risk, and Compliance to interpret regulatory changes impacting PEO, co-employment, and EOR models
  • Ensure accurate and compliant execution of: Federal, state, and local tax withholdings and remittances
What we offer
What we offer
  • Stock grant opportunities dependent on your role, employment status and location
  • Additional perks and benefits based on your employment status and country
  • The flexibility of remote work, including optional WeWork access
  • Fulltime
Read More
Arrow Right

Vp data & insights

Curriculum Associates is seeking a VP of Data & Insights to lead their data ecos...
Location
Location
United States of America , Massachusetts
Salary
Salary:
149500.00 - 275500.00 USD / Year
edtechjobs.io Logo
EdTech Jobs
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proven experience leading full-stack data organizations including engineering, governance, operations, and analytics
  • Strong command of relational and cloud-based data platforms (Snowflake, Azure Data Services, Databricks, etc.)
  • Deep understanding of data modeling, ETL/ELT frameworks, pipelines, orchestration tools, and API integrations
  • Expertise in BI and visualization tools (Power BI preferred)
  • Thorough knowledge of enterprise data governance, data quality management, metadata, and master data concepts
  • 12–18 years of experience in data and analytics roles, with at least 7–10 years in leadership positions
  • Demonstrated ability to build and scale global teams in a matrixed, fast-moving environment
  • Strong communication and data storytelling skills with the ability to influence senior executives
  • Experience driving enterprise adoption of new tools, platforms, and ways of working
  • Experience supporting diverse business functions such as Product, Sales, Finance, Marketing, Operations, or Customer Experience
Job Responsibility
Job Responsibility
  • Define and execute a comprehensive enterprise-wide data and analytics strategy, aligned to organizational goals and future growth
  • Establish the roadmap for data engineering, governance, quality, operations, and insights—balancing innovation, speed, and reliability
  • Drive modernization of data platforms (e.g., Snowflake, Azure, Databricks) and BI tools (Power BI, Tableau)
  • Lead the design, build, and maintenance of scalable, secure, and reliable data pipelines, data models, and integration frameworks
  • Own the enterprise data architecture and evolving the data stack in partnership with Engineering and Cloud teams
  • Oversee ingestion, transformation, orchestration, and monitoring to ensure timely and accurate data availability
  • Establish a strong data governance operating model including ownership, stewardship, standards, and policies
  • Implement frameworks for data quality measurement, metadata management, cataloguing, lineage, and controlled access
  • Partner with Security, Compliance, Legal, and business leaders to ensure data is used responsibly, ethically, and compliantly
  • Own end-to-end data operations, focusing on system reliability, SLAs, monitoring, performance optimization, and operational excellence
What we offer
What we offer
  • Medical, dental, vision, and basic life insurance coverage for employees (and their families)
  • Company 401k plan with employer match
  • Flexible vacation and sick policy
  • Twelve paid holidays
  • Winter office closure between Christmas and New Year's
  • Fulltime
Read More
Arrow Right

Senior Specialist Service Operations

At SITA, we keep airports moving, airlines flying smoothly, and borders open. Ou...
Location
Location
Egypt , Cairo
Salary
Salary:
Not provided
sita.aero Logo
SITA
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in middleware technologies such as: Messaging broker systems: IBM MQ
  • Proxy servers : HA proxy, Max Scale
  • Application servers: WebSphere, JBoss, Tomcat
  • Hands on experience of operating systems (Linux Redhat)
  • Experience with cloud platforms (AWS or Azure) is a plus
  • Familiarity with databases (Oracle, Maria Database, SQL Server) is an asset
  • Scripting skills (e.g., Shell, Python, PowerShell) for automation tasks is plus
  • Familiarity with load balancers (F5, AVI or Elastic) is a plus
  • Strong problem-solving and analytical abilities
  • Excellent communication and teamwork skills
Job Responsibility
Job Responsibility
  • Install, configure, and maintain middleware messaging systems
  • Ensure middleware components are up-to-date with patches and upgrades
  • Respond to incidents reported by users or monitoring systems
  • Provide Service Operations support to internal and external customers in accordance with the terms of the customer contract and Service Level Agreements (SLAs)
  • Log and track issues using ticketing systems (e.g., ServiceNow, Jira)
  • Diagnose and resolve middleware-related issues, such as connectivity problems, performance bottlenecks, or configuration errors
  • Collaborate with development and infrastructure teams to address complex issues
  • Monitor performance using Dynatrace
  • Optimize middleware configurations to improve system efficiency and reliability
  • Participate in change advisory boards (CAB) and ensure all changes are documented and tested
What we offer
What we offer
  • Flex Week: Work from home up to 2 days/week
  • Flex Day: Make your workday suit your life and plans
  • Flex-Location: Take up to 30 days a year to work from any location in the world
  • Employee Wellbeing: Employee Assistance Program (EAP), for you and your dependents 24/7, 365 days/year
  • Champion Health - a personalized platform that supports a range of wellbeing needs
  • Professional Development: Level up your skills with our training platforms, including LinkedIn Learning
  • Competitive Benefits: Competitive benefits that make sense with both your local market and employment status
  • Fulltime
Read More
Arrow Right

Senior Solar Asset Specialist (Distributed Generation)

The SP Solar department within EDP’s Client Solutions platform is looking to str...
Location
Location
Spain , Oviedo
Salary
Salary:
Not provided
https://www.edp.com Logo
EDP
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • University degree in Engineering (Electrical, Industrial, Energy, Telecommunications, or similar)
  • Postgraduate studies in renewable energy, asset management, or operations are a plus
  • Minimum of 3 to 5 years of experience in asset management, operations, or O&M, preferably in solar photovoltaic energy or distributed generation environments
  • Proven experience managing energy asset portfolios and coordinating preventive and corrective maintenance activities
  • Experience managing external vendors, installers, and service contracts
  • Experience leading performance improvement projects and technical or operational optimization initiatives
  • Previous experience managing teams is a plus
  • Strong knowledge of O&M in solar photovoltaic installations
  • Ability to analyze asset technical performance, identify deviations, and propose corrective actions
  • Knowledge of budgeting, operational cost control, and tracking of technical and financial KPIs
Job Responsibility
Job Responsibility
  • Lead the end-to-end management of the Solar DG portfolio in Spain, currently comprising more than 808 assets and 122 MWp, with continuous growth
  • Coordinate and oversee Operations & Maintenance (O&M) activities, including planning, monitoring of preventive and corrective actions, and resolution of technical incidents
  • Ensure asset availability, technical performance, and optimization, as well as the accurate communication of information required for energy billing
  • Provide direct support to the Head of the area and coordinate a team of 4 people
  • Manage the O&M budget in Spain, ensuring cost control, service quality, and compliance with contractual commitments
  • Maintain close and effective relationships with external providers and installers, ensuring agreed service levels are met
  • Lead continuous improvement initiatives, performance optimization plans, process standardization, reporting, and quality assurance (QA/QC) projects
  • Coordinate and align strategies and initiatives at Iberian level in collaboration with the Portugal team, leveraging synergies and lessons learned
  • Act as a key point of contact with global and local teams such as Global Asset Management, Value Management, Field Services, and commercial teams
What we offer
What we offer
  • Empower our employees through a positive and innovative work environment that promotes collaboration and agile decision-making
  • Respect and value each person, providing a flexible, healthy, and inclusive workplace with a range of attractive benefits
  • Provide a meaningful work experience and prepare our people for future challenges through different opportunities for development and internal mobility
  • Fulltime
Read More
Arrow Right