AIOps Automation Engineering Lead

Citi

Location:
India, Chennai

Contract Type:
Employment contract

Salary:

Not provided

Job Description:

The Engineering Lead Analyst is a senior-level position responsible for leading a variety of engineering activities, including the design, acquisition and deployment of hardware, software and network infrastructure, in coordination with the Technology team. The position sits within the Production Management AIOps organization, which is at the forefront of transforming production management and operations through cutting-edge technologies. The incumbent will lead efforts to automate routine production tasks, enhance predictive capabilities, reduce manual intervention and ensure integration of AI into existing operational workflows.

Job Responsibility:

  • Serve as a technology subject matter expert for internal and external stakeholders and provide direction for all firm mandated controls and compliance initiatives, all projects within the group and in creating a technology domain roadmap
  • ensure that all integration of functions meets business goals
  • define necessary system enhancements to deploy new products and process enhancements
  • recommend product customization for system integration
  • identify problem causality, business impact and root causes
  • exhibit knowledge of how own specialty area contributes to the business and apply knowledge of competitors, products and services
  • advise or mentor junior team members
  • impact the engineering function by influencing decisions through advice, counsel or facilitating services
  • drive and implement rigorous quality standards for all aspects of the automation delivery from initial concept to final implementation
  • continually evolve the working practices within and services provided by Production Management (regionally and globally) to improve efficiency and productivity
  • continuously build forward-looking competency in automation, Artificial Intelligence, Robotic Process Automation, predictive analytics, decision analytics and technology platforms to deliver immediate results and long-term business impact
  • develop predictive models that will form the basis of information-driven strategies executed with respect to services provided by Production Management

Requirements:

  • 10+ years of relevant experience in an Engineering role
  • experience working in Financial Services or a large complex and/or global environment
  • project management experience
  • J2EE/microservices development experience, including running applications in cloud-native environments (Google Cloud, AWS, API Gateway technologies)
  • strong proficiency in JavaScript, including experience with ReactJS and NodeJS
  • experience with MongoDB or other NoSQL databases
  • solid understanding of Python and experience with relevant libraries
  • experience with version control systems like Git
  • knowledge of CI/CD pipelines and DevOps practices is a plus
  • consistently demonstrates clear and concise written and verbal communication
  • comprehensive knowledge of design metrics, analytics tools, benchmarking activities and related reporting to identify best practices
  • demonstrated analytic/diagnostic skills
  • ability to work in a matrix environment and partner with virtual teams
  • ability to work independently, multi-task, and take ownership of various parts of a project or initiative
  • ability to work under pressure and manage tight deadlines or unexpected changes in expectations or requirements
  • proven track record of operational process change and improvement

Nice to have:

  • knowledge of CI/CD pipelines and DevOps practices
  • project management experience

What we offer:

  • Equal opportunity employer
  • consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law

Additional Information:

Job Posted:
May 03, 2025

Employment Type:
Fulltime
Work Type:
On-site work

Similar Jobs for AIOps Automation Engineering Lead

Principal AIOps Engineer

We’re building a world of health around every individual — shaping a more connec...
Location:
United States
Salary:
144200.00 - 288400.00 USD / Year
CVS Health
Expiration Date:
July 01, 2026
Requirements:
  • 10+ years of experience in SRE, production operations supporting highly available services along with experience with Product model
  • Proven technical leadership: ability to set direction, lead cross-team initiatives, and advise stakeholders through architecture reviews, tradeoffs, and operational readiness
  • Strong programming/scripting skills (Python preferred) and experience building automation, integrations, and APIs
  • Experience integrating observability platforms and event sources across hybrid environments (cloud/on-prem) and operating production-grade monitoring/event management at scale
  • Strong ServiceNow experience as an ITSM system of record (Incident/Problem/Change; CMDB/asset concepts). Ability to build and operate integrations at scale (REST, webhooks, event management) to support automation and auditability
  • Python (preferred) for automation and data/ML pipelines; experience building integrations, services, and operational tooling
  • Workflow orchestration and integrations (ServiceNow APIs, event pipelines, runbook automation) with strong reliability, security, and auditability practices
  • Observability: Prometheus/Grafana, OpenTelemetry, ELK/Splunk/Datadog (or equivalent)
Job Responsibility:
  • Lead the AIOps strategy, roadmap, and operating model (intake, triage, automation lifecycle, KPIs) to measurably improve MTTR, alert quality, and operational efficiency
  • Own the observability-to-AIOps pipeline (metrics, logs, traces, events) and drive standardization of telemetry, service health models, and actionable alerting across teams and platforms
  • Design and implement event intelligence: correlation, deduplication, suppression, anomaly detection, incident clustering, and probable-cause analysis using topology/CMDB context
  • Advise operations, service owners, and leadership stakeholders; lead change enablement, adoption, and value measurement for AIOps and agentic automation across the organization
  • Develop ServiceNow-centric AIOps integrations (ITSM + ITOM/Event Management where applicable): event ingestion, alert-to-incident policies, enrichment, assignment/routing, approvals, change workflows, and closure updates for auditable closed-loop ops
  • Establish governance for operational AI (risk controls, approvals, auditability, data access, prompt/response logging, evaluation, and continuous improvement) in partnership with security, compliance, and operations
  • Build and operationalize agentic AI workflows for incident triage and resolution: signal summarization, similar-incident retrieval, knowledge article drafting, ticket updates, stakeholder communications, and human-in-the-loop remediation
  • Enable closed-loop automation and self-healing by connecting AIOps detections to orchestrated actions (runbooks/workflows), with clear approvals, safety checks, and rollback paths
  • Partner with NOC/SOC, infrastructure, and application owners to onboard services into AIOps, define service models, and improve signal quality, escalation paths, and operational readiness
What we offer:
  • Medical, dental, and vision coverage
  • Paid time off
  • Retirement savings options
  • Wellness programs
  • Bonus, commission or short-term incentive program
  • Equity award program
Employment Type:
Fulltime

Managing Vice President - Infrastructure Platforms & Operations

The Managing Vice President, Infrastructure Platforms & Operations is a senior t...
Location:
United States, Bethesda
Salary:
215700.00 - 389700.00 USD / Year
Marriott Bonvoy
Expiration Date:
May 20, 2026
Requirements:
  • Bachelor's degree in Computer Science, Information Systems, Engineering, Business Administration, or related technical field
  • 15+ years of senior leadership experience across cloud engineering, infrastructure platforms, network services, and/or enterprise workplace technologies, preferably in a large global Fortune 500 organization
  • 10+ years of prior hands-on technical engineering or development experience (cloud, infrastructure, networking, automation, or enterprise platforms)
  • Demonstrated success leading large, multi-disciplinary global engineering and operations organizations
  • Deep expertise in multi-cloud platforms, network architecture, DevSecOps, automation, and reliability engineering
  • Strong experience partnering with cybersecurity teams to deliver secure by design platforms
  • Proven ability to influence senior executives and lead transformation in complex, matrixed enterprises
  • Strong financial acumen with experience managing large technology budgets and vendor portfolios
Job Responsibility:
  • Lead global teams responsible for cloud foundations, DevOps and CI/CD platforms, automation, container platforms, service mesh, and self-service engineering capabilities
  • Oversee enterprise cloud landing zones across all regions, ensuring secure, scalable, and cost-efficient architecture
  • Drive modernization of hybrid platforms, including datacenter, edge compute, and infrastructure engineering capabilities
  • Oversee SRE, observability, resiliency, and disaster recovery governance
  • Lead global network architecture and operations across datacenter networks, property connectivity, enterprise networks, and cloud network integration
  • Drive transformation of Marriott's global connectivity ecosystem, including SD WAN, wireless, secure network edge, voice, and network automation
  • Ensure network performance, reliability, compliance, and resiliency at global scale
  • Lead workplace technology platforms supporting collaboration, productivity, endpoint, and digital employee experience solutions
  • Partner with business, HR, and IT leaders to deliver intuitive, reliable, and secure workplace tools that enable associate productivity
  • Drive standardization, modernization, and lifecycle management of workplace platforms and services
What we offer:
  • 401(k) plan
  • stock purchase plan
  • discounts at Marriott properties
  • commuter benefits
  • employee assistance plan
  • childcare discounts
  • medical
  • dental
  • vision
  • health care flexible spending account
Employment Type:
Fulltime

Executive Director, Digital Engineering- Aetna Member Services

We’re building a world of health around every individual — shaping a more connec...
Location:
United States, Hartford
Salary:
175100.00 - 334750.00 USD / Year
CVS Health
Expiration Date:
Until further notice
Requirements:
  • 15+ years of software engineering experience with deep expertise in backend systems, distributed services, and API platforms
  • Proven experience leading large engineering organizations delivering mission‑critical services
  • Strong background in AWS cloud platform, microservices architecture, CI/CD pipelines, and DevOps/SRE practices
  • Demonstrated success driving stability, resiliency, and observability improvements at scale
  • Experience leveraging AI, ML, or LLM-based engineering and operational tooling
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility:
  • Lead the design, development, and delivery of scalable backend systems, APIs, and microservices powering member-facing capabilities
  • Define API contract standards, and integration patterns used across Member Services platforms
  • Drive service modernization by adopting cloud‑native architectures, containerization, service mesh, and event-driven patterns
  • Establish standards for availability, resiliency, performance, and disaster recovery across all services
  • Implement SLO/SLI/error budget frameworks, health checks, and high‑availability architectures
  • Institutionalize strong observability practices using metrics, logs, traces, and distributed monitoring
  • Drive continuous reliability improvements through chaos engineering, automated fault injection, and proactive root‑cause analysis
  • Integrate AI and LLM-based tooling into software development, QA, and operational processes (e.g., test automation, code generation, anomaly detection, intelligent incident triage)
  • Promote AIOps capabilities to reduce manual toil and amplify engineering productivity
  • Introduce AI-enhanced workflows across Member Services to improve personalization, routing, and intelligent decisioning
What we offer:
  • medical
  • dental
  • vision coverage
  • paid time off
  • retirement savings options
  • wellness programs
  • bonus
  • commission or short-term incentive program
  • equity award program
Employment Type:
Fulltime

Staff Software Development Engineer-Automation Engineer

We’re building a world of health around every individual — shaping a more connec...
Location:
United States
Salary:
106605.00 USD / Year
CVS Health
Expiration Date:
June 29, 2026
Requirements:
  • Extensive experience in software development and production support for enterprise systems
  • Strong expertise in automation/RPA platforms, scripting, and debugging complex workflows
  • Proven ability to lead incident response and root cause analysis in high-availability environments
  • Deep understanding of SDLC, CI/CD, release management, and production readiness standards
  • Bachelor's degree in Computer Science, Engineering, or equivalent practical experience
Job Responsibility:
  • Serve as the technical owner for production support of automation and RPA solutions across critical business processes
  • Lead incident triage, root cause analysis, and permanent remediation for high-severity automation failures
  • Establish and enforce runbooks, support models, escalation paths, and on-call readiness for automation platforms
  • Proactively identify systemic issues and implement stability, resiliency, and performance improvements
  • Provide hands-on technical leadership for automation design, debugging, and optimization in production environments
  • Review automation code and configurations to ensure adherence to standards, security, and reliability best practices
  • Partner with development teams to ensure production readiness of new automations before release
  • Guide architectural decisions that reduce operational complexity and technical debt
  • Design and maintain monitoring, alerting, and health dashboards for automation platforms
  • Drive adoption of AIOps, SRE, and automation-first support practices where applicable
What we offer:
  • Medical, dental, and vision coverage
  • Paid time off
  • Retirement savings options
  • Wellness programs
Employment Type:
Fulltime

Executive Director, Digital Engineering- Aetna Member Services

The Executive Director, Digital Engineering- Aetna Member Services is a senior t...
Location:
United States, Work at Home
Salary:
175100.00 - 334750.00 USD / Year
CVS Health
Expiration Date:
May 31, 2026
Requirements:
  • 15+ years of software engineering experience with deep expertise in backend systems, distributed services, and API platforms
  • Proven experience leading large engineering organizations delivering mission‑critical services
  • Strong background in AWS cloud platform, microservices architecture, CI/CD pipelines, and DevOps/SRE practices
  • Demonstrated success driving stability, resiliency, and observability improvements at scale
  • Experience leveraging AI, ML, or LLM-based engineering and operational tooling
  • Bachelor’s degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience
Job Responsibility:
  • Lead the design, development, and delivery of scalable backend systems, APIs, and microservices powering member-facing capabilities
  • Define API contract standards, and integration patterns used across Member Services platforms
  • Drive service modernization by adopting cloud‑native architectures, containerization, service mesh, and event-driven patterns
  • Establish standards for availability, resiliency, performance, and disaster recovery across all services
  • Implement SLO/SLI/error budget frameworks, health checks, and high‑availability architectures
  • Institutionalize strong observability practices using metrics, logs, traces, and distributed monitoring
  • Drive continuous reliability improvements through chaos engineering, automated fault injection, and proactive root‑cause analysis
  • Integrate AI and LLM-based tooling into software development, QA, and operational processes
  • Promote AIOps capabilities to reduce manual toil and amplify engineering productivity
  • Introduce AI-enhanced workflows across Member Services to improve personalization, routing, and intelligent decisioning
What we offer:
  • Affordable medical plan options
  • 401(k) plan (including matching company contributions)
  • Employee stock purchase plan
  • No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching
  • Paid time off
  • Flexible work schedules
  • Family leave
  • Dependent care resources
  • Colleague assistance programs
  • Tuition assistance
Employment Type:
Fulltime

AI Operations Tech Leader

We are looking for an experienced AI Ops Tech Leader - Operations Support to lea...
Salary:
Not provided
Lingaro
Expiration Date:
Until further notice
Requirements:
  • 10+ years in data engineering, AI/ML engineering, or operations support technology roles
  • 4-6+ years in technical leadership positions within operations support / IT operations / service operations environments
  • Proven track record delivering production AI/ML/data solutions that measurably improved operations support KPIs
  • Strong hands-on expertise with modern data/AI stacks (Python, Spark, Kafka, Airflow, cloud data platforms, PyTorch/TensorFlow, LLM frameworks) and integration into operations support ecosystems
  • Deep practical experience with AIOps patterns in live operations support settings: event correlation, anomaly detection, automated actions, predictive analytics, GenAI for ops
  • Experience leading development or significant enhancement of AIOps/internal tooling platforms specifically for operations support teams
  • Ability to stay deeply technical while leading people and strategy in a high-velocity operations support context
  • Excellent communication: can explain complex AI concepts to operations support practitioners and translate operational pain into technical roadmaps for executives
  • Strong bias for action, production impact, and reducing operational toil through intelligent automation
Job Responsibility:
  • Actively lead and contribute to high-impact data/AI projects that directly improve operations support outcomes
  • Design and deliver scalable features embedded into operations support workflows and platforms
  • Ensure solutions meet strict operations support SLAs for reliability, low latency, auditability, explainability, and zero-downtime deployment
  • Stay up to date with innovations and research in AIOps tools
  • Lead the architecture, development, and continuous enhancement of internal AIOps platforms and reusable components that power operations support teams
  • Serve as the lead AI technical authority and trusted advisor for all operations support programs, automation movements, and AI transformation efforts
  • Lead technical discussions, architecture reviews, PoCs, vendor evaluations, and solution selection
  • Identify, prioritize, and drive the highest-ROI AI use cases in operations support
  • Build, mentor, and lead a high-performing squad of AIOps specialists focused on operations support outcomes
  • Foster a culture of rapid experimentation, production-first mindset, and relentless focus on operational impact
Employment Type:
Fulltime

Apps Dev Tech Sr Lead Analyst

We are looking for an experienced Apps Development Group Manager to lead enginee...
Location:
United States, Jersey City
Salary:
176720.00 - 265080.00 USD / Year
Citi
Expiration Date:
May 21, 2026
Requirements:
  • Kotlin - Primary platform language — strong hands-on proficiency
  • Python - Hands-on expertise in data pipelines, AI/ML integration, scripting, and automation
  • Java - Extensive hands-on experience in high-throughput, production-grade Java engineering
  • JVM performance tuning
  • Microservices Architecture - Hands-on design of microservices ecosystems
  • Event-Driven & Messaging Systems - Deep hands-on expertise in Kafka or Solace
  • Low-Latency & High-Performance Computing - Hands-on profiling and optimization
  • High Availability & Fault Tolerance - Hands-on design of resilience patterns
  • Databases - Hands-on expertise in Oracle (SQL) and MongoDB (NoSQL)
  • AI & ML Integration - Hands-on experience designing and integrating AI/ML models
Job Responsibility:
  • Actively participate in system design, architecture reviews, and code reviews across TMZ platform teams
  • Contribute to the design of distributed, fault-tolerant, real-time systems for high-volume, low-latency equity trade processing
  • Write, review, and refactor production-grade code in Kotlin, Java, and Python
  • Lead design of event-driven, microservices-based architectures using Kafka or Solace
  • Drive low-latency and high-performance system design
  • Design and govern data architecture across Oracle (SQL) and MongoDB (NoSQL)
  • Champion trunk-based development, feature flags, and progressive delivery
  • Produce and review architecture decision records (ADRs) and technical design documents
  • Contribute hands-on to AI/ML integration on the TMZ platform
  • Lead the implementation of AI-powered platform capabilities
What we offer:
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • planned time off (vacation)
  • unplanned time off (sick leave)
  • paid holidays
Employment Type:
Fulltime

Apps Development Senior Manager

Location:
United States, Jersey City
Salary:
142320.00 - 213480.00 USD / Year
Citi
Expiration Date:
May 21, 2026
Requirements:
  • Kotlin - Primary language for platform services; strong hands-on proficiency required
  • Python - Used for data pipelines, AI/ML integration, scripting, and automation
  • Java - Core backend development; deep expertise in production-grade Java applications
  • Microservices Architecture - Design and delivery of loosely coupled, independently deployable services at scale
  • Event-Driven & Messaging Systems - Hands-on experience with Kafka or Solace for real-time, high-throughput event streaming and messaging
  • Low-Latency & High-Performance Computing - Proven experience optimizing systems for sub-millisecond to millisecond response times in high-volume financial environments
  • High Availability & Fault Tolerance - Design patterns for resilient systems: circuit breakers, bulkheads, failover, and graceful degradation
  • Databases - Strong proficiency in Oracle (SQL) for transactional data and MongoDB (NoSQL) for flexible, high-throughput data models
Job Responsibility:
  • Architect, design, develop, and maintain robust, scalable, and high-performance applications supporting equity trade settlement workflows on the Trade Manager Zone platform.
  • Lead the design of distributed, fault-tolerant, real-time systems capable of handling high-volume, low-latency trade processing across global markets.
  • Champion the use of AI-assisted coding tools (e.g., GitHub Copilot or equivalent GenAI tools) to accelerate developer productivity, reduce toil, and improve code quality.
  • Drive adoption of trunk-based development practices to enable continuous integration and rapid, safe delivery.
  • Ensure code is clean, maintainable, and testable — adhering to SOLID principles, design patterns, and platform engineering standards.
  • Actively contribute to hands-on coding, code reviews, and refactoring to maintain high engineering standards across the team.
  • Own the technical design of key platform components, producing clear architecture documentation and decision records.
  • Champion Test-Driven Development (TDD), Behavior-Driven Development (BDD), and high unit test coverage as non-negotiable engineering standards.
  • Introduce AI-powered code review tooling to complement human reviews — catching security vulnerabilities, anti-patterns, and performance issues at scale.
  • Apply predictive quality analytics to identify high-risk code changes before they reach production.
What we offer:
  • medical, dental & vision coverage
  • 401(k)
  • life, accident, and disability insurance
  • wellness programs
  • paid time off packages, including planned time off (vacation), unplanned time off (sick leave), and paid holidays
Employment Type:
Fulltime