CrawlJobs Logo

AI and DevOps Platform Support Engineer

United Kingdom, London · Job Posted March 25, 2026
Apply Position
Job Link Share

Job Description

Engineer the future of global finance. At Citi, our Tech team doesn’t just support finance – we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech. We are seeking a motivated individual contributor to work in our AI and DevOps Platform Support team in EMEA. This role is responsible for ensuring the stability, reliability, and performance of our critical AI and DevOps platforms. The team supports a wide range of services, including multiple AI applications, developer tools, and CI/CD pipeline technologies used by teams across the organization. The ideal candidate will manage incident and problem resolution and collaborate with engineering and development teams to improve platform services and supportability. Involved in short- to medium-term planning of actions and resources for own area.

Job Responsibility

  • Ensuring the stability, reliability, and performance of our critical AI and DevOps platforms
  • Manage incident and problem resolution and collaborate with engineering and development teams to improve platform services and supportability
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential
  • Drives continued cost reductions and efficiencies across the portfolios supported
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
  • Participates in business review meetings, relating technology tools strategies to business requirements
  • Assures adherence to all support process and tool standards
  • Act as the primary point of contact for platform matters, defining the vision and roadmap
  • Champion the platform's resilience strategy by planning and executing wargaming scenarios, chaos engineering tests, and disaster recovery drills
  • Drive a comprehensive automation strategy to reduce manual toil, improve deployment velocity, and identify opportunities to leverage AI for operational intelligence
  • Provides in-depth analysis with interpretive thinking to define problems and develop innovative solutions
  • Solves the highest impact, highest profile problems with significant impact
  • Develop and implement AI-powered solutions to automate routine support tasks, predict system failures, and optimize resource utilization

Requirements

  • Project management with demonstrable results in improving IT services
  • Capacity Planning/Forecasting exposure a plus
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Excellent analytical and problem-solving skills, with the ability to thrive in a fast-paced support role
  • Strong communication skills and the ability to explain complex technical concepts to diverse audiences
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Demonstrated experience in designing and implementing disaster recovery (DR) plans and conducting resilience tests (e.g., wargaming, failure simulation)
  • A creative and proactive mindset with a demonstrated ability to identify opportunities for process improvement and automation using AI/ML techniques
  • Bachelor’s/University degree, Master’s degree preferred

Nice to have

Capacity Planning/Forecasting exposure a plus

What we offer

  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

AI and DevOps Platform Support Engineer

8 matching positions

AI and DevOps Platform Support Lead

Engineer the future of global finance. At Citi, our Tech team doesn't just suppo...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of relevant experience in a hands-on technical or support leadership role
  • Experience contributing to architecture discussions and ensuring solutions align with enterprise standards and long-term maintainability
  • Experience working with senior stakeholders or technology partners
  • Demonstrated experience supporting IT service improvements or platform stability initiatives
  • Strong communication and presentation skills, with the ability to convey technical concepts clearly
  • Experience supporting or contributing to technical roadmaps or operational workstreams
  • Experience participating in resilience-related activities such as incident simulations, disaster recovery exercises, or stability testing
  • Ability to collaborate with cross-functional support teams and technology groups
  • Strong organizational and workload-planning skills
  • Consistently demonstrates clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Demonstrates a strong understanding of how application support contributes to the overall technology function and organizational objectives
  • Assist with vendor relationship management, including coordination with offshore managed services
  • Support efforts to improve service levels for end users by enhancing operational efficiencies and strengthening incident management, problem management, and knowledge-sharing practices
  • Partner with development teams to guide improvements in application stability and supportability
  • Contribute to frameworks for managing capacity, throughput, and latency
  • Assist in defining and implementing application onboarding guidelines and standards
  • Support team members by fostering a collaborative environment and encouraging skill development
  • Participate in cost-reduction efforts through Root Cause Analysis reviews, knowledge management, performance tuning, and user training
  • Participate in business review meetings to help align technology tools and strategies with business requirements
  • Ensure adherence to support processes and tool standards, and assist in enhancing processes to promote consistency and quality across the support program
  • Fulltime
Read More
Arrow Right

Ai And Devops Platform Support Lead

Engineer the future of global finance. At Citi, our Tech team doesn't just suppo...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of relevant experience in a hands-on technical or support leadership role
  • Experience contributing to architecture discussions and ensuring solutions align with enterprise standards and long-term maintainability
  • Experience working with senior stakeholders or technology partners
  • Demonstrated experience supporting IT service improvements or platform stability initiatives
  • Strong communication and presentation skills, with the ability to convey technical concepts clearly
  • Experience supporting or contributing to technical roadmaps or operational workstreams
  • Experience participating in resilience-related activities such as incident simulations, disaster recovery exercises, or stability testing
  • Ability to collaborate with cross-functional support teams and technology groups
  • Strong organizational and workload-planning skills
  • Consistently demonstrates clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Demonstrates a strong understanding of how application support contributes to the overall technology function and organizational objectives
  • Assist with vendor relationship management, including coordination with offshore managed services
  • Support efforts to improve service levels for end users by enhancing operational efficiencies and strengthening incident management, problem management, and knowledge-sharing practices
  • Partner with development teams to guide improvements in application stability and supportability
  • Contribute to frameworks for managing capacity, throughput, and latency
  • Assist in defining and implementing application onboarding guidelines and standards
  • Support team members by fostering a collaborative environment and encouraging skill development
  • Participate in cost-reduction efforts through Root Cause Analysis reviews, knowledge management, performance tuning, and user training
  • Participate in business review meetings to help align technology tools and strategies with business requirements
  • Ensure adherence to support processes and tool standards, and assist in enhancing processes to promote consistency and quality across the support program
  • Fulltime
Read More
Arrow Right

AI and Devops Platform Support Lead

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5–7 years of relevant experience in a hands‑on technical or support leadership role
  • Experience contributing to architecture discussions and ensuring solutions align with enterprise standards and long‑term maintainability
  • Experience working with senior stakeholders or technology partners
  • Demonstrated experience supporting IT service improvements or platform stability initiatives
  • Strong communication and presentation skills
  • Experience supporting or contributing to technical roadmaps or operational workstreams
  • Experience participating in resilience‑related activities such as incident simulations, disaster recovery exercises, or stability testing
  • Ability to collaborate with cross‑functional support teams and technology groups
  • Strong organizational and workload‑planning skills
  • Consistently demonstrates clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Contribute to the stability, reliability, and performance of critical AI and DevOps platforms
  • Support a wide range of services, including multiple AI applications, developer tools, and CI/CD pipeline technologies
  • Help lead a team of SRE and Support engineers
  • Facilitate incident and problem resolution
  • Collaborate with engineering and development teams to enhance platform services and supportability
  • Short‑term planning and coordination of actions and resources within the team
  • Assist with vendor relationship management, including coordination with offshore managed services
  • Support efforts to improve service levels for end users
  • Partner with development teams to guide improvements in application stability and supportability
  • Contribute to frameworks for managing capacity, throughput, and latency
  • Fulltime
Read More
Arrow Right

AI and DevOps Platform Support Manager

AI and DevOps Platform Support Manager is accountable for management of complex/...
Location
Location
Canada , Mississauga
Salary
Salary:
145100.00 - 217700.00 USD / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years relevant experience
  • Relevant experience in a technical leadership or management role with demonstrated success in building and scaling a high-performing support team
  • Experience of senior stakeholder management
  • Project management with demonstrable results in improving IT services
  • Exceptional communication and presentation skills, with the ability to articulate a technical vision and report on key metrics to senior leadership
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Effectively share information with other support team members and with other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Demonstrates an in-depth understanding of how apps support integrates within the overall technology function to achieve objectives
  • requires a good understanding of the industry
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential, work better in a highly integrated team environment and focus on bringing out their strengths
  • Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
  • Fulltime
Read More
Arrow Right

AI and DevOps Platform Support Manager

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
United Kingdom , Belfast
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Relevant experience in a technical leadership or management role with demonstrated success in building and scaling a high-performing support team
  • Experience of senior stakeholder management
  • Project management with demonstrable results in improving IT services
  • Exceptional communication and presentation skills, with the ability to articulate a technical vision and report on key metrics to senior leadership
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Effectively share information with other support team members and with other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Ability to communicate appropriately to relevant stakeholders
Job Responsibility
Job Responsibility
  • Demonstrates an in-depth understanding of how apps support integrates within the overall technology function to achieve objectives
  • requires a good understanding of the industry
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential, work better in a highly integrated team environment and focus on bringing out their strengths
  • Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right

AI DevOps / Platform Engineer

Hermeus is a high-speed aircraft manufacturer focused on the rapid design, build...
Location
Location
United States , Atlanta, GA / Los Angeles, CA
Salary
Salary:
105000.00 - 225000.00 USD / Year
hermeus.com Logo
Hermeus
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Software Engineering, or a related field
  • 5+ years of professional experience in DevOps, platform engineering, or MLOps
  • Proficiency with CI/CD tools (GitHub Actions, GitLab, Jenkins, etc.)
  • Hands-on experience with AWS or equivalent cloud platforms
  • Strong scripting and automation skills (Python, Bash)
  • Experience with containers and orchestration (Docker, Kubernetes)
  • Demonstrated ability to integrate AI/ML or intelligent automation into engineering workflows
Job Responsibility
Job Responsibility
  • Design, implement, and maintain CI/CD pipelines for aerospace software and simulation environments
  • Integrate AI-driven tools into engineering workflows (e.g., intelligent code assistance, anomaly detection, automated testing frameworks)
  • Build and manage scalable cloud infrastructure (AWS or equivalent) to support R&D, simulation, and production environments
  • Develop monitoring, logging, and alerting systems enhanced by AI for predictive insights
  • Collaborate with aerospace, flight software, and test engineers to ensure seamless integration between software, hardware, and operations
  • Write efficient, maintainable, and well-documented infrastructure code (Python, Bash, IaC)
  • Conduct system reviews and implement best practices for secure and reliable operations
  • Stay up-to-date with emerging AI/ML and DevOps technologies, applying them to improve speed, safety, and scalability
  • Fulltime
Read More
Arrow Right

Senior Software Engineer, AI Platform and Enablement

We're building a next-generation AI-powered platform and web application for cre...
Location
Location
United States , San Francisco
Salary
Salary:
180000.00 - 286000.00 USD / Year
descript.com Logo
Descript
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in deploying and managing AI models in production
  • Experience with the tools of large volume data pipelines like spark, flume, dask, etc.
  • Familiarity with cloud platforms (AWS, Google Cloud, Azure) and container technologies (Docker, Kubernetes)
  • Knowledge of DevOps and MLOps best practices
  • Strong problem-solving abilities and excellent communication skills
Job Responsibility
Job Responsibility
  • Build, maintain, and standardize third-party model integrations, including consulting for other engineering teams with AI model integration needs
  • Design, implement, and maintain our AI infrastructure supporting our machine learning life cycle, including data ingestion pipelines, training developer experience and infrastructure, evaluation frameworks, and deployments / GPU infrastructure
  • Collaborate with Product Managers, Research Engineers, and AI Researchers to understand their infrastructure needs and ensure our AI systems are robust, scalable, and efficient
  • Optimize and scale our models and algorithms for efficient inference
  • Deploy, monitor, and manage AI models in production
What we offer
What we offer
  • Generous healthcare package
  • 401k matching program
  • Catered lunches
  • Flexible vacation time
  • Fulltime
Read More
Arrow Right

AI and Devops Technical Support Analyst

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
Poland , Warsaw
Salary
Salary:
165020.00 - 280980.00 PLN / Year
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-8 years of relevant experience in technical support, platform operations, or engineering
  • Demonstrated experience supporting IT services, platform operations, or infrastructure, with participation in resilience and stability-focused activities
  • Exposure to architecture concepts with the ability to contribute to technical discussions and understand design decisions
  • Strong communication skills and a proven ability to collaborate effectively with business partners, engineering teams, and other cross-functional stakeholders
  • Experience with Red Hat OpenShift/Kubernetes, CI/CD tools, and modern observability/monitoring tools (e.g., Prometheus, Grafana, Splunk)
  • Experience with databases (e.g., Postgres, MongoDB) and scripting or coding abilities in languages like Java, Python, or Go
  • Strong organizational skills with the ability to manage a daily workload and effectively prioritize tasks
  • A working knowledge of Generative AI concepts
Job Responsibility
Job Responsibility
  • Collaborative Application Support: Provides application support by partnering with vendors, offshore managed services, and internal development and engineering teams to ensure platform stability and operational readiness
  • Service Level Improvement: Actively supports efforts to improve service levels by participating in incident management, problem resolution (RCA), and knowledge-sharing initiatives
  • Platform Resilience and Testing: Assists in resilience-related activities, including incident simulations, disaster recovery exercises, and platform readiness testing to ensure business continuity
  • Automation and Efficiency: Supports automation efforts designed to reduce manual tasks, improve operational efficiency, and contribute to cost-saving initiatives
  • Monitoring and Observability: Helps maintain platform health by supporting observability practices, including monitoring, logging, tracing, and alerting for proactive issue identification
  • Platform Onboarding and Enhancement: Assists with application onboarding activities and contributes to platform enhancement initiatives in partnership with engineering and support leads
  • Data Collection for Planning: Helps collect and track capacity, performance, and latency data to support platform planning efforts and prepare materials for business review meetings
  • Troubleshooting and Incident Response: Maintains a practical understanding of platform components (like OpenShift, ECS, CI/CD) to effectively support troubleshooting and incident response activities
What we offer
What we offer
  • Employer paid Defined Contribution Pension Plan contribution of 6% of employee’s pensionable earnings (PPE Program)
  • Employer paid Private Medical Care Package for employees and Private Medical Care Packages for certain family members available at preferential rates
  • Employer paid Life Insurance Program for employees and Life Insurance for certain family members available at preferential rates
  • Employee Assistance Program financed by Employer
  • Paid Parental Leave Program (maternity and paternity leave
  • statutory and 2 weeks additional paid paternity leave)
  • Sport Card for employees subsidized via Social Benefits Fund and Sport Cards for certain family members available at preferential rates
  • Additional benefits from Company’s Social Benefit Fund, in particular: Holidays Allowance, support for sport and cultural activities, team building events
  • Additional day off for volunteering
  • Cafeteria/ flex benefit – a company benefits system which enables employees to select and purchase benefits offered by a provider and available for employees on the platform
  • Fulltime
Read More
Arrow Right