CrawlJobs Logo

AI and DevOps Platform Support Engineer

https://www.citi.com/ Logo

Citi

Location Icon

Location:
United Kingdom , London

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

Engineer the future of global finance. At Citi, our Tech team doesn’t just support finance – we are helping to redefine it. Every day, $5 trillion crosses through our network. We do business in 180+ countries operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Look to join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and get the opportunity to build the future of finance with Citi Tech. We are seeking a motivated individual contributor to work in our AI and DevOps Platform Support team in EMEA. This role is responsible for ensuring the stability, reliability, and performance of our critical AI and DevOps platforms. The team supports a wide range of services, including multiple AI applications, developer tools, and CI/CD pipeline technologies used by teams across the organization. The ideal candidate will manage incident and problem resolution and collaborate with engineering and development teams to improve platform services and supportability. Involved in short- to medium-term planning of actions and resources for own area.

Job Responsibility:

  • Ensuring the stability, reliability, and performance of our critical AI and DevOps platforms
  • Manage incident and problem resolution and collaborate with engineering and development teams to improve platform services and supportability
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential
  • Drives continued cost reductions and efficiencies across the portfolios supported
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
  • Participates in business review meetings, relating technology tools strategies to business requirements
  • Assures adherence to all support process and tool standards
  • Act as the primary point of contact for platform matters, defining the vision and roadmap
  • Champion the platform's resilience strategy by planning and executing wargaming scenarios, chaos engineering tests, and disaster recovery drills
  • Drive a comprehensive automation strategy to reduce manual toil, improve deployment velocity, and identify opportunities to leverage AI for operational intelligence
  • Provides in-depth analysis with interpretive thinking to define problems and develop innovative solutions
  • Solves the highest impact, highest profile problems with significant impact
  • Develop and implement AI-powered solutions to automate routine support tasks, predict system failures, and optimize resource utilization

Requirements:

  • Project management with demonstrable results in improving IT services
  • Capacity Planning/Forecasting exposure a plus
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Excellent analytical and problem-solving skills, with the ability to thrive in a fast-paced support role
  • Strong communication skills and the ability to explain complex technical concepts to diverse audiences
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Demonstrated experience in designing and implementing disaster recovery (DR) plans and conducting resilience tests (e.g., wargaming, failure simulation)
  • A creative and proactive mindset with a demonstrated ability to identify opportunities for process improvement and automation using AI/ML techniques
  • Bachelor’s/University degree, Master’s degree preferred

Nice to have:

Capacity Planning/Forecasting exposure a plus

What we offer:
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI and DevOps Platform Support Engineer

Senior Director of Platform Engineering

Lead the Future of Platform Engineering at Modus Create. As Senior Director of P...
Location
Location
United States of America
Salary
Salary:
Not provided
moduscreate.com Logo
Modus Create
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 15+ years in Platform Engineering/DevOps
  • 7+ years in senior engineering leadership
  • ideally in consulting or high-growth tech environments
  • a clear point of view on modern architecture, engineering best-practices, and agile delivery
  • proven experience scaling distributed global teams and platform engineering operations
  • strong pre-sales and delivery experience
  • able to shape winning proposals and roadmaps
  • a customer-first mindset and passion for solving complex problems with elegant, scalable solutions
  • excellent communication and collaboration skills in cross-functional and cross-cultural environments
  • a history of growing leaders and fostering high-trust, high-performance teams
Job Responsibility
Job Responsibility
  • Lead and scale a high-performing, distributed platform engineering team through strong mentorship and inclusive leadership
  • define what great looks like—through reusable runbooks, technical standards, and nurturing a culture grounded in quality, belonging, and continuous learning
  • help clients modernize platforms, launch new infrastructure, and make better innovation investment decisions
  • ensure every solution is aligned with client goals and drives measurable value
  • own and evolve our delivery frameworks, platform engineering standards, and team operations
  • champion cloud-native development, DevOps and SRE best practices, and scalable architecture
  • partner with Sales, Partnerships, and Client Executives to shape and win new opportunities
  • translate client needs into technical solutions, delivery plans, and estimates
  • lead development of proposals, estimation, and pre-sales architecture discussions
  • develop reusable solution assets, infrastructure templates and case studies for future engagements
What we offer
What we offer
  • Remote work with flexible working hours
  • Modus Global Office Programme: on-demand access to private offices, meeting rooms, coworking spaces and business lounges in locations in over 120 countries
  • Employee Referral Program
  • Client Referral Program
  • Travel according to client or team needs
  • The chance to work side-by-side with thought leaders in emerging tech
  • Access to more than 12,000 courses with a licensed Coursera account
  • Possibility to obtain paid certification/courses if they align with company goals and are relevant to the employee's role
  • Fulltime
Read More
Arrow Right

DevOps Quality Engineer

Engineer reliability at scale. We’re looking for a DevOps Quality Engineer who t...
Location
Location
Bulgaria , Sofia
Salary
Salary:
Not provided
ebrd.com Logo
European Bank for Reconstruction and Development
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Holds ISTQB Foundation as a minimum
  • Advanced Test Analyst or equivalent certifications desirable
  • Qualification in IT Service Management (ITIL v3 or v4 Foundation) or demonstrable experience integrating QA practices into ITSM processes
  • Familiar with the NIST Cybersecurity Framework (CSF) and Digital Operational Resilience Act (DORA), with practical awareness of how they influence quality standards and assurance planning
  • Demonstrates solid understanding of automation and non-functional testing concepts, including performance, accessibility, and shift-left/shift-right practices
  • Experience working within Agile, DevOps, and product-aligned teams, contributing to sprint-based delivery and continuous integration testing strategies
  • Proficient in test tooling and CI/CD frameworks including Azure DevOps, Selenium, Cypress, Jenkins, Git, and test management platforms such as TestRail or Zephyr
  • Familiarity with AI/ML use cases in quality engineering, including AI-assisted test case generation, defect clustering, and predictive analytics
  • Strong communication and collaboration skills, with the ability to explain test scenarios, defects, and coverage to technical and non-technical stakeholders
  • Awareness of security, compliance, and resilience considerations such as OWASP Top 10, ISO 27001, GDPR, and DORA, with practical experience embedding these into quality practices
Job Responsibility
Job Responsibility
  • Plans and performs testing of infrastructure changes, configuration updates, and cloud deployments across data centres, branch offices, and Azure environments, ensuring functional and non-functional validations
  • Collaborates with DevOps, Infrastructure, and Cloud Platform teams to verify IaC deployments, perform smoke testing post-deployment, and document any issues in platform reliability
  • Writes and maintains basic automation scripts or checklists for network, AV, and platform components, contributing to regression detection and provisioning assurance
  • Investigates environment or deployment defects, escalates issues with supporting diagnostics, and retests fixes with attention to uptime, latency, and failover coverage
  • Engages in sprint reviews and planning sessions, raising risks tied to infrastructure changes, recommending testing scope for AV, cloud services, or network resilience improvements
What we offer
What we offer
  • Varied, stimulating and engaging work that gives you an opportunity to interact with a wide range of experts in the financial, political, public and private sectors across the regions we invest in
  • A working culture that embraces inclusion and celebrates diversity
  • An environment that places sustainability, equality and digital transformation at the heart of what we do
  • Fulltime
Read More
Arrow Right

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location
Location
Salary
Salary:
Not provided
n-ix.com Logo
N-iX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility
Job Responsibility
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer
What we offer
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
Read More
Arrow Right

DevOps Engineer

We are looking for a skilled and motivated DevOps Engineer to join our team and ...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.inetum.com Logo
Inetum
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 2+ years of hands-on experience in DevOps or Cloud Engineering
  • Experience with Azure (preferred), AWS, or similar cloud platforms
  • Proficiency in Kubernetes, Docker, Helm, and Terraform
  • Familiarity with CI/CD tools (Azure DevOps, Jenkins) and scripting (Python, Bash)
  • Understanding of cloud networking, API gateways, load balancing, and monitoring/logging tools (Prometheus, Grafana, ELK)
  • Experience with data storage solutions and ETL/ELT concepts
  • Basic knowledge of cloud security, GDPR compliance, RBAC, and auditability
  • Strong troubleshooting skills and willingness to support incident management
  • English and Polish language at a minimum B2 level
  • Degree in Computer Science, IT, or related field—or equivalent experience.
Job Responsibility
Job Responsibility
  • Design, implement, and manage cloud infrastructure for GenAI-based platforms (Azure-focused)
  • Maintain and enhance CI/CD pipelines for deploying AI/ML and conversational AI solutions
  • Automate provisioning, monitoring, and scaling of cloud-native microservices (Kubernetes, Docker, Helm, Terraform)
  • Support production operations including monitoring, logging, alerting, and disaster recovery
  • Collaborate with AI/ML engineers, backend developers, and compliance teams to deliver GenAI products
  • Follow best practices for Infrastructure as Code (IaC) and contribute to cloud cost optimization
  • Integrate and manage data storage solutions (Azure PostgreSQL Flexible Server, data lakes, warehousing)
  • Build and maintain secure data pipelines (ETL/ELT)
  • Support API integrations, API gateways, and service mesh components for multi-channel (chat/voice) deployments
  • Ensure compliance with privacy, GDPR, and secure data handling standards
What we offer
What we offer
  • Flexible working hours
  • Hybrid work model
  • A cafeteria system that allows employees to personalize benefits by choosing from a variety of options
  • Generous referral bonuses, offering up to PLN6,000 for referring specialists
  • Additional revenue sharing opportunities for initiating partnerships with new clients
  • Ongoing guidance from a dedicated Team Manager for each employee
  • Tailored technical mentoring from an assigned technical leader, depending on individual expertise and project needs
  • Dedicated team-building budget for online and on-site team events
  • Opportunities to participate in charitable initiatives and local sports programs
  • A supportive and inclusive work culture with an emphasis on diversity and mutual respect.
  • Fulltime
Read More
Arrow Right

AI Data Engineer

The AI Data Engineer role involves designing and implementing cloud platforms fo...
Location
Location
United States , San Juan
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, engineering, information systems, or closely related quantitative discipline
  • 4-7 years’ experience
  • strong programming skills in Python, Java, Golang, or JavaScript
  • good understanding of distributed systems, event-driven programming paradigms, and designing for scale and performance
  • experience with cloud-native applications, developer tools, managed services, and next-generation databases
  • knowledge of DevOps practices like CI/CD, infrastructure as code, containerization, and orchestration using Kubernetes
  • good written and verbal communication skills
  • comfortable with AWS services
  • familiarity with the landscape of big data exploration, visualization, and prototyping platforms
  • familiarity with statistical and machine learning techniques
Job Responsibility
Job Responsibility
  • Research, propose, design, implement, operate and maintain cloud platforms for big data exploration and visualization, in support of a team of data scientists
  • deploy data science solutions into cloud environments
  • work with data scientists to troubleshoot cloud workflows
  • closely collaborate with our datalake team on cloud technologies
  • identify and implement cost-saving strategies to reduce ongoing cloud expenses
  • build CI/CD pipelines
  • deploy and maintain orchestration and monitoring systems for big data processing
  • help build images and containerize applications
What we offer
What we offer
  • Comprehensive suite of benefits that supports physical, financial, and emotional wellbeing
  • specific programs catered to professional development
  • inclusive working environment
  • Fulltime
Read More
Arrow Right

AI Lead Engineer

Lead Engineer role in HPE Hybrid Cloud focusing on AI innovation and technology ...
Location
Location
India , Bangalore
Salary
Salary:
Not provided
https://www.hpe.com/ Logo
Hewlett Packard Enterprise
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience designing and developing software systems design tools and languages in storage/server/networking area
  • Two or more years of experience in applying AI to practical and comprehensive technology solutions
  • Experience with ML, deep learning, TensorFlow, Python, NLP
  • Experience in program leadership, governance, and change enablement
  • Knowledge of basic algorithms, object-oriented and functional design principles, and best-practice patterns
  • Experience in REST API development, NoSQL database design, and RDBMS design and optimizations
  • Experience with innovation accelerators
  • Cloud Architectures
  • Cross Domain Knowledge
  • Design Thinking
Job Responsibility
Job Responsibility
  • Lead cross-functional teams in identifying and prioritizing key areas of a partner's business where AI solutions can drive significant business benefit
  • Design and develop solutions leveraging patterns in the data and metadata stored in Petabytes of Objects and Files in distributed fashion across enterprise storage platform
  • Design, develop, and deploy hybrid RAG architectures integrating LLMs with retrieval-based systems for improved relevance and contextual responses
  • Work on functional design, process design (including scenario design, flow mapping), prototyping, testing, training, and defining support procedures
  • Translating technical AI findings into clear, business-oriented language for non-technical stakeholders
  • Implement and manage pipelines that effectively combine retrieval mechanisms with generative capabilities
  • Develop custom plugins, adapters, or APIs to integrate retrieval systems with generative models
  • Fine-tune and optimize large language models
  • Monitor and troubleshoot issues within pipelines
  • Evaluate and benchmark the performance of vector databases
What we offer
What we offer
  • Health & Wellbeing benefits
  • Personal & Professional Development programs
  • Unconditional Inclusion environment
  • Fulltime
Read More
Arrow Right

ML Ops Engineer

As an MLOps Engineer, you will be responsible for building, maintaining, and opt...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
nstarxinc.com Logo
NStarX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4 to 10 years of experience in MLOps, DevOps, or ML Engineering
  • Strong proficiency with cloud platforms such as AWS, Azure, or GCP
  • Experience with containerization and orchestration tools like Docker and Kubernetes
  • Hands-on experience with ML model deployment, monitoring, and scaling
  • Proficiency with CI/CD tools such as Jenkins or GitLab CI
  • Familiarity with data versioning and management tools such as DVC
  • Strong coding skills in Python with knowledge of ML libraries like TensorFlow or PyTorch
  • Strong problem-solving skills and ability to work in a collaborative environment
  • Effective communication skills for cross-functional teamwork
Job Responsibility
Job Responsibility
  • Develop and manage infrastructure for end-to-end ML workflows including model training, deployment, monitoring, and maintenance
  • Implement CI/CD pipelines for ML models and data workflows
  • Collaborate with cross-functional teams to build scalable and robust ML infrastructure on cloud and on-premises environments
  • Monitor and optimize model performance and infrastructure to ensure efficient resource usage
  • Manage data versioning and model versioning across multiple environments
  • Implement security, governance, and compliance protocols in ML deployment and data pipelines
  • Support troubleshooting, debugging, and incident management for ML infrastructure issues
What we offer
What we offer
  • Competitive compensation
  • Opportunity to work with a dynamic team on cutting-edge AI and ML solutions
  • Professional growth and development opportunities
  • Fulltime
Read More
Arrow Right

Director of Data, ML & AI Engineering

As Director of Data, ML & AI Engineering, you will lead the design, delivery, an...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
collinsongroup.com Logo
Collinson
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Senior leadership experience across data, platform, ML, and/or AI engineering in enterprise or federated environments
  • Deep understanding of modern cloud-native data platforms, large-scale distributed systems, and emerging data technologies
  • Proven experience delivering and evolving enterprise-scale data and AI platforms from inception to production
  • Hands-on knowledge of ML/AI operationalisation, including pipelines, lifecycle management, and experimentation frameworks
  • Demonstrated capability managing cost, risk, security, and compliance at scale
  • Strong people leadership and team development experience, promoting inclusion, clarity, and accountability
  • Ability to translate complex technical concepts into business impact with senior stakeholders
  • A collaborative, adaptive leadership style that encourages openness, trust, and curiosity
Job Responsibility
Job Responsibility
  • Lead the design and evolution of enterprise-grade data, ML, and AI engineering platforms, covering ingestion, transformation, feature management, model pipelines, and deployment
  • Ensure platforms are resilient, scalable, and production-ready to support both analytics and AI workloads
  • Balance continuous innovation with operational reliability, service continuity, and business value
  • Lead multiple engineering squads across data, platform, ML, and AI engineering disciplines
  • Establish clear engineering standards, ownership models, and accountability frameworks
  • Embed modern delivery practices such as DevOps, DataOps, MLOps, and AIOps to improve reliability and speed
  • Champion operational excellence, predictable delivery, and effective incident management
  • Partner with the VP of Analytics and Head of Innovation & AI to align platform capabilities with insight delivery, experimentation, and AI productisation
  • Provide high-quality, governed, production-ready data products and shared tools that empower analytics and AI teams
  • Accelerate time to value through automation, reusable patterns, and scalable platform abstractions
Read More
Arrow Right