CrawlJobs Logo

AI and DevOps Platform Support Manager

https://www.citi.com/ Logo

Citi

Location Icon

Location:
Canada , Mississauga

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

145100.00 - 217700.00 USD / Year

Job Description:

AI and DevOps Platform Support Manager is accountable for management of complex/critical/large professional disciplinary areas. Leads and directs a team of professionals. Requires a comprehensive understanding of multiple areas within a function and how they interact in order to achieve the objectives of the function. Applies in-depth understanding of the business impact of technical contributions. Strong commercial awareness is a necessity. Generally accountable for delivery of a full range of services to one or more businesses/ geographic regions. Excellent communication skills required in order to negotiate internally, often at a senior level. Some external communication may be necessary. Accountable for the end results of an area. Exercises control over resources, policy formulation and planning. Primarily affects a sub-function. Involved in short- to medium-term planning of actions and resources for own area. Full management responsibility of a team or multiple teams, including management of people, budget and planning, to include performance evaluation, compensation, hiring, disciplinary actions and terminations and budget approval. We are seeking an experienced and motivated Manager to lead our AI and DevOps Platform Support team in Canada. This role is responsible for ensuring the stability, reliability, and performance of our critical AI and DevOps platforms. The team supports a wide range of services, including multiple AI applications, developer tools, and CI/CD pipeline technologies used by teams across the organization. The ideal candidate will lead a team of support engineers, manage incident and problem resolution, and collaborate with engineering and development teams to improve platform services and supportability. Involved in short- to medium-term planning of actions and resources for own area.

Job Responsibility:

  • Demonstrates an in-depth understanding of how apps support integrates within the overall technology function to achieve objectives
  • requires a good understanding of the industry
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential, work better in a highly integrated team environment and focus on bringing out their strengths
  • Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
  • Participates in business review meetings, relating technology tools strategies to business requirements
  • Assures adherence to all support process and tool standards and work with Management to create new and/or enhance processes to ensure consistency and quality in “best practices” across the overall support program
  • Performs other duties and functions as assigned
  • Act as the primary point of contact for platform matters, defining the vision and roadmap in partnership with engineering leaders and business stakeholders
  • Champion the platform's resilience strategy by planning and executing wargaming scenarios, chaos engineering tests, and disaster recovery drills
  • Drive a comprehensive automation strategy to reduce manual toil, improve deployment velocity, and identify opportunities to leverage AI for operational intelligence
  • Define and drive the enterprise-wide observability strategy, ensuring the team has the tools and insights needed to guarantee platform health, performance, and cost-effectiveness. This includes overseeing monitoring, logging, tracing, and alerting
  • Remain hands-on and maintain a deep technical understanding of the platform architecture and services
  • Oversee the operational health of all production platforms (including OpenShift, ECS, CI/CD), ensuring SLAs are met and a robust incident management process is in place
  • Implement and manage comprehensive monitoring and observability strategies to ensure proactive issue detection, performance analysis, and system health checks across all supported platforms

Requirements:

  • 10+ years relevant experience
  • Relevant experience in a technical leadership or management role with demonstrated success in building and scaling a high-performing support team
  • Experience of senior stakeholder management
  • Project management with demonstrable results in improving IT services
  • Exceptional communication and presentation skills, with the ability to articulate a technical vision and report on key metrics to senior leadership
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Effectively share information with other support team members and with other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Ability to communicate appropriately to relevant stakeholders
  • Hands-on experience with modern observability and monitoring tools (e.g., Prometheus, Grafana, Splunk)
  • Bachelor’s/University degree, Master’s degree preferred

Additional Information:

Job Posted:
March 25, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI and DevOps Platform Support Manager

IT Development Manager for Data Intelligence Platform

You will lead technical developments of a Data Intelligence Platform and partner...
Location
Location
Poland , Warsaw
Salary
Salary:
Not provided
https://www.bosch.pl/ Logo
Robert Bosch Sp. z o.o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in developing Data Management and Analytics applications
  • Extensive Knowledge of (Meta) Data Management capabilities: Data Governance, Data Lineage, Data Assets, Data Products, Data Catalog, Data Marketplace, Data Policy, Ontologies
  • Considerable experience in designing, developing & integrating Data and Analytics applications using modern architectures and frameworks, structured and unstructured data
  • Broad and up to date technical knowledge: Databases (e.g. Oracle, Databricks), Middleware/Integration (e.g. Solace, Kafka), API Management, Cloud Providers (Azure, AWS, Google), AI Technologies (LLMs, Agents)
  • Experimental mindset, self-motivation to search for solutions and appreciate learning new things
  • Strong communication skills, proactive in contacting people
  • English (fluent in spoken and written)
Job Responsibility
Job Responsibility
  • Lead technical developments of a Data Intelligence Platform
  • Partner with Business, Solution and IT Architects on the strategy and delivery of the Platform functionalities
  • Define consistent system specific guidelines for the software development and configuration environment in alignment with central guidelines
  • Assess customer requirements from a technical perspective with respective effort estimations and assist in the design and development of proof of concept and prototypes
  • Document specifications and support the creation of operational support manuals during the technical implementation
  • Take over responsibility for interface implementation and documentation
  • Steer external and internal developers
  • Support DevOps by sizing and scalability concepts (for specific use cases)
What we offer
What we offer
  • Competitive salary + annual bonus
  • Hybrid work with flexible working hours
  • Referral Bonus Program
  • Copyright costs for IT employees
  • Private medical care and life insurance
  • Cafeteria System with multiple benefits (incl. MultiSport, shopping vouchers, cinema tickets, etc.)
  • Prepaid Lunch Card
  • Non-working day on the 31st of December
  • Fulltime
Read More
Arrow Right

IT Development Manager

At Bosch, we shape the future by inventing high-quality technologies and service...
Location
Location
Poland , Warszawa
Salary
Salary:
Not provided
https://www.bosch.pl/ Logo
Robert Bosch Sp. z o.o.
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of experience in developing Data Management and Analytics applications
  • Extensive Knowledge of (Meta) Data Management capabilities: Data Governance, Data Lineage, Data Assets, Data Products, Data Catalog, Data Marketplace, Data Policy, Ontologies
  • Considerable experience in designing, developing & integrating Data and Analytics applications using modern architectures and frameworks, structured and unstructured data
  • Broad and up to date technical knowledge: Databases (e.g. Oracle, Databricks), Middleware/Integration (e.g. Solace, Kafka), API Management, Cloud Providers (Azure, AWS, Google), AI Technologies (LLMs, Agents)
  • Experimental mindset, self-motivation to search for solutions and appreciate learning new things
  • Strong communication skills, proactive in contacting people
  • English (fluent in spoken and written), German would be a plus.
Job Responsibility
Job Responsibility
  • Lead technical developments of a Data Intelligence Platform
  • Partner with Business, Solution and IT Architects on the strategy and delivery of the Platform functionalities
  • Define consistent system specific guidelines for the software development and configuration environment in alignment with central guidelines
  • Assess customer requirements from a technical perspective with respective effort estimations and assist in the design and development of proof of concept and prototypes
  • Document specifications and support the creation of operational support manuals during the technical implementation
  • Take over responsibility for interface implementation and documentation
  • Steer external and internal developers
  • Support DevOps by sizing and scalability concepts (for specific use cases).
What we offer
What we offer
  • Competitive salary + annual bonus
  • Hybrid work with flexible working hours
  • Referral Bonus Program
  • Copyright costs for IT employees
  • Complex environment of working, professional support and possibility to share knowledge and best practices
  • Ongoing development opportunities in a multinational environment
  • Broad access to professional trainings (incl. language courses), conferences and webinars
  • Private medical care and life insurance
  • Cafeteria System with multiple benefits (incl. MultiSport, shopping vouchers, cinema tickets, etc.)
  • Prepaid Lunch Card
  • Fulltime
Read More
Arrow Right

AI and DevOps Platform Support Manager

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
United Kingdom , Belfast
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Relevant experience in a technical leadership or management role with demonstrated success in building and scaling a high-performing support team
  • Experience of senior stakeholder management
  • Project management with demonstrable results in improving IT services
  • Exceptional communication and presentation skills, with the ability to articulate a technical vision and report on key metrics to senior leadership
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Effectively share information with other support team members and with other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Ability to communicate appropriately to relevant stakeholders
Job Responsibility
Job Responsibility
  • Demonstrates an in-depth understanding of how apps support integrates within the overall technology function to achieve objectives
  • requires a good understanding of the industry
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential, work better in a highly integrated team environment and focus on bringing out their strengths
  • Drives continued cost reductions and efficiencies across the portfolios supported by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right
New

AI and DevOps Platform Support Engineer

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Project management with demonstrable results in improving IT services
  • Capacity Planning/Forecasting exposure a plus
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Excellent analytical and problem-solving skills, with the ability to thrive in a fast-paced support role
  • Strong communication skills and the ability to explain complex technical concepts to diverse audiences
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated 'book of work' for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulation
  • Demonstrated experience in designing and implementing disaster recovery (DR) plans and conducting resilience tests (e.g., wargaming, failure simulation)
  • A creative and proactive mindset with a demonstrated ability to identify opportunities for process improvement and automation using AI/ML techniques
Job Responsibility
Job Responsibility
  • Ensuring the stability, reliability, and performance of our critical AI and DevOps platforms
  • Manage incident and problem resolution and collaborate with engineering and development teams to improve platform services and supportability
  • Vendor relationship management including oversight for all offshore managed service
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput and latency
  • Define and implemented application on-boarding guidelines and standards
  • Work with various team members on coaching them on how to maximize their potential
  • Drives continued cost reductions and efficiencies across the portfolios supported
  • Evaluates subordinates' performance and makes decisions on pay increases, hiring, terminations and other personnel actions
What we offer
What we offer
  • 27 days annual leave (plus bank holidays)
  • A discretional annual performance related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime
Read More
Arrow Right
New

Application Production Support Engineer Generative AI

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 6-8 years of relevant experience in technical support, platform operations, or engineering
  • Exposure to architecture concepts with the ability to contribute to technical discussions and understand design decisions
  • Experience working with business partners, engineering teams, or technology stakeholders
  • Demonstrated experience supporting IT services, platform operations, or infrastructure components
  • Strong verbal and written communication skills, with the ability to document technical issues clearly
  • Experience supporting operational workstreams or participating in platform improvement initiatives
  • Participation in resilience‑related or stability‑focused activities preferred
  • Ability to collaborate effectively with cross‑functional teams
  • Strong organizational skills and ability to manage daily workload and task priorities
  • Working knowledge of Generative AI concepts preferred
Job Responsibility
Job Responsibility
  • Understand how application support functions within the broader technology organization and contributes to business objectives
  • Assist with vendor coordination and day‑to‑day interactions with offshore managed services
  • Support efforts to improve service levels, including participating in incident management, problem management, and knowledge‑sharing initiatives
  • Partner with development and engineering teams to support application stability and operational readiness
  • Assist in collecting capacity, performance, and latency data to support platform planning efforts
  • Support application onboarding activities using established guidelines and standards
  • Contribute to fostering a collaborative and supportive team environment that encourages skill development
  • Participate in cost‑efficiency initiatives such as Root Cause Analysis reviews, knowledge management, and performance tuning
  • Assist in preparing materials for business review meetings and help align technology activities with business needs
  • Follow established support processes and tool standards and provide input on improvement opportunities
  • Fulltime
Read More
Arrow Right
New

Ai devops platform support lead

Engineer the future of global finance. At Citi, our Tech team doesn’t just suppo...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10 years of relevant experience in a hands‑on technical leadership role
  • Lead architecture decision‑making for platform services, ensuring alignment with enterprise standards, long‑term scalability, and operational resilience
  • Experience with senior stakeholder management
  • Project management experience with demonstrable results in improving IT services
  • Exceptional communication and presentation skills, with the ability to articulate a technical vision and report on key metrics to senior leadership
  • A strong track record of developing and executing a strategic roadmap for a technical platform, balancing new features with a dedicated “book of work” for stability
  • Demonstrable experience leading resilience initiatives such as wargaming, disaster recovery planning, and incident response simulations
  • Ability to effectively share information with other support team members and other technology teams
  • Ability to plan and organize workload
  • Consistently demonstrates clear and concise written and verbal communication skills
Job Responsibility
Job Responsibility
  • Demonstrates an in-depth understanding of how application support integrates within the overall technology function to achieve objectives
  • requires a good understanding of the industry
  • Vendor relationship management, including oversight for all offshore managed services
  • Improve the service level the team provides to our end users, including maximizing operational efficiencies and strengthening incident management, problem management, and knowledge‑sharing practices
  • Guide development teams on application stability and supportability improvements
  • Formulate and implement a framework for managing capacity, throughput, and latency
  • Define and implement application onboarding guidelines and standards
  • Work with various team members, coaching them on how to maximize their potential, work better in a highly integrated team environment, and focus on bringing out their strengths
  • Drive continued cost reductions and efficiencies across the portfolios supported through Root Cause Analysis reviews, knowledge management, performance tuning, and user training
  • Participate in business review meetings, relating technology tools and strategies to business requirements
  • Fulltime
Read More
Arrow Right
New

Specialist, Data and AI - AI Platforms

The Specialist, Data & Analytics – AI Platforms acts as a technical steward with...
Location
Location
Canada , Toronto
Salary
Salary:
Not provided
aircanada.com Logo
Air Canada
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • University degree or technical certification in Engineering, Computer Science, Mathematics, Statistics, or other related technical fields
  • 5+ years of IT experience in large enterprise environments
  • Experience with technologies supporting AI/ML environments, including: Azure DevOps & GitHub, Azure Kubernetes Service, Databricks, Azure Machine Learning, Azure AI Foundry, Azure Data Factory, Azure Function Apps, Azure Storage Accounts, Key Vault, SQL Server Databases, Service Bus
  • Excellent communication skills (written and verbal)
  • Strong problem solving and analytical abilities
  • Proven capability to work effectively in team driven, collaborative environments
  • Demonstrate punctuality and dependability to support overall team success in a fast-paced environment
Job Responsibility
Job Responsibility
  • Integrate AI products, machine learning models, and data pipelines into enterprise systems, ensuring they are stable, scalable, and aligned with reference architecture
  • Configure and maintain connectivity between AI platforms (such as Databricks or Dataiku) and upstream/downstream systems
  • Support the onboarding of new AI products into shared platforms and contribute to continuous improvement of integration processes
  • Deploy, configure, and maintain AI workloads within Microsoft Azure, adhering to enterprise cloud standards
  • Manage compute, storage, networking, and other Azure resources needed for AI platforms
  • Contribute to environment governance by applying deployment patterns and enforcing development/test/production separation
  • Support infrastructure as code processes using approved templates and CI/CD pipelines
  • Implement and maintain CI/CD pipelines for models, inference services, and data workflows in accordance with defined MLOps standards
  • Support model packaging, deployment, versioning, rollbacks, and promotion across environments
  • Integrate monitoring, observability, and alerting tools into AI workloads to ensure operational health
Read More
Arrow Right

Director of Data, ML & AI Engineering

As Director of Data, ML & AI Engineering, you will lead the design, delivery, an...
Location
Location
United Kingdom , London
Salary
Salary:
Not provided
collinsongroup.com Logo
Collinson
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Senior leadership experience across data, platform, ML, and/or AI engineering in enterprise or federated environments
  • Deep understanding of modern cloud-native data platforms, large-scale distributed systems, and emerging data technologies
  • Proven experience delivering and evolving enterprise-scale data and AI platforms from inception to production
  • Hands-on knowledge of ML/AI operationalisation, including pipelines, lifecycle management, and experimentation frameworks
  • Demonstrated capability managing cost, risk, security, and compliance at scale
  • Strong people leadership and team development experience, promoting inclusion, clarity, and accountability
  • Ability to translate complex technical concepts into business impact with senior stakeholders
  • A collaborative, adaptive leadership style that encourages openness, trust, and curiosity
Job Responsibility
Job Responsibility
  • Lead the design and evolution of enterprise-grade data, ML, and AI engineering platforms, covering ingestion, transformation, feature management, model pipelines, and deployment
  • Ensure platforms are resilient, scalable, and production-ready to support both analytics and AI workloads
  • Balance continuous innovation with operational reliability, service continuity, and business value
  • Lead multiple engineering squads across data, platform, ML, and AI engineering disciplines
  • Establish clear engineering standards, ownership models, and accountability frameworks
  • Embed modern delivery practices such as DevOps, DataOps, MLOps, and AIOps to improve reliability and speed
  • Champion operational excellence, predictable delivery, and effective incident management
  • Partner with the VP of Analytics and Head of Innovation & AI to align platform capabilities with insight delivery, experimentation, and AI productisation
  • Provide high-quality, governed, production-ready data products and shared tools that empower analytics and AI teams
  • Accelerate time to value through automation, reusable patterns, and scalable platform abstractions
Read More
Arrow Right