Senior Manager, AI Infrastructure and Operations

Pfizer

Location:
Japan, Tokyo

Contract Type:
Not provided

Salary:
Not provided

Job Description:

The Sr. Manager/Staff Engineer, AI Infrastructure & MLOps Engineering is a senior technical leader responsible for architecting, building, and scaling Pfizer’s AI infrastructure and developer platforms. This role leverages extensive experience in cloud engineering, DevOps, and MLOps to deliver robust, high-performance solutions supporting advanced AI/ML workloads in biotechnology, healthcare, and enterprise technology. The successful candidate will drive innovation in automation, reliability, and scalability, enabling scientists and engineers to rapidly develop, deploy, and monitor machine learning models in production environments.

Job Responsibility:

  • Design, implement, and own large-scale cloud-based HPC and MLOps platforms supporting AI model training, genomic sequencing, and precision medicine
  • Architect multi-environment clusters (AWS, GCP, Azure), enabling GPU/FPGA workloads and advanced observability
  • Lead the development of developer and cloud platforms, including internal engineering accelerators and reusable toolsets
  • Design, implement, and manage unified platform catalogs using Backstage, enhancing developer experience and application metadata management
  • Develop custom plugins and APIs for Backstage to support internal engineering workflows and documentation
  • Build and maintain Python-based automation frameworks, CI/CD pipelines, and Infrastructure-as-Code (Terraform, Helm, Pulumi, AWS CDK)
  • Operationalize containerized solutions using Docker and Kubernetes, integrating MLflow, Kubeflow, and other orchestration platforms
  • Implement robust automation for provisioning, configuring, and managing cloud resources across multiple environments
  • Lead the implementation of Service Level Indicators (SLIs), Service Level Objectives (SLOs), and advanced observability (Prometheus, Grafana, PagerDuty)
  • Develop and maintain APIs and services for model management, feature stores, and inference pipelines
  • Operationalize ML model serving at scale using frameworks such as TensorFlow Serving, TorchServe, KServe, and Seldon Core
  • Ensure compliance with industry standards (e.g., HIPAA, FDA) for data protection and reliability
  • Mentor engineers and lead cross-functional teams to deliver integrated solutions
  • Champion engineering excellence through design documentation, code reviews, and testing automation
  • Present at industry summits, author technical proposals, and contribute to open-source projects (Kubernetes, Helm, Go, Envoy)
  • Drive agile delivery, sprint planning, and performance optimization
  • Lead incident response and disaster recovery initiatives for mission-critical platforms
  • Foster a culture of shared ownership, transparency, and innovation

Requirements:

  • 8+ years of hands-on software engineering experience in cloud infrastructure, DevOps, and MLOps
  • Deep expertise in Python, Kubernetes, Terraform, Helm, and CI/CD pipeline development
  • Proven experience architecting and operating containerized solutions on AWS, GCP, and Azure
  • Strong knowledge of Infrastructure-as-Code, distributed systems, and production system reliability
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field

Nice to have:

  • Expertise in AWS cloud services (EC2, S3, Lambda, EKS, SageMaker, API Gateway, CloudFormation, IAM, etc.)
  • Experience deploying and customizing Backstage as a unified catalog for teams, services, and technical documentation
  • Experience building and deploying microservices and REST/gRPC APIs for AI model delivery
  • Familiarity with MLflow, Kubeflow, and other MLOps orchestration platforms
  • Proficiency with model serving frameworks (TensorFlow Serving, TorchServe, KServe, Seldon Core, BentoML, etc.)

Additional Information:

Job Posted:
February 20, 2026

Employment Type:
Full-time
Work Type:
On-site work

Similar Jobs for Senior Manager, AI Infrastructure and Operations

Senior Manager, Operations Knowledge Systems & Process Design

This isn't traditional knowledge management. You're building the operating syste...
Checkr
Location:
United States, Nashville
Salary:
Not provided
Expiration Date:
Until further notice
Requirements:
  • 7+ years of experience in operations, process improvement, knowledge management, or related fields
  • 3+ years leading teams or complex cross-functional initiatives
  • Demonstrated expertise in business process design, mapping, and optimization (Lean, Six Sigma, or similar methodologies)
  • Strong systems thinking—ability to see how knowledge, process, technology, and people interconnect
  • Proven ability to write clear, effective operational content that scales across audiences and channels
  • Data fluency: comfortable using metrics and analytics to drive decisions and measure impact
  • Experience building scalable solutions that work across multiple teams or functions
  • Excellent stakeholder management skills with ability to influence without authority
  • Clear, compelling communication—can translate complex systems into understandable frameworks
Job Responsibility:
  • Design and evolve the knowledge infrastructure that powers compliance operations, customer support, and external help center content
  • Write and oversee the creation of content that works—clear, actionable knowledge that scales across channels and use cases
  • Develop and maintain structured taxonomies and leverage AI-powered approaches for organizing and surfacing unstructured content
  • Create systems that enable both human agents and AI systems to leverage knowledge effectively
  • Establish frameworks for knowledge quality, governance, and lifecycle management that scale with business growth
  • Map, document, and optimize cross-functional processes across compliance, support, and supply chain operations
  • Design processes that balance efficiency, quality, and customer experience outcomes
  • Build process frameworks that support continuous improvement and rapid iteration
  • Harness conversation analytics and AI to surface patterns, gaps, and opportunities in knowledge and process performance
  • Use operational data and performance metrics to identify knowledge and process gaps and translate insights into action
What we offer:
  • Lunch four times a week
  • Commuter stipend
  • Snacks and beverages
Employment Type:
Full-time

Senior Engineering Manager - AI Core Platform

We’re hiring a Senior Engineering Manager (or high-potential EM2) for the Core P...
Intercom
Location:
Ireland, Dublin
Salary:
Not provided
Expiration Date:
Until further notice
Requirements:
  • Experience leading engineering teams, ideally across infrastructure or platform domains
  • Recent hands-on coding experience — you’ve shipped production code in the last couple of years
  • Strong technical judgment and the ability to coach senior engineers through complex architectural trade-offs
  • Adaptable leadership style suited to a group that will grow quickly, and change shape over time
  • Curiosity and enthusiasm for AI, with a desire to learn how ML systems are developed and operated in production
Job Responsibility:
  • Lead a high-performing team building the platform and infrastructure that power Intercom’s AI capabilities
  • Contribute directly to production code, staying close to the work and building knowledge & context through first-hand experience
  • Support teams of ML Scientists and Engineers building AI powered capabilities
  • Plan, prioritize, and deliver high-impact roadmaps in partnership with the team’s most senior engineers, balancing delivery, quality, and innovation
  • Improve developer experience across the AI infrastructure stack, ensuring that systems are observable, scalable, and easy to build upon
  • Empower the engineers on the team to act with agency and maximize their impact
  • Expand your scope over time, potentially taking ownership of additional platform domains as the team and AI initiatives grow
What we offer:
  • Competitive salary and equity in a fast-growing start-up
  • We serve lunch every weekday, plus a variety of snack foods and a fully stocked kitchen
  • Regular compensation reviews - we reward great work
  • Pension scheme & match up to 4%
  • Peace of mind with life assurance, as well as comprehensive health and dental insurance for you and your dependents
  • Flexible paid time off policy
  • Paid maternity leave, as well as 6 weeks paternity leave for fathers, to let you spend valuable time with your loved ones
  • If you’re cycling, we’ve got you covered on the Cycle-to-Work Scheme. With secure bike storage too
  • MacBooks are our standard, but we also offer Windows for certain roles when needed
Employment Type:
Full-time

Senior AI Infrastructure Engineer

This role will be responsible for designing, deploying, and maintaining high-per...
T-Mobile
Location:
United States, Bothell; Overland Park; Bellevue
Salary:
113600.00 - 205000.00 USD / Year
Expiration Date:
Until further notice
Requirements:
  • 5+ years technical engineering experience, preferably in multiple technology focus areas
  • Expert understanding of AI/ML infrastructure components or GPU-based systems, preferably in a high-availability, large-scale environment
  • Hands-on experience with NVIDIA DGX servers, BasePOD architectures, and advanced GPU technologies
  • Proficient in Linux/UNIX environments, including scripting/automation tools (Bash, Python, Ansible, Terraform)
  • Understanding of AI infrastructure security best practices
  • Experience with container orchestration (Kubernetes, Docker) and GPU workload management tools
  • Strong knowledge of networking (InfiniBand/Ethernet) and storage solutions in AI/ML contexts
Job Responsibility:
  • Technical System Expertise: Understands system protocols, how systems operate and data flows
  • Technical Engineering Services: Drives engineering projects by active contribution to the application of engineering techniques
  • Innovation: Contributes to designs that implement new ideas to improve existing and new systems, processes, and services
  • Technical Writing: Writes basic documentation on how technology works
  • Technical Leadership: Collaborates with technical teams and utilizes system expertise to deliver technical solutions
  • Technology Strategy: Contributes to new and existing technology options that support business goals
What we offer:
  • Competitive base salary and compensation package
  • Annual stock grant
  • Employee stock purchase plan
  • 401(k)
  • Access to free, year-round money coaches
  • Medical, dental and vision insurance
  • Flexible spending account
  • Paid time off
  • Paid holidays
  • Paid parental and family leave
Employment Type:
Full-time

Senior Engineering Manager, Search Infrastructure

The Search Platform team is responsible for powering all of Rovo Search as well ...
Atlassian
Location:
India, Bengaluru
Salary:
Not provided
Expiration Date:
Until further notice
Requirements:
  • 4+ years of experience managing high-performing software engineering teams running core services at scale
  • Deep technical experience building and scaling search applications and distributed systems using large amounts of data on cloud platforms, preferably AWS
  • Expert-level knowledge of low-latency distributed data management and query processing systems; experience with Lucene-based stacks is strongly preferred
  • Proven track record of consistent execution delivering outsized results with strong operational rigour; ability to continuously simplify and evolve reliable processes for the team
  • Strong organisation and communication skills with the ability to drive clarity in an ambiguous environment; ability to communicate technical concepts and results clearly and concisely, and to solicit feedback from various stakeholders
  • Ability to hire, onboard, and retain top talent for your team and foster a culture of innovation, collaboration, and excellence
  • Passion for mentoring and coaching your team members on best practices, code quality, design patterns, testing and operational skills
  • Focus on business outcomes and the 80/20 rule
Job Responsibility:
  • Designing the right goals in collaboration, and executing and delivering them for our customers
  • Responsible for building the highest performing teams consisting of deeply skilled and highly motivated engineers
  • Empowering and enabling team members to exceed their own potential
  • Primary architect of the culture in your team, aligned with our core values, focused on innovation and excellence
  • Help hire, develop and work closely with senior engineers to actively drive technical solutions and architecture
  • Own the quality of decision making along with their long-term outcomes
  • Continuously upgrading deep technical skills, engineering judgment and strong operational rigour
What we offer:
  • Health coverage
  • Paid volunteer days
  • Wellness resources
Employment Type:
Full-time

Revenue Operations Manager

This is one of the most critical roles driving the scalability and financial per...
Mentimeter
Location:
Sweden, Stockholm
Salary:
Not provided
Expiration Date:
Until further notice
Requirements:
  • 3+ years of experience in Operations (Revenue, Sales or Marketing Ops), SaaS Sales or Consultancy
  • Highly driven, proactive, and action-oriented with a strong bias toward execution
  • Curiosity about leveraging AI and automation to drive smarter decisions and improve operational effectiveness
  • Excellent communicator with the ability to align and collaborate effectively with senior leadership and cross-functional teams
  • Ability to work cross-functionally and align operational initiatives with business goals
  • Attention to detail and a structured, problem-solving mindset
  • Familiarity with SaaS sales processes and CRM data models
Job Responsibility:
  • Revenue Process Design and Implementation: Responsible for process design and driving scalability within our Enterprise Bow Tie funnel
  • Partnering with Revenue leaders to align Sales Ops initiatives with Mentimeter’s G2M strategy
  • Leading and contributing to cross-functional projects focused on revenue enablement and operational excellence
  • Implement process changes through tooling and data infrastructure, automating workflows where possible to ensure scalability
  • Drive cross-functional alignment and change management to ensure consistent process adoption and scalability
  • Tech Stack & System Enablement: Ownership of tools and systems that are the closest to your specialisation
  • Workflows and automation: Identify and implement workflow improvements that increase productivity and visibility throughout the funnel
  • Ensure data activation within the system
  • Ensure CRM data integrity: Responsible for legal compliance for the data in the tools and maintaining data hygiene
  • Take commercial ownership of renewal processes and negotiations, optimising costs and tool ROI
What we offer:
  • Diverse and inclusive work environment
  • Continuous professional development
  • Access to a leadership program (including external personal coach)
  • Relevant education
  • Competitive compensation and benefits package, including pension contributions

Engineering Manager, Infrastructure

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers...
Apollo.io
Location:
Canada; United States
Salary:
195000.00 - 285000.00 USD / Year
Expiration Date:
Until further notice
Requirements:
  • 5+ years of hands-on software or infrastructure engineering experience
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS)
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning
  • Strong grounding in networking, security, and reliability principles
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale
Job Responsibility:
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible)
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads
  • Run effective 1:1s, career development conversations, and quarterly performance reviews
  • Support recruiting efforts to attract top engineering talent across time zones
What we offer:
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
Employment Type:
Full-time

Senior Systems Architect, DAM & Infrastructure

The Senior Systems Architect is a cross-functional role on the Design Operations...
Block
Location:
United States, Bay Area
Salary:
153000.00 - 270000.00 USD / Year
Expiration Date:
Until further notice
Requirements:
  • AI fluency
  • 8+ years of experience in related roles (library science, mass communications, computer sciences, advertising, graphics design, marketing, or equivalent experience)
  • Track record of successful people management across multi-discipline teams
  • Ability to explain complex ideas to a variety of audiences, building collaborative relationships across teams
  • Excellent communication, problem solving, and analytical skills
  • Expertise with enterprise digital asset management systems, software, related tooling, and scalable workflows
  • Ability to design/extend cataloging taxonomy and define content policies
  • In-depth knowledge of file formats (print, digital, video) and media usage rights terminology
  • Familiarity with licensing agreements, talent contracts, and rights-management in the advertising, film, photography, and music industries
Job Responsibility:
  • Create and manage consistent asset management processes for Square's global DAM and MAM, with a focus on ecosystem definition that leverages AI for processing, ingesting, metadata tagging, cataloging, versioning, and distribution of assets
  • Develop systems to scale assets across channels and markets, owning user-friendly optimizations for localization and QA. Obtain final asset approvals from necessary stakeholders and communicate to all partners when delivering
  • Review and fulfill new asset requests, consult and recommend alternative solutions, when appropriate, for end-users
  • Manage rights information and governance for all licensed content: auditing, decommissioning, and gathering insights into usage and content gaps
  • Partner with IT, Design Operations, and Integrated Production teams: defining project request processes, workflow optimization, and coordination of project pipeline
  • Align our assets pipeline process with other business units, ensuring our MAM and other internal creative file systems are linked and automated with Square's DAM
  • Work with our tooling vendors and guide the Asset Management team to plan, build, and rollout system updates, fixes, and new feature implementations
  • Help educate teams on, and mature, the secure and compliant usage of licensed assets
What we offer:
  • Remote work
  • Medical insurance
  • Flexible time off
  • Retirement savings plans
  • Modern family planning

Senior Product Manager

As a Senior Product Manager for Private Cloud AI, you will lead the strategy, de...
Hewlett Packard Enterprise
Location:
United States, Spring, Texas
Salary:
117500.00 - 270000.00 USD / Year
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree or equivalent in computer science, engineering or related field of study
  • MBA or advanced degree in computer science or engineering preferred
  • 8+ years of work experience in related field
  • Technical understanding and knowledge of the AI infrastructure industry
Job Responsibility:
  • Define and execute a product strategy to unlock AI opportunities across the world’s largest organizations
  • Independently leads and drives the end-to-end strategy and operational product roadmap for one or more complex products
  • Defines the value proposition, target customer segments, and business case to bring one or more innovative and disruptive products to market
  • Synthesizes market requirements into marketing/customer details
  • Advises key stakeholders on the portfolio strategy across all phases of the lifecycle
  • Creates and drives goal alignment and collaborates across value chain partners to optimize margins and enable product success
What we offer:
  • Comprehensive suite of benefits for physical, financial and emotional wellbeing
  • Programs catered to helping you reach career goals
  • Unconditional inclusion celebrating individual uniqueness
  • Flexibility to manage work and personal needs
Employment Type:
Full-time