CrawlJobs Logo

Principal ML Ops Engineer

citizensbank.com Logo

Citizens Bank

Location Icon

Location:
United States , Charlotte

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

175000.00 - 230000.00 USD / Year

Job Description:

Principal ML Ops Engineer who will lead the design and operationalization of ML systems on AI/ML platforms such as AWS SageMaker and H2O.ai. This role focuses on building scalable ML systems rather than individual models and includes leadership responsibilities for mentoring and grooming talent in global capability centers (GCC) with potential onshore leadership opportunities. The position also requires hands-on experience with GenAI, including building intelligent agents and exposure to Agentic AI concepts.

Job Responsibility:

  • Lead and mentor engineering teams, including GCC talent development and potential onshore leadership
  • Architect, design, and build ML engineering systems on the CFG ML Platform to accelerate ML pipeline delivery
  • Develop and enhance platform capabilities and frameworks to standardize and automate ML pipeline deployment
  • Implement capabilities such as feature stores, feature tracking, feature serving (real-time and batch), model performance monitoring, model lineage tracking, model health, and model serving and consumption (real-time, batch, event-triggered, near real-time using Kafka)
  • Define processes, research market trends, and implement best practices for ML pipeline development and deployment
  • Collaborate with business teams, data science teams, enterprise architects, and security to uphold ML engineering standards
  • Develop CI/CD pipelines for continuous integration and delivery of ML models
  • Identify and automate ML pipeline and model deployment patterns to streamline workflows
  • Troubleshoot and resolve issues related to ML system performance and deployment
  • Contribute to GenAI initiatives, including building intelligent agents and integrating them into ML Ops workflows
  • Demonstrate exposure to Agentic AI concepts and proof-of-concepts (POCs)

Requirements:

  • 7+ years of experience with Python for scripting ML workflows
  • 5+ years of experience deploying ML pipelines and systems using AWS SageMaker
  • 3+ years of experience developing APIs with Flask, Django, or FastAPI
  • 2+ years of experience with ML frameworks and tools such as scikit-learn, PyTorch, XGBoost, LightGBM, MLflow
  • Solid understanding of the ML lifecycle: model development, training, validation, deployment, and monitoring
  • Solid understanding of CI/CD pipelines for ML workflows using Bitbucket, Jenkins, Nexus
  • Experience with ETL processes for ML pipelines using Spark and Kafka
  • Bachelor’s Degree or equivalent combination of education, training, and experience required

Nice to have:

  • Preferred experience with H2O.ai
  • Preferred experience with containerization using Docker and orchestration using Kubernetes
  • Required exposure to GenAI and Agentic AI concepts, including building or contributing to POCs
What we offer:
  • comprehensive medical, dental and vision coverage
  • retirement benefits
  • maternity/paternity leave
  • flexible work arrangements
  • education reimbursement
  • wellness programs
  • competitive pay
  • opportunity to earn an annual discretionary bonus

Additional Information:

Job Posted:
January 25, 2026

Employment Type:
Fulltime
Work Type:
Hybrid work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Principal ML Ops Engineer

Principal Engineer

The Principal AI/ML Operations Engineer leads the architecture, automation, and ...
Location
Location
United States , Pleasanton, California
Salary
Salary:
251000.00 - 314500.00 USD / Year
blackline.com Logo
BlackLine
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field
  • 10+ years in ML infrastructure, DevOps, and software system architecture
  • 4+ years in leading MLOps or AI Ops platforms
  • Strong programming skills in languages such as Python, Java, or Scala
  • Expertise in ML frameworks (TensorFlow, PyTorch, scikit-learn) and orchestration tools (Airflow, Kubeflow, Vertex AI, MLflow)
  • Proven experience operating production pipelines for ML and LLM-based systems across cloud ecosystems (GCP, AWS, Azure)
  • Deep familiarity with LangChain, LangGraph, ADK or similar agentic system runtime management
  • Strong competencies in CI/CD, IaC, and DevSecOps pipelines integrating testing, compliance, and deployment automation
  • Hands-on with observability stacks (Prometheus, Grafana, Newrelic) for model and agent performance tracking
  • Understanding of governance frameworks for Responsible AI, auditability, and cost metering across training and inference workloads
Job Responsibility
Job Responsibility
  • Define enterprise-level standards and reference architectures for ML-Ops and AIOps systems
  • Partner with data science, security, and product teams to set evaluation and governance standards (Guardrails, Bias, Drift, Latency SLAs)
  • Mentor senior engineers and drive design reviews for ML pipelines, model registries, and agentic runtime environments
  • Lead incident response and reliability strategies for ML/AI systems
  • Lead the deployment of AI models and systems in various environments
  • Collaborate with development teams to integrate AI solutions into existing workflows and applications
  • Ensure seamless integration with different platforms and technologies
  • Define and manage MCP Registry for agentic component onboarding, lifecycle versioning, and dependency governance
  • Build CI/CD pipelines automating LLM agent deployment, policy validation, and prompt evaluation of workflows
  • Develop and operationalize experimentation frameworks for agent evaluations, scenario regression, and performance analytics
What we offer
What we offer
  • short-term and long-term incentive programs
  • robust offering of benefit and wellness plans
  • Fulltime
Read More
Arrow Right

Senior Principal Technical Program Manager - ML Platform

Location
Location
Salary
Salary:
231300.00 - 301975.00 USD / Year
https://www.atlassian.com Logo
Atlassian
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of experience on software teams as Development Manager, Technical Product Manager or TPM leading technical platforms areas
  • Deep domain experience in AI and/or Search. Example: Model Inference, Model Evaluation, Model Training, LLM Ops, Semantic Search, Search Relevance, etc.
  • Partner with Engineering in defining direction, strategy and execution at Platform level
  • Strategic thinking and ability to understand business objectives to translate them into technical problems and programs.
  • Technical understanding of systems involved. Willingness to develop domain expertise in the area they operate - storage, networking, authentication, capacity management, service deployments, etc.
  • TPMs are not expected to write or read code, but are expected to understand system flows, block architectures, APIs and such.
  • Experience defining and running end-to-end complex technical programs
  • Strong leadership, organizational, and communication skills
Job Responsibility
Job Responsibility
  • Understand and stay up-to-date on latest innovations in AI and Search. Partner closely with engineering teams to translate these into practical platform evolution for Atlassian bringing value to our customers.
  • Analyze business objectives, customer needs, product adoption inhibitors and opportunities, industry trends, and based on these, in close collaboration with your stakeholders, define a long-term strategy and roadmap for your platform and product components.
  • Understand business objectives and translate them into technical systems problems that need to be prioritized solved in the current business environment.
  • Define specific systems programs and create a plan of action for realizing those programs. Such programs could be around capacity planning, migration efforts, high availability, network architecture, performance optimization, reliability improvements and more.
  • Use your technical understanding of Atlassian and related systems to partner with and influence engineers and architects in making progress on these problems.
  • Responsible for taking a systematic approach to engineering problems. This includes: prioritizing tasks, scoping out the project, defining objectives, and making consistent progress against each of these.
  • Be accountable for the success of these technical programs by managing the entire lifecycle from initiation to forecasting, budgeting, scheduling, etc.
  • Manage complex dependencies and projects with a broad scope across the company
What we offer
What we offer
  • health and wellbeing resources
  • paid volunteer days
Read More
Arrow Right

Principal Engineer, Model Dev Platform

As the Principal Engineer for the Model Development Platform at Wayve, you will ...
Location
Location
United States , Sunnyvale
Salary
Salary:
Not provided
wayve.ai Logo
Wayve
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Technical Leadership at Scale – 10+ years of experience designing and building large-scale distributed systems, ML/AI infrastructure, full stack web application, or developer platforms, including at least 3 years as a staff or principal-level engineer
  • Architectural Depth & Breadth – Proven ability to design systems spanning web platforms, ML pipelines, and large-scale compute orchestration (e.g., Spark, Ray, Kubernetes, Airflow, MLflow)
  • Reliability & Performance Mindset – Experience driving platform reliability improvements, defining SLAs/SLOs, and building self-healing and observable systems that operate at “four nines” availability or better
  • Hands-On Systems Design – Deep understanding of distributed computing, workflow orchestration, data modeling, and API design, with the ability to write and review production-quality code
  • Collaborative Influence – Excellent communication and cross-functional collaboration skills
  • ability to guide engineers, managers, and researchers toward unified technical direction
  • Mentorship & Culture – Demonstrated success in mentoring engineers across levels and cultivating a culture of engineering excellence
  • Education – Bachelor’s degree in Computer Science, Software Engineering, or related field (advanced degree preferred, or equivalent experience)
Job Responsibility
Job Responsibility
  • Design and evolve the overarching architecture of the model development platform, ensuring system-wide reliability, observability, and scalability
  • Work across disciplines—from front-end web UIs to large-scale distributed training, from Spark-based data pipelines to experiment scheduling algorithms using linear optimization—to unify the platform’s architecture and ensure smooth interoperability between systems
  • Dive deep into the thorniest technical challenges faced by individual subteams, bringing your expertise in distributed systems, large-scale compute, and system design to bear
  • Develop and refine systems that optimize how models are tested—whether in simulation or on-road—balancing constraints like hardware availability, safety requirements, and research priorities
  • Architect data processing pipelines capable of ingesting, transforming, and enriching petabytes of sensor data from the global fleet
  • Serve as a mentor and coach for engineers across the organization—developing technical talent, improving design practices, and fostering a culture of learning and technical excellence
  • Partner with Product Management, Research, and Operations to align technical architecture with user needs and product vision
Read More
Arrow Right

Principal Engineer, Computer Vision & AI /3D Data

Cesium is the leading open platform for streaming and visualizing huge 3D geospa...
Location
Location
Salary
Salary:
Not provided
bentley.com Logo
Bentley Systems
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • PhD or equivalent in Computer Vision, AI, or Machine Learning
  • 5+ years of experience in AI/3D vision development including industry experience of deploying AI and working with 3D data, including ML ops and practical, user-focused product development
  • Expertise in deep learning, computer vision, 3D geometry, and multimodal AI, including experience with large language models
  • Proven experience at least 5+ years leading and mentoring technical teams
  • Strong programming skills in Python and/or C++ with ML frameworks (PyTorch, TensorFlow), GPU programming (CUDA)
  • Excellent communication and leadership skills
  • Fluent in English
Job Responsibility
Job Responsibility
  • Lead and mentor a team of 5 engineers, providing technical direction and project coordination in computer vision and AI/3D data modeling projects, providing coaching and guidance
  • Design and deploy advanced AI/ML and 3D vision algorithms generated by our modelling team for large-scale datasets for practical, user focused product development including point clouds, meshes, sensor data and Gaussian splatting
  • Define the AI strategy and contribute to product roadmap decisions
  • Implement ML Ops practices for scalable, automated training and inference pipelines
  • Conduct research on emerging AI techniques for 3D understanding and integrate findings into production
  • Ensure quality through rigorous evaluation, optimization, and code reviews
What we offer
What we offer
  • A great Team and culture
  • An exciting career as an integral part of a world-leading software company providing solutions for architecture, engineering, and construction
  • An attractive salary and benefits package
  • A commitment to inclusion, belonging and colleague wellbeing through global initiatives and resource groups
  • A company committed to making a real difference by advancing the world’s infrastructure for better quality of life, where your contributions help build a more sustainable, connected, and resilient world
Read More
Arrow Right

Principal Engineering Manager, Core Platform & AI Systems

We are looking for a Principal Engineering Manager, Core Platform & AI Systems t...
Location
Location
United States , Seattle
Salary
Salary:
208000.00 - 313000.00 USD / Year
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of software engineering experience with 4+ years in engineering leadership roles
  • Experience managing senior/principal engineers, ideally across multiple functional areas
  • Strong technical background in cloud-native distributed systems, platform engineering, or AI/ML infrastructure
  • Proven track record of scaling SaaS platforms and leading teams responsible for mission-critical backend systems
  • Experience working closely with cross-functional teams such as Product, Infrastructure, AI/ML, and Security
  • Deep understanding of reliability, operational excellence, and cost optimization in cloud environments (AWS, Azure, GCP)
  • Excellent communication, collaboration, and executive stakeholder management skills
  • Passion for developing people and building strong, healthy engineering teams
Job Responsibility
Job Responsibility
  • Lead and grow the Core Platform & AI Systems team
  • Drive the technical roadmap for the platform, ensuring scalability, performance, availability, and cost-efficiency
  • Partner closely with product engineering teams to deliver platform capabilities that unlock business features while simplifying the developer experience
  • Collaborate with Data Science, ML, and AI teams to provide robust ML Ops and AI infrastructure that enables rapid experimentation and production-grade AI deployments
  • Own platform-wide reliability and operational health, continuously investing in observability, incident management, and system resilience
  • Contribute to architectural decisions that shape the long-term direction of Highspot’s SaaS platform
  • Attract, retain, and develop top engineering talent, building a high-performing and inclusive team culture
  • Communicate effectively with senior leadership, providing visibility into roadmap progress, technical trade-offs, and organizational needs
What we offer
What we offer
  • Comprehensive medical, dental, vision, disability, and life benefits
  • Health Savings Account (HSA) with employer contribution
  • 401(k) Matching with immediate vesting on employer match
  • Flexible PTO
  • 8 paid holidays and 5 paid days for Annual Holiday Week
  • Quarterly Recharge Fridays (paid days off for mental health recharge)
  • 18 weeks paid parental leave
  • Access to Coaches and Therapists through Modern Health
  • 2 volunteer days per year
  • Commuting benefits
  • Fulltime
Read More
Arrow Right

Principal Product Manager, AI

In this role, you will be joining the Product Management team reporting to the D...
Location
Location
United States , Seattle
Salary
Salary:
167000.00 - 278000.00 USD / Year
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years product management experience building B2B SaaS products
  • 4+ years working directly with ML/AI products or platforms
  • Demonstrated technical fluency with modern generative AI / ML concepts (LLMs, RAG, embeddings, vector DBs, fine-tuning, prompt engineering, evaluation metrics, model ops)
  • Proven record of shipping complex, cross-functional AI products or platforms to production and driving measurable business impact
  • Strong quantitative and analytical skills
  • Exceptional communication skills
  • Experience mentoring PMs and participating in hiring
  • a track record of raising the bar for product craft
Job Responsibility
Job Responsibility
  • Define vision & strategy
  • Lead execution end-to-end
  • Own model + product success
  • Drive platform & scale thinking
  • Champion safety, fairness & compliance
  • Be the cross-functional glue
  • Mentor & raise the bar
What we offer
What we offer
  • Comprehensive medical, dental, vision, disability, and life benefits
  • Health Savings Account (HSA) with employer contribution
  • 401(k) Matching with immediate vesting on employer match
  • Flexible PTO
  • 8 paid holidays and 5 paid days for Annual Holiday Week
  • Quarterly Recharge Fridays (paid days off for mental health recharge)
  • 18 weeks paid parental leave
  • Access to Coaches and Therapists through Modern Health
  • 2 volunteer days per year
  • Commuting benefits
  • Fulltime
Read More
Arrow Right

Principal Product Manager, AI

In this role, you will be joining the Product Management team reporting to the D...
Location
Location
Canada , Vancouver
Salary
Salary:
146000.00 - 220000.00 CAD / Year
highspot.com Logo
Highspot
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years product management experience building B2B SaaS products
  • 4+ years working directly with ML/AI products or platforms
  • Demonstrated technical fluency with modern generative AI / ML concepts (LLMs, RAG, embeddings, vector DBs, fine-tuning, prompt engineering, evaluation metrics, model ops)
  • Proven record of shipping complex, cross-functional AI products or platforms to production and driving measurable business impact
  • Strong quantitative and analytical skills
  • Exceptional communication skills
  • Experience mentoring PMs and participating in hiring
  • a track record of raising the bar for product craft
Job Responsibility
Job Responsibility
  • Define vision & strategy
  • Lead execution end-to-end
  • Own model + product success
  • Drive platform & scale thinking
  • Champion safety, fairness & compliance
  • Be the cross-functional glue
  • Mentor & raise the bar
What we offer
What we offer
  • Comprehensive medical, dental, vision, disability, and life benefits
  • Group Retirement Savings Plan (RRSP) and matching employer contributions (DPSP) with immediate vesting
  • Flexible PTO
  • Generous Holiday Schedule + 5 Days for Annual Holiday Week
  • Quarterly Recharge Fridays (paid days off for mental health recharge)
  • Flexible work schedules
  • Access to Coaches and Therapists through Modern Health
  • 2 Volunteer days per year
  • Monthly transportation allowance for employees that work in our Vancouver Hub location
  • Stock options
  • Fulltime
Read More
Arrow Right
New

Gaming Platform Finance Lead

At Team Gaming, we are on a mission to bring joy, excitement, and community of g...
Location
Location
United States , Redmond
Salary
Salary:
116900.00 - 203600.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Master's Degree in Business Administration, Accounting, Finance, Economics, Data Science or related field AND 4+ years experience in financial analysis, accounting, controllership or finance, or related field OR Bachelor's Degree in Business Administration, Accounting, Finance, Economics, Data Science or related field AND 6+ years experience in financial analysis, accounting, controllership or finance, or related field OR equivalent experience
  • Master's Degree in Business Administration, Accounting, Finance, Economics, Data Science or related field AND 8+ years of post-graduate financial analysis, accounting, controllership, or finance work experience OR Bachelor's degree in Business Administration, Accounting, Finance, Economics, Data Science or related field AND 12+ years of post-graduate financial analysis, accounting, controllership, or finance work experience OR equivalent experience
  • 3+ years experience in multinationals with multi-product/multi-segment finance roles
  • 3+ years work experience in matrix-based organization
  • 3+ years work experience in the technology or software industry
  • Ability to distill complex information into clear, compelling narratives
Job Responsibility
Job Responsibility
  • Help lead and drive Rhythm of Business processes for the Gaming Platform team, such as month end close and forecast/budgeting cycles. Manage and execute processes through engagement with internal and external stakeholders
  • Solid partnership and collaboration. Partner with stakeholders across Central Finance Team, Investor Relations, marketing & engineering finance, and other teams to craft our Gaming story – internal and external – through deep business understanding, solid collaboration across organizations and team leadership
  • Finance thought leadership. Conceptualize and design new approaches and metrics to how relevant financial data is collected and evaluated to impact future outcomes and decisions
  • Financial Analysis. Understand the interconnections between data, information, and results and translate this into forward-looking narratives and forecasts. Complete and review analysis, modeling, and research to support preliminary and adjusted forecast cycles. Generate relevant and actionable insights to drive the business forward
  • Clear communication. Integrate varied data sources and assimilate diverse information into a comprehensive and clear narrative. Communicate results in a compelling and understandable message. Able to communicate complex financial models in a simple way
  • Embody our culture and values
  • Fulltime
Read More
Arrow Right