AI/ML DevOps Engineer

NTT DATA

Location:
India, Noida

Contract Type:
Not provided

Salary:
Not provided

Job Description:

The AI/ML DevOps Engineer will be responsible for designing and maintaining Infrastructure-as-Code templates using Terraform for Azure AI/ML services. This role requires a blend of engineering and coordination skills to ensure safe deployments of AI workloads. Candidates should have experience with Azure services, automation tools, and security practices.

Job Responsibility:

  • Hands-on engineer responsible for designing, building, and maintaining Infrastructure-as-Code (IaC) templates with Terraform to provision and operate Azure AI/ML services for multiple application teams
  • The role blends engineering (Terraform, pipelines, AKS, security) with coordination (Kanban flow, cross-team alignment, risk/issue tracking) to accelerate safe, repeatable deployments of AI workloads for onboarding application teams to the bank's AI Platform (Azure)

Requirements:

  • Engineer IaC modules and reusable templates (Terraform) to provision Azure resources for AI/ML (e.g., Azure OpenAI/AI Studio, Azure ML, Cognitive Search, Key Vault, Storage, networking)
  • Automate pipelines (Azure DevOps/GitHub Actions) for plan/apply, policy checks, and environment promotion; integrate secrets, approvals, and drift detection
  • Stand up access & identity using Microsoft Entra ID patterns (app registrations, groups/roles, RBAC) for app teams and automation
  • Support AKS-based deployments and platform integrations (ingress, images, namespaces, quotas) for AI services that land on Kubernetes
  • Harden & govern: embed guardrails (Azure Policy, role assignments, private endpoints), tagging/FinOps, logging/monitoring baselines
  • Scripting with Python or Bash for IaC tooling and deployment helpers
  • Experience codifying policies/controls (OPA/Conftest, Azure Policy as Code) and cost governance tags

Additional Information:

Job Posted:
January 24, 2026

Work Type:
Hybrid work

Similar Jobs for AI/ML DevOps Engineer

DevOps Quality Engineer

Engineer reliability at scale. We’re looking for a DevOps Quality Engineer who t...
Location:
Bulgaria, Sofia
Salary:
Not provided
European Bank for Reconstruction and Development
Expiration Date
Until further notice
Requirements:
  • Holds ISTQB Foundation as a minimum
  • Advanced Test Analyst or equivalent certifications desirable
  • Qualification in IT Service Management (ITIL v3 or v4 Foundation) or demonstrable experience integrating QA practices into ITSM processes
  • Familiar with the NIST Cybersecurity Framework (CSF) and Digital Operational Resilience Act (DORA), with practical awareness of how they influence quality standards and assurance planning
  • Demonstrates solid understanding of automation and non-functional testing concepts, including performance, accessibility, and shift-left/shift-right practices
  • Experience working within Agile, DevOps, and product-aligned teams, contributing to sprint-based delivery and continuous integration testing strategies
  • Proficient in test tooling and CI/CD frameworks including Azure DevOps, Selenium, Cypress, Jenkins, Git, and test management platforms such as TestRail or Zephyr
  • Familiarity with AI/ML use cases in quality engineering, including AI-assisted test case generation, defect clustering, and predictive analytics
  • Strong communication and collaboration skills, with the ability to explain test scenarios, defects, and coverage to technical and non-technical stakeholders
  • Awareness of security, compliance, and resilience considerations such as OWASP Top 10, ISO 27001, GDPR, and DORA, with practical experience embedding these into quality practices
Job Responsibility:
  • Plans and performs testing of infrastructure changes, configuration updates, and cloud deployments across data centres, branch offices, and Azure environments, ensuring functional and non-functional validations
  • Collaborates with DevOps, Infrastructure, and Cloud Platform teams to verify IaC deployments, perform smoke testing post-deployment, and document any issues in platform reliability
  • Writes and maintains basic automation scripts or checklists for network, AV, and platform components, contributing to regression detection and provisioning assurance
  • Investigates environment or deployment defects, escalates issues with supporting diagnostics, and retests fixes with attention to uptime, latency, and failover coverage
  • Engages in sprint reviews and planning sessions, raising risks tied to infrastructure changes, recommending testing scope for AV, cloud services, or network resilience improvements
What we offer:
  • Varied, stimulating and engaging work that gives you an opportunity to interact with a wide range of experts in the financial, political, public and private sectors across the regions we invest in
  • A working culture that embraces inclusion and celebrates diversity
  • An environment that places sustainability, equality and digital transformation at the heart of what we do
  • Full-time

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location:
Salary:
Not provided
N-iX
Expiration Date
Until further notice
Requirements:
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility:
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer:
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

DevOps Engineer

We are looking for a skilled and motivated DevOps Engineer to join our team and ...
Location:
Poland, Warsaw
Salary:
Not provided
Inetum
Expiration Date
Until further notice
Requirements:
  • 2+ years of hands-on experience in DevOps or Cloud Engineering
  • Experience with Azure (preferred), AWS, or similar cloud platforms
  • Proficiency in Kubernetes, Docker, Helm, and Terraform
  • Familiarity with CI/CD tools (Azure DevOps, Jenkins) and scripting (Python, Bash)
  • Understanding of cloud networking, API gateways, load balancing, and monitoring/logging tools (Prometheus, Grafana, ELK)
  • Experience with data storage solutions and ETL/ELT concepts
  • Basic knowledge of cloud security, GDPR compliance, RBAC, and auditability
  • Strong troubleshooting skills and willingness to support incident management
  • English and Polish language at a minimum B2 level
  • Degree in Computer Science, IT, or related field—or equivalent experience.
Job Responsibility:
  • Design, implement, and manage cloud infrastructure for GenAI-based platforms (Azure-focused)
  • Maintain and enhance CI/CD pipelines for deploying AI/ML and conversational AI solutions
  • Automate provisioning, monitoring, and scaling of cloud-native microservices (Kubernetes, Docker, Helm, Terraform)
  • Support production operations including monitoring, logging, alerting, and disaster recovery
  • Collaborate with AI/ML engineers, backend developers, and compliance teams to deliver GenAI products
  • Follow best practices for Infrastructure as Code (IaC) and contribute to cloud cost optimization
  • Integrate and manage data storage solutions (Azure PostgreSQL Flexible Server, data lakes, warehousing)
  • Build and maintain secure data pipelines (ETL/ELT)
  • Support API integrations, API gateways, and service mesh components for multi-channel (chat/voice) deployments
  • Ensure compliance with privacy, GDPR, and secure data handling standards
What we offer:
  • Flexible working hours
  • Hybrid work model
  • A cafeteria system that allows employees to personalize benefits by choosing from a variety of options
  • Generous referral bonuses, offering up to PLN6,000 for referring specialists
  • Additional revenue sharing opportunities for initiating partnerships with new clients
  • Ongoing guidance from a dedicated Team Manager for each employee
  • Tailored technical mentoring from an assigned technical leader, depending on individual expertise and project needs
  • Dedicated team-building budget for online and on-site team events
  • Opportunities to participate in charitable initiatives and local sports programs
  • A supportive and inclusive work culture with an emphasis on diversity and mutual respect.
  • Full-time

Principal AI/ML & Innovation Engineer

We are seeking Principal AI/ML & Innovation Engineer who will be leading initiat...
Location:
Puerto Rico, Aguadilla
Salary:
Not provided
Hewlett Packard Enterprise
Expiration Date
Until further notice
Requirements:
  • Bachelor's or master’s degree in computer science, engineering, data science, machine learning, artificial intelligence, or closely related quantitative discipline
  • Typically, 10-15 years’ experience
  • Solid understanding of fundamental AI and machine learning concepts, including supervised and unsupervised learning, deep learning, reinforcement learning, natural language processing, computer vision, and statistical modeling
  • Proficient in implementing and deploying various machine learning algorithms, such as decision trees, random forests, support vector machines, and neural networks
  • Knowledge of popular machine learning frameworks and libraries like TensorFlow, PyTorch, or scikit-learn
  • Strong understanding of GitHub Copilot, Cursor, n8n, vibe coding, Windsurf, and similar technologies
  • Experience in cloud infrastructure (AWS, Azure, etc.)
  • Knowledge of open source, Linux, etc.
  • Understanding of DevOps and SRE
  • Expertise in deep learning techniques, architectures, and frameworks (e.g., convolutional neural networks (CNN), recurrent neural networks (RNN), generative adversarial networks (GAN), etc.)
Job Responsibility:
  • Designing, developing, and deploying advanced machine learning models and algorithms
  • Leading research initiatives to explore novel approaches and technologies
  • Designing the architecture of AI systems and ensuring scalability, performance, and reliability
  • Collaborating with other teams, such as data scientists, software engineers, and product managers
  • Providing technical leadership and mentorship to junior engineers
  • Overseeing and guiding multiple design review sessions across different projects
  • Partnering with the engineering manager and team lead to establish long-term design and implementation strategies
  • Leading efforts to incorporate feedback loops and continuous improvement processes
  • Leading meetings, ensuring efficient progress tracking, issue resolution, and team coordination
  • Creating and delivering high-level presentations and reports to executive stakeholders
What we offer:
  • Health & Wellbeing
  • Personal & Professional Development
  • Unconditional Inclusion
  • Full-time

Senior Devops & AI Engineer

This role presents a unique opportunity to contribute to the future of impactful...
Location:
India, Hyderabad
Salary:
Not provided
Fission Labs
Expiration Date
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 6+ years of experience in infrastructure management roles, with a focus on cloud platforms (Azure and AWS preferred)
  • Hands-on experience with operations (DevSecOps) principles and best practices
  • Proficiency in scripting languages such as Python, PowerShell, or Bash
  • Excellent communication and collaboration skills
  • In-depth knowledge of Linux operating systems, including CentOS, Ubuntu, and Red Hat, with expertise in shell scripting, package management, and system administration
  • Hands-on experience with a wide range of AWS and Azure services
  • Develop and maintain Infrastructure as Code (IaC) templates using tools such as Terraform or AWS CloudFormation
  • Experience setting up cloud infrastructure stack, databases, service endpoints, GPU as well as CPU resource scaling, optimization, etc.
  • Should have worked with AIOps/MLOps
Job Responsibility:
  • Configure and optimize Linux-based servers for performance, security, and resource utilization, including kernel tuning, file system management, and network configuration
  • Architect cloud solutions leveraging best practices and services offered by AWS and Azure, optimizing for scalability, reliability, and cost-effectiveness
  • Implement and manage hybrid cloud environments, facilitating seamless integration and interoperability between AWS and Azure services
  • Establish version control practices for IaC templates, ensuring traceability, auditability, and reproducibility of infrastructure changes
What we offer:
  • Opportunity to work on impactful technical challenges with global reach
  • Vast opportunities for self-development, including online university access and knowledge sharing opportunities
  • Sponsored Tech Talks & Hackathons to foster innovation and learning
  • Generous benefits packages including health insurance, retirement benefits, flexible work hours, and more
  • Supportive work environment with forums to explore passions beyond work
  • Full-time

Platform Engineer

Motorica is at a breakthrough moment. We’ve built a generative AI animation plat...
Location:
Sweden, Stockholm
Salary:
Not provided
Motorica
Expiration Date
Until further notice
Requirements:
  • Proven experience in Platform Engineering, SRE, or DevOps, ideally in high-growth or AI/ML-heavy environments
  • Strong grasp of CI/CD systems, cloud infrastructure (AWS/GCP), and containerization (Docker/Kubernetes)
  • Familiarity with observability, monitoring, and incident response best practices
  • Security mindset with hands-on experience in audits, compliance (ISO 27001, SOC2, etc.), and vulnerability management
  • Strong communication skills: you’ll be interfacing with developers daily and need to translate infrastructure into clarity, not complexity
  • A proactive, solution-oriented mindset: you anticipate friction before others feel it
Job Responsibility:
  • Provide common infrastructure guidance, reusable patterns, and automated tooling to engineering teams
  • Own the “paved road” for developers, reducing friction and cognitive load
  • Champion and implement security best practices across the entire platform
  • Play a key role in achieving ISO 27001 certification through technical implementation and evidence gathering
  • Build and operate a highly reliable and cost-efficient platform, with particular focus on optimizing GPU-heavy AI/ML workloads
  • Manage CI/CD systems (GitHub Actions, GitLab CI) and track key metrics like build times, deployment frequency, and failure rates
  • Oversee cloud environments (AWS, GCP), including health, security, and cost reporting
  • Lead security scans, audits, and vulnerability remediation
  • Maintain observability stack (Prometheus, Grafana, Datadog, GCP Logging), ensuring meaningful dashboards and alerts
  • Act as point-of-contact for ML Research team’s infra requests (GPU access, specialized pipelines)
What we offer:
  • Stock Options program
  • Retirement Plan
  • Health Benefits (5000 SEK/year)
  • Life Insurance / Health Insurance / Injury Insurance
  • Competitive compensation
  • Full-time

Head of Engineering

Lead the Technical Transformation Towards an AI-Driven SaaS Future. We are undert...
Location:
Sweden, Stockholm; Malmö
Salary:
Not provided
DanAds
Expiration Date
Until further notice
Requirements:
  • Extensive experience in a senior engineering leadership role, such as Head of Engineering, CTO, or Principal Architect, within a SaaS or technology-driven environment
  • Proven ability to design and operate large-scale distributed systems with a focus on scalability, performance, and reliability
  • Advanced proficiency in cloud-native engineering, containerization, and DevOps practices
  • Experience integrating AI/ML technologies into production environments and maintaining robust MLOps workflows
  • Strong knowledge of data engineering and architectural design for AI-readiness
  • Exceptional technical judgment and the ability to balance innovation with stability and quality
  • Demonstrated ability to translate business and product requirements into effective, maintainable technical solutions
Job Responsibility:
  • Provide strategic and operational leadership to the Engineering organization, ensuring the successful delivery of an AI-first, cloud-native SaaS platform
  • Design and evolve a highly scalable multi-tenant SaaS architecture capable of supporting millions of transactions efficiently
  • Oversee the development and maintenance of a cloud-native infrastructure, leveraging containerization (Docker, Kubernetes) and CI/CD automation
  • Collaborate with data science and ML teams to integrate AI and machine learning models into production systems
  • Develop and manage MLOps pipelines and data infrastructure, including data pipelines, data lakes, and data warehouses
What we offer:
  • Competitive salary
  • Wellness grant and occupational pension
  • A fun and entrepreneurial environment
  • Career growth opportunities
  • Industry leading clients such as Paramount Advertising, Yahoo, Nine Group, Roku and many more

Senior Vice President of Engineering

Actian is a global leader in hybrid data management, cloud data warehousing, and...
Location:
France, Paris
Salary:
Not provided
Actian
Expiration Date
Until further notice
Requirements:
  • 20+ years of experience in software engineering
  • At least 8 years in senior executive leadership roles
  • Proven track record of scaling large, distributed engineering organizations in fast-growing software or SaaS environments
  • Deep expertise in cloud-native architectures, AI/ML integration, connectivity, and modern DevOps practices
  • Strong experience managing multi-disciplinary teams (engineering, quality, operations) across international locations
  • Exceptional leadership, organizational design, and stakeholder management skills
  • Fluency in English (both written and spoken) is required
  • Based in Europe, with the ability to collaborate effectively across global time zones
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field (MBA or equivalent experience a plus)
Job Responsibility:
  • Define and execute the engineering vision and strategy in alignment with company objectives and product roadmaps
  • Partner with the CPTO and Product leaders to drive the successful delivery of innovative, high-quality, and secure solutions
  • Oversee the technical strategy for AI, connectivity, and cloud operations, ensuring scalability and performance
  • Lead transformation initiatives that evolve engineering processes, tools, and culture to meet global scale demands
  • Build and scale a world-class engineering organization across multiple geographies
  • Develop strong leadership within the engineering function, mentoring senior managers and fostering a high-performance culture
  • Establish organizational structures, processes, and metrics that enable predictable, efficient, and scalable software delivery
  • Drive talent acquisition and retention strategies to support growth and innovation
  • Ensure delivery excellence across product development, AI initiatives, quality engineering, and cloud operations
  • Champion modern engineering practices, including copilots, CI/CD, DevOps, observability, and automation
What we offer:
  • Competitive salary and benefits package
  • Flexible work arrangements (remote or hybrid)
  • Opportunities for professional growth and development
  • Full-time