AI DevOps Engineer

Amaris Consulting

Location:
Vietnam, Ho Chi Minh City

Contract Type:
Not provided

Salary:

Not provided

Job Description:

ABOUT THE JOB: Design, implement, and maintain infrastructure to support AI model training or deployment using modern DevOps tools and technologies. Manage CI/CD processes for AI and software projects. Integrate AI models into, or wrap them in, dockerized APIs. Install and set up on-prem LLMs on GPUs, together with the data pipelines that ground them on predefined document sets. Automate deployment and monitoring of AI solutions. Ensure security and compliance in all DevOps practices. Collaborate with cross-functional teams to deliver robust ML/AI solutions. Troubleshoot and optimize infrastructure for performance and reliability.

Job Responsibility:

  • Design, implement, and maintain infrastructure to support AI model training or deployment using modern DevOps tools and technologies
  • Manage CI/CD processes for AI and software projects
  • Integrate AI models into, or wrap them in, dockerized APIs
  • Install and set up on-prem LLMs on GPUs, together with the data pipelines that ground them on predefined document sets
  • Automate deployment and monitoring of AI solutions
  • Ensure security and compliance in all DevOps practices
  • Collaborate with cross-functional teams to deliver robust ML/AI solutions
  • Troubleshoot and optimize infrastructure for performance and reliability

Requirements:

  • Bachelor’s degree in computer science, information systems, or a related field
  • 5+ years in DevOps Engineering
  • Solid knowledge of Docker, Bash, Git, Kubernetes, and OpenShift
  • Experience with generative AI tools
  • Experience with templated syntaxes (Ansible, Azure pipelines, Helm charts)
  • Experience with CI/CD pipelines and automation (Ansible, Docker registries, Helm charts, API Manager)
  • Basic understanding of web development (backend/frontend segregation, HTTP communication, etc.)
  • Fluency in English, both spoken and written, is required
  • You demonstrate an analytical, problem-solving mindset, strong teamwork and collaboration skills, and a security-by-design and security-by-default mindset
  • You are proactive in troubleshooting contexts

Nice to have:

  • Experience with GPU setup in containerized environments

What we offer:
  • Competitive salary and 13th-month salary
  • 14+ days of annual leave
  • Premium healthcare insurance, starting from your probation period
  • Project reviews and yearly performance appraisals
  • Annual company trips
  • Team-building activities: team lunches and dinners, events, celebrations, and sports clubs (football, basketball, badminton, pickleball)
  • International team with flexible working time
  • Tailor-made career path
  • Technical workshops and training courses
  • Mobility: Opportunities to work on-site abroad at our offices in 60+ countries

Additional Information:

Job Posted:
January 15, 2026

Similar Jobs for AI DevOps Engineer

Senior Devops & AI Engineer

This role presents a unique opportunity to contribute to the future of impactful...
Location:
India, Hyderabad
Salary:
Not provided
Fission Labs
Expiration Date:
Until further notice
Requirements:
  • Bachelor's degree in Computer Science, Engineering, or related field
  • 6+ years of experience in infrastructure management roles, with a focus on cloud platforms (Azure and AWS preferred)
  • Hands-on experience with operations (DevSecOps) principles and best practices
  • Proficiency in scripting languages such as Python, PowerShell, or Bash
  • Excellent communication and collaboration skills
  • In-depth knowledge of Linux operating systems, including CentOS, Ubuntu, and Red Hat, with expertise in shell scripting, package management, and system administration
  • Hands-on experience with a wide range of AWS and Azure services
  • Develop and maintain Infrastructure as Code (IAC) templates using tools such as Terraform or AWS CloudFormation
  • Experience setting up cloud infrastructure stack, databases, service endpoints, GPU as well as CPU resource scaling, optimization etc.
  • Should have worked with AIOps/MLOps
Job Responsibility:
  • Configure and optimize Linux-based servers for performance, security, and resource utilization, including kernel tuning, file system management, and network configuration
  • Architect cloud solutions leveraging best practices and services offered by AWS and Azure, optimizing for scalability, reliability, and cost-effectiveness
  • Implement and manage hybrid cloud environments, facilitating seamless integration and interoperability between AWS and Azure services
  • Establish version control practices for IAC templates, ensuring traceability, auditability, and reproducibility of infrastructure changes
What we offer:
  • Opportunity to work on impactful technical challenges with global reach
  • Vast opportunities for self-development, including online university access and knowledge sharing opportunities
  • Sponsored Tech Talks & Hackathons to foster innovation and learning
  • Generous benefits packages including health insurance, retirement benefits, flexible work hours, and more
  • Supportive work environment with forums to explore passions beyond work
  • Fulltime

Senior Platform Engineer - CI/CD & AI Automation (AI-first)

Groupon is undergoing a critical platform transformation, modernizing its core d...
Location:
Czechia, Prague
Salary:
Not provided
Groupon
Expiration Date:
Until further notice
Requirements:
  • 5+ years of dedicated experience in Platform Engineering, DevOps, or Infrastructure roles
  • Deep expertise building, scaling, and migrating CI/CD systems, with strong practical experience in Jenkins and/or GitHub Actions
  • Expertise in scripting and automation (Python, Go, or Bash)
  • Solid understanding of container technologies, Kubernetes, and cloud build systems
  • Proven experience leveraging AI tooling (e.g., Claude Code, code analysis) to meaningfully increase developer output and optimize platform work
  • Excellent communication and ability to drive technical decisions across multiple platform and product teams
Job Responsibility:
  • Platform Transformation: Lead the design, planning, and execution of the Jenkins-to-GitHub Actions migration across a large portfolio of microservices
  • Pipeline Engineering: Design and optimize high-performance, secure, and observable CI/CD workflows across GitHub Actions, Jenkins, and Kubernetes environments
  • AI-First Automation: Drive an AI-First workflow by leveraging tools (e.g., Copilot, code generation) to eliminate infrastructure toil, accelerate development, and analyze pipeline failures
  • Core Automation: Develop robust platform automation (e.g., Python, Go, Bash) to improve build efficiency, artifact caching, reliability, and repository hygiene
  • Security & Compliance: Harden CI/CD infrastructure with robust controls for secrets management, RBAC, audit logging, and secure runner design
  • Observability: Implement and enhance CI/CD observability using tools like Prometheus, Grafana, and OpenTelemetry to provide deep insights into performance and reliability
  • Technical Leadership: Mentor engineers and partner across Cloud, Security, and Developer Experience teams to define and evolve our end-to-end delivery platform architecture

Principal AI Engineer

At JFrog, we’re reinventing DevOps to help the world’s greatest companies innova...
Location:
Israel, Netanya/Tel Aviv
Salary:
Not provided
JFrog
Expiration Date:
Until further notice
Requirements:
  • A bachelor's degree or higher in Computer Science, Data Science, or a related field
  • Proven experience in software development
  • Proficiency in LLM-related tools, processes, and frameworks, including OpenAI Models and APIs, Hugging Face Transformers, LangChain, vector databases, and prompt management tools like PromptPerfect/PromptBase and Guardrails
  • Experience with cloud platforms, such as AWS, Google Cloud, or Azure
  • Proficiency in Python programming
  • Experience deploying LLM-based applications in a production environment
  • Excellent problem-solving and analytical skills
  • Experience with CI/CD tools
  • Strong communication skills and the ability to collaborate effectively in a team
Job Responsibility:
  • Recommend and test agentic productivity tools
  • Collaborate with key organizational stakeholders to understand AI requirements and design end-to-end AI productivity solutions
  • Explore and experiment with novel ML and AI techniques and architectures to drive DevX and productivity innovation
  • Evaluate and recommend ML and AI tools and frameworks to enhance productivity and effectiveness
  • Provide technical guidance and mentorship to development teams on AI and ML technologies and practices
  • Define meaningful KPIs and closely monitor cost

Senior Software Engineer - Build AI Tools

This role sits within the newly formed GenAI Security team, which is responsible...
Location:
United Kingdom, Belfast
Salary:
Not provided
Citi
Expiration Date:
Until further notice
Requirements:
  • Highly motivated self-starter with excellent interpersonal and problem-solving skills
  • Bachelor’s degree or equivalent work experience
  • Good oral and written communication skills
  • Significant relevant industry work experience
  • Experience of the full lifecycle of design, implementation and running of enterprise software solutions involving cross functional team collaboration
  • Expertise in a major programming language such as Python and/or Go, and associated tooling (Git, Maven, IDEs, Jenkins, Bitbucket etc)
  • Expertise in designing and implementing secure APIs and libraries
  • Experience in Generative AI, LLM frameworks, LLM prompt engineering and/or adversarial testing is a bonus
  • Experience with Cyber engineering and Operations, which could include DevSecOps or MLSecOps
  • Experience contributing to the architecture and design (architecture, design patterns, reliability, scaling) of new and current systems
Job Responsibility:
  • Designing, developing, optimizing, and enhancing a GenAI prompt security platform to protect firm AI/LLM-based applications from adversarial attacks and prompt injections
  • Building and automating a security testing framework to validate protection mechanisms for various LLM use cases
  • Owning solutions that are expected to operate and perform at scale across the organisation
  • Collaboration with multiple stakeholders and partners across Engineering and Operations as well as partner teams within the wider Citi organisation, across different time zones
What we offer:
  • 27 days annual leave (plus bank holidays)
  • A discretionary annual performance-related bonus
  • Private Medical Care & Life Insurance
  • Employee Assistance Program
  • Pension Plan
  • Paid Parental Leave
  • Special discounts for employees, family, and friends
  • Access to an array of learning and development resources
  • Fulltime

Senior Software Engineer – AI

NStarX is seeking a highly skilled Senior Software Engineer – AI with a strong f...
Location:
India, Hyderabad
Salary:
Not provided
NStarX
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s or Master’s degree in Computer Science, Machine Learning, Data Science, or a related field (PhD is a plus)
  • 9+ years of experience in AI/ML engineering or related roles
  • 3+ years of experience in Generative AI with team leadership responsibilities
  • Proven track record of production-grade ML and GenAI model development and deployment
  • Programming: Python (preferred)
  • GenAI Frameworks: Hugging Face Transformers, Diffusers, LangChain, TGI
  • Serving & Inference: FastAPI, gRPC, NVIDIA Triton, TorchServe
  • Cloud Platforms: AWS (SageMaker, EKS), GCP (Vertex AI, GKE), Azure (Azure ML, AKS)
  • MLOps & DevOps: Kubeflow, MLflow, GitHub Actions, Jenkins, Helm, Terraform
  • Optimization Techniques: Model quantization, distillation, pipeline and tensor parallelism
Job Responsibility:
  • Design, develop, and deploy machine learning models and AI algorithms to address complex business challenges
  • Lead and mentor a team of AI/ML engineers, ensuring quality and scalability in solution design and implementation
  • Collaborate closely with cross-functional teams including data scientists, software engineers, product managers, and UX designers
  • Lead the development and deployment of Generative AI applications across text, code, image, and audio modalities using state-of-the-art LLMs
  • Design and implement CI/CD pipelines for the GenAI model lifecycle including training, validation, packaging, and deployment
  • Apply best practices for model performance tuning, cost optimization, and scalable deployment in cloud and hybrid environments
  • Develop prompt engineering, fine-tuning strategies (LoRA, QLoRA, PEFT), and evaluation protocols tailored to business use cases
  • Stay current with emerging trends in AI, ML, and Generative AI and drive adoption across teams
  • Document processes, model architectures, and deployment strategies for traceability and knowledge sharing
  • Work closely with cross-functional teams to gather requirements and deliver high-quality solutions
What we offer:
  • Competitive salary aligned with market standards
  • Opportunities for professional development and skill enhancement
  • A collaborative and innovative work environment
  • Fulltime

DevOps Quality Engineer

Engineer reliability at scale. We’re looking for a DevOps Quality Engineer who t...
Location:
Bulgaria, Sofia
Salary:
Not provided
European Bank for Reconstruction and Development
Expiration Date:
Until further notice
Requirements:
  • Holds ISTQB Foundation as a minimum
  • Advanced Test Analyst or equivalent certifications desirable
  • Qualification in IT Service Management (ITIL v3 or v4 Foundation) or demonstrable experience integrating QA practices into ITSM processes
  • Familiar with the NIST Cybersecurity Framework (CSF) and Digital Operational Resilience Act (DORA), with practical awareness of how they influence quality standards and assurance planning
  • Demonstrates solid understanding of automation and non-functional testing concepts, including performance, accessibility, and shift-left/shift-right practices
  • Experience working within Agile, DevOps, and product-aligned teams, contributing to sprint-based delivery and continuous integration testing strategies
  • Proficient in test tooling and CI/CD frameworks including Azure DevOps, Selenium, Cypress, Jenkins, Git, and test management platforms such as TestRail or Zephyr
  • Familiarity with AI/ML use cases in quality engineering, including AI-assisted test case generation, defect clustering, and predictive analytics
  • Strong communication and collaboration skills, with the ability to explain test scenarios, defects, and coverage to technical and non-technical stakeholders
  • Awareness of security, compliance, and resilience considerations such as OWASP Top 10, ISO 27001, GDPR, and DORA, with practical experience embedding these into quality practices
Job Responsibility:
  • Plans and performs testing of infrastructure changes, configuration updates, and cloud deployments across data centres, branch offices, and Azure environments, ensuring functional and non-functional validations
  • Collaborates with DevOps, Infrastructure, and Cloud Platform teams to verify IaC deployments, perform smoke testing post-deployment, and document any issues in platform reliability
  • Writes and maintains basic automation scripts or checklists for network, AV, and platform components, contributing to regression detection and provisioning assurance
  • Investigates environment or deployment defects, escalates issues with supporting diagnostics, and retests fixes with attention to uptime, latency, and failover coverage
  • Engages in sprint reviews and planning sessions, raising risks tied to infrastructure changes, recommending testing scope for AV, cloud services, or network resilience improvements
What we offer:
  • Varied, stimulating and engaging work that gives you an opportunity to interact with a wide range of experts in the financial, political, public and private sectors across the regions we invest in
  • A working culture that embraces inclusion and celebrates diversity
  • An environment that places sustainability, equality and digital transformation at the heart of what we do
  • Fulltime

Founding DevOps Engineer

As a Founding Engineer with a DevOps focus, you will play a central role in buil...
Location:
France, Paris
Salary:
60000.00 - 80000.00 EUR / Year
Blu Selection
Expiration Date:
Until further notice
Requirements:
  • Strong full-stack engineering background with a DevOps mindset
  • Hands-on expertise in cloud platforms (GCP ideal): deployments, infra, CI/CD
  • Proficient in TypeScript and Python
  • Experienced with React, Next.js, and FastAPI
  • Solid knowledge of PostgreSQL and Redis
  • Ability to design scalable systems and deliver production-grade code quickly
  • Comfortable using AI-assisted coding tools (Cursor, Claude Code, Copilot)
  • Thrives in fast-moving, ambiguous, 0→1 environments
  • Fluent in English (French is a plus)
Job Responsibility:
  • Build fast, lightweight, and reliable front-end applications (React)
  • Develop backend services powering conversational and AI-driven experiences
  • Design and maintain scalable cloud infrastructure (preferably GCP)
  • Set up CI/CD pipelines, observability, monitoring, and infra-as-code
  • Optimize performance, reliability, and security across systems
  • Own core architectural decisions and enforce best engineering practices
  • Ensure code quality, stability, and scalability from the ground up
  • Collaborate closely with early users and customers to iterate quickly
  • Fulltime

Senior DevOps Engineer (GCP)

Our client is a global UK-based financial services and investment banking organi...
Location:
Salary:
Not provided
N-iX
Expiration Date:
Until further notice
Requirements:
  • 5+ years of experience in DevOps, Cloud Engineering, or SRE roles
  • Strong hands-on experience with Google Cloud Platform, including: GKE / Kubernetes, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage, VPC, IAM, networking, security
  • Expertise in Terraform, Helm, or other IaC tools
  • Experience building CI/CD pipelines (GitHub Actions, GitLab CI, CircleCI, Jenkins, etc.)
  • Strong understanding of containerization and orchestration: Docker, Kubernetes
  • Solid experience with monitoring, observability, and logging stacks
  • Familiarity with networking, load balancing, security hardening, and zero-trust principles
  • Experience supporting production systems in high-availability, distributed environments
  • Strong scripting skills (Python, Bash, or similar)
  • Experience working with agile engineering teams
Job Responsibility:
  • Design, implement, and maintain cloud infrastructure on Google Cloud (GKE, Cloud Run, Cloud Functions, Pub/Sub, Cloud Storage)
  • Build and optimize CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)
  • Develop infrastructure-as-code using Terraform or similar tools
  • Set up and maintain container orchestration (Kubernetes, GKE) and automated deployment workflows
  • Implement monitoring, alerting, and observability using tools such as Prometheus, Grafana, ELK/Elastic, Stackdriver, or OpenTelemetry
  • Ensure compliance with security and governance standards across all environments
  • Collaborate closely with engineering teams to ensure scalable, high-performance deployment architectures
  • Support AI/ML and GenAI workloads (Vertex AI pipelines, model hosting, GPU workloads, inference optimization)
  • Manage environment strategies, release pipelines, configuration management, and secrets management
  • Optimize cloud costs and recommend improvements for performance and reliability
What we offer:
  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits