CrawlJobs Logo

AI Ops Engineer

nttdata.com Logo

NTT DATA

Location Icon

Location:
India , Noida

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

The AI Ops Engineer role involves designing, deploying, and optimizing AI-powered applications on Azure. Candidates should have 4+ years of experience in software engineering or cloud platforms, with strong skills in operationalizing AI/ML applications. Responsibilities include managing deployment pipelines, monitoring performance, and ensuring system reliability. A degree in Computer Science or related field is required.

Job Responsibility:

  • Design, deploy, operate, and optimize enterprise-grade AI-powered applications and intelligent agents on Azure that support business workflows and customer interactions at scale
  • Operationalize AI/ML models and LLM-powered applications by managing deployment pipelines, monitoring performance, ensuring reliability, and maintaining scalability in production environments
  • Work closely with engineering, product, and CX teams to ensure AI systems run efficiently in production
  • Leverage Azure services such as Azure OpenAI, Azure Machine Learning, Cognitive Services, Kubernetes, and DevOps pipelines to operationalize AI workloads, continuously monitor model performance, improve latency and accuracy, and ensure governance, security, and system stability
  • Deploy AI agents and AI-powered applications to production environments
  • Maintain CI/CD pipelines for AI models and applications
  • Monitor AI system performance, reliability, and usage metrics
  • Troubleshoot operational issues including latency, hallucinations, or integration failures
  • Implement logging, observability, and evaluation frameworks for AI systems
  • Manage Azure infrastructure supporting AI workloads
  • Ensure security, compliance, and governance for AI deployments
  • Continuously improve system scalability, stability, and operational efficiency
  • Collaborate with AI engineers and product teams to operationalize new AI features

Requirements:

  • 4+ years of hands-on software engineering, cloud, or platform engineering experience
  • Strong experience operationalizing AI/ML or GenAI applications in production environments
  • Proven expertise with Microsoft Azure cloud platform, especially AI/ML services
  • Experience with CI/CD pipelines, infrastructure automation, and cloud deployments
  • Strong troubleshooting, monitoring, and production reliability experience
  • Ability to independently manage AI deployments end-to-end
  • Degree in Computer Science, Engineering, Data Science, or equivalent practical experience
  • Experience deploying and managing AI/ML and LLM-based applications in production
  • Hands-on experience with Azure OpenAI, Azure Machine Learning, Azure AI Studio, and Cognitive Services
  • Knowledge of containerization and orchestration (Docker, Kubernetes, AKS)
  • Experience with CI/CD pipelines such as Azure DevOps or GitHub Actions
  • Familiarity with agentic AI frameworks such as LangChain, LlamaIndex, Semantic Kernel, AutoGen, or CrewAI from an operational perspective
  • Understanding of RAG architectures, vector databases, and AI observability tools
  • Strong Python scripting and automation experience
  • Experience monitoring AI models including logging, evaluation, performance metrics, and alerting
  • Knowledge of MLOps/LLMOps practices including model versioning, governance, and lifecycle management
  • Familiarity with Git, infrastructure-as-code, and standard DevOps workflows
  • Strong debugging, production support, and performance optimization skills

Additional Information:

Job Posted:
March 21, 2026

Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for AI Ops Engineer

Machine Learning Ops Engineer

The Customer AI & Rapid Prototyping department stands at the forefront of digita...
Location
Location
Portugal , Oporto; Lisbon; Funchal; Ponta delgada
Salary
Salary:
Not provided
https://www.tui.com Logo
TUI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience in productionising and using various AI models and algorithms
  • Experience in deploying AI solutions using CI/CD pipelines, API development and containers
  • Strong programming skills in Python
  • Understanding of machine learning/AI frameworks and libraries
  • Hands-on experience with cloud technologies and services (e.g., AWS, Azure, Google Cloud)
  • Experience with monitoring and log collection systems (e.g. DataDog)
  • Some experience with Generative AI technologies (e.g. Bedrock, Langchain, LangGraph)
  • Customer-focused engineer with a passion for crafting high-quality digital products, continuous improvement, and effective team collaboration
  • Strong problem-solving and communication skills, with an understanding of the social, legal, and ethical impact of AI technologies
Job Responsibility
Job Responsibility
  • Develop, implement, and maintain machine learning models and algorithms
  • Work closely with cross-functional teams to integrate ML solutions into production systems
  • Monitor and optimize the performance of deployed AI models
  • Collaborate with engineering colleagues on AI-related tasks to deliver impactful, data-driven solutions
  • Research, evaluate, and test new approaches, processes, and tools
What we offer
What we offer
  • Attractive remuneration
  • bonus opportunity
  • exclusive travel perks & discounts
  • extensive health & wellbeing support
  • Flexible working
  • hybrid or remote working models
  • Opportunities to upskill, reskill and grow your career
  • Access the TUI Tech Learning Hub
  • Participate in our tech communities and collaborate on global projects and teams
  • Get involved with incredible local charity and sustainability initiatives like the TUI Care Foundation and the Sustainable Tech Community
  • Fulltime
Read More
Arrow Right

ML Ops Engineer

As an MLOps Engineer, you will be responsible for building, maintaining, and opt...
Location
Location
India , Hyderabad
Salary
Salary:
Not provided
nstarxinc.com Logo
NStarX
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 4 to 10 years of experience in MLOps, DevOps, or ML Engineering
  • Strong proficiency with cloud platforms such as AWS, Azure, or GCP
  • Experience with containerization and orchestration tools like Docker and Kubernetes
  • Hands-on experience with ML model deployment, monitoring, and scaling
  • Proficiency with CI/CD tools such as Jenkins or GitLab CI
  • Familiarity with data versioning and management tools such as DVC
  • Strong coding skills in Python with knowledge of ML libraries like TensorFlow or PyTorch
  • Strong problem-solving skills and ability to work in a collaborative environment
  • Effective communication skills for cross-functional teamwork
Job Responsibility
Job Responsibility
  • Develop and manage infrastructure for end-to-end ML workflows including model training, deployment, monitoring, and maintenance
  • Implement CI/CD pipelines for ML models and data workflows
  • Collaborate with cross-functional teams to build scalable and robust ML infrastructure on cloud and on-premises environments
  • Monitor and optimize model performance and infrastructure to ensure efficient resource usage
  • Manage data versioning and model versioning across multiple environments
  • Implement security, governance, and compliance protocols in ML deployment and data pipelines
  • Support troubleshooting, debugging, and incident management for ML infrastructure issues
What we offer
What we offer
  • Competitive compensation
  • Opportunity to work with a dynamic team on cutting-edge AI and ML solutions
  • Professional growth and development opportunities
  • Fulltime
Read More
Arrow Right

Senior AI Engineer

We are seeking a Senior AI Engineer (L4, Individual Contributor) to design, buil...
Location
Location
India , Chennai
Salary
Salary:
Not provided
arcadia.com Logo
Arcadia
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 12+ years of professional software engineering experience
  • 3+ years in AI/ML development
  • Strong expertise in Python, PyTorch/TensorFlow, scikit-learn, and ML tooling (MLflow, LangChain)
  • Proficiency with SQL, cloud services (AWS), containers (Docker, Kubernetes), and distributed systems
  • Understanding of modern AI research (LLMs, diffusion models, transformers)
  • Experience deploying ML models in production with CI/CD
  • Strong analytical skills, ability to balance speed and rigor in experimentation
  • A passion for sustainability and the clean-energy mission
  • Experienced with building agentic pipelines with the latest models from Anthropic, Google, OpenAI, and more
Job Responsibility
Job Responsibility
  • Integrate with LLMs and be an expert in prompt engineering to derive the right results from the models with limited hallucination
  • Design and train ML/AI models (forecasting, NLP, graph learning, generative AI) to improve data quality, cost effectiveness, and system scalability
  • Deploy and optimize models for large-scale production workloads using Python-based services in AWS/Kubernetes environments
  • Build robust, automated data pipelines and ML Ops workflows for continuous training and deployment
  • Research and experiment with modern AI methods (transformers, foundation models, reinforcement learning) and adapt them to energy-sector challenges not limited to utility statements
  • Drive performance improvements in model accuracy, latency, and cost efficiency
  • Collaborate with Product, SRE, and Analytics teams to deliver AI-enabled features across Arcadia’s platform
  • Write clean, maintainable code, contribute to architecture reviews, and mentor junior engineers
  • Build true agentic workflows with multi-step processing incorporating RAG pipelines and MCPs
What we offer
What we offer
  • Competitive compensation and employee stock options
  • Hybrid/remote-first working model (India-based role, with global collaboration)
  • Flexible leave policy
  • Comprehensive medical insurance (self + family members)
  • Annual performance cycle + quarterly recognition awards
  • A supportive, diverse engineering culture grounded in empathy, teamwork, and innovation
  • Fulltime
Read More
Arrow Right

AI Engineer

We are seeking a Senior AI Engineer to help us speed up the improvements in our ...
Location
Location
Mexico , Mexico City
Salary
Salary:
70000.00 - 90000.00 USD / Year
https://www.snappr.com Logo
Snappr
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in Python
  • Experience working in a Typescript tech stack
  • Excellent problem-solving and communication skills
  • Excellent English communication skills and experience working with English-speaking teams
  • Bachelor degree in engineering with distinctive performance
  • Experience with ML OPs is a plus
  • Experience working in infrastructure is a plus
  • Experience deploying to GCP and AWS is a plus
  • Experience working as part of one of the labs that created a T21 model is a plus
Job Responsibility
Job Responsibility
  • Automating our AI Image Generation
  • Deploy these models to production, ensuring efficient performance
  • Collaborate with cross-functional teams for broader product integration
  • Keep up to date with the latest advancements in machine learning and artificial intelligence
  • Taking a leading role in building an engineering culture built on rapid iteration and continuous improvement
What we offer
What we offer
  • Equity
  • 12O USD per month for professional growth
  • 20 days of PTO
  • Fulltime
Read More
Arrow Right

AI Engineer

We are seeking a Senior AI Engineer to help us speed up the improvements in our ...
Location
Location
Colombia , Medellin
Salary
Salary:
70000.00 - 90000.00 USD / Year
https://www.snappr.com Logo
Snappr
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Proficiency in Python
  • Experience working in a Typescript tech stack
  • Excellent problem-solving and communication skills
  • Excellent English communication skills and experience working with English-speaking teams
  • Bachelor degree in engineering with distinctive performance
  • Experience with ML OPs is a plus
  • Experience working in infrastructure is a plus
  • Experience deploying to GCP and AWS is a plus
  • Experience working as part of one of the labs that created a T21 model is a plus
Job Responsibility
Job Responsibility
  • Automating our AI Image Generation
  • Deploy these models to production, ensuring efficient performance
  • Collaborate with cross-functional teams for broader product integration
  • Keep up to date with the latest advancements in machine learning and artificial intelligence
  • Taking a leading role in building an engineering culture built on rapid iteration and continuous improvement
What we offer
What we offer
  • Equity
  • 12O USD per month for professional growth
  • 20 days of PTO
  • Fulltime
Read More
Arrow Right

AI Engineer

Location
Location
United Kingdom , London
Salary
Salary:
Not provided
light-it.net Logo
Light IT Global
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong Python (typing, tests, packaging) and web/service basics (FastAPI, Docker)
  • Hands-on building LLM apps with one or more: LangChain, LangGraph, AutoGen, Dify (or equivalents)
  • RAG end-to-end experience: corpus prep → embeddings → retrieval → re-rank → context assembly → measurable gains
  • Evaluation mindset: design offline eval sets and release gates
  • reason about faithfulness vs relevancy
  • control p95 latency & $/session
  • Cloud proficiency in AWS or Azure (using at least one in production)
  • Comfort with observability (OpenTelemetry/LangSmith/Prometheus/Grafana or similar)
  • English B2+
  • clear written & verbal client communication
Job Responsibility
Job Responsibility
  • Design & build LLM applications: RAG over client KBs, multi-tool/agent graphs, prompt strategies, and voice UX (STT/TTS)
  • Implement retrieval & data flows: ingestion, chunking, metadata/embeddings, hybrid retrieval, re-ranking, caching
  • Own evaluation & quality: latency/cost budgets, A/Bs
  • Automate with n8n (or similar): connect services, data, and back-office workflows
  • expose reliable admin ops for non-engineers
  • Ship pipelines: Airflow for orchestration
  • dbt for transformations/semantic layer where relevant
  • Productionize: tracing/observability, feature flags, safe rollouts, secrets & config hygiene
  • Collaborate with clients: discover scope, explain trade-offs, demo progress
  • optional presales spikes/estimates
What we offer
What we offer
  • Flexible work-from-home policy
  • Competitive salary and performance review
  • PE accounting and support
  • 18 paid vacation days per year
  • Unlimited paid sick days per year
  • The system of bonuses (Sport/Health/Education)
  • Expert community within the company
  • Paid courses and trainings, internal knowledge library
Read More
Arrow Right

Ops Performance Manager - Data & Ai Focus

OPS Performance Manager - Data & AI Focus (m/w/d). We love what we do – from day...
Location
Location
Salary
Salary:
Not provided
condor.com Logo
condor
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • University degree in Data Analytics, Business Informatics, Aviation Management, Engineering, or a comparable field of study
  • Solid experience in data analysis and visualization, ideally with Power BI, SQL, Python, or R
  • Passion for artificial intelligence and hands-on experience with GPTs, machine learning approaches, or workflow automation
  • Strong ability to translate complex data into clear, actionable insights
  • Structured, analytical, and results-oriented working style with a high degree of ownership and initiative
  • Excellent English communication skills, both written and spoken
Job Responsibility
Job Responsibility
  • Independently identify areas for improvement and develop data-driven solutions to sustainably enhance operational performance
  • Develop and implement AI-driven analytical methods (e.g., predictive analytics, GPT-powered tools, or process automations) to support data-informed operational decisions
  • Build, maintain, and continuously improve interactive dashboards and reports in Power BI to visualize performance indicators, process stability, and operational efficiency
  • Analyze large and complex operational datasets (e.g., punctuality, rotation stability, turnaround performance, irregularities) to identify trends, root causes, and optimization opportunities
  • Design and maintain data interfaces and automation workflows (e.g., using Python, Power Automate, or API-based systems)
  • Prepare executive-level presentations with clear insights and actionable recommendations for senior management
  • Lead and support special projects related to digital transformation and the advancement of data-driven decision-making processes
What we offer
What we offer
  • Atmosphere - through a friendly and motivated team and flat hierarchies
  • Excitement - through an interesting and varied job in a fascinating industry
  • Further development - through competent induction as well as training and development opportunities
  • Benefits - through attractive travel discounts and social benefits in addition to industry-standard compensation
Read More
Arrow Right

Staff Machine Learning Engineer

Machine Learning Engineers at Rocket Money further our mission by building produ...
Location
Location
United States , San Francisco; Washington, D.C.; New York City; Silver Spring; Miami; Denver
Salary
Salary:
210000.00 - 260000.00 USD / Year
truebill.com Logo
Truebill
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years of professional experience in machine learning engineering or data science roles
  • Proven track record of designing and implementing ML systems at consumer tech scale and speed
  • Extensive hands-on experience integrating ML and AI methods into production workflows, including creating evaluation tooling and effective user feedback mechanisms
  • Experience with prompt engineering and management, creating robust systems for testing and optimizing LLM-based applications
  • Expert-level proficiency in Python, SQL, and at least a handful of common ML frameworks
  • Understanding of ML methods at a fundamental level
  • Master at taking ambiguous problems, creating clarity, and breaking down work into manageable chunks for implementation
  • Owned the development, launch, and maintenance for several scaled ML/AI powered product experiences
  • Understand basic software engineering and computer science fundamentals and have applied them at consumer grade scale to build ML powered products in production environments
  • Technical leader who can identify both emergent technical opportunities and gaps relative to best practice
Job Responsibility
Job Responsibility
  • Lead the architecture and development of complex AI and ML powered features across Rocket Money's product suite
  • Design, implement, and maintain robust evaluation frameworks
  • Develop novel new product experiences
  • Own end to end development and implementation of ML and AI product features in collaboration with cross-functional product development teams
  • Provide technical mentorship
What we offer
What we offer
  • Health, Dental & Vision Plans
  • Competitive Pay
  • 401k Matching
  • Unlimited PTO
  • Lunch daily (in-office only)
  • Snacks & Coffee (in-office only)
  • Commuter benefits (in-office only)
  • Bonus
  • Fulltime
Read More
Arrow Right