CrawlJobs Logo

Cloud Infra Engineering Lead

India, Chennai · Job Posted March 26, 2026
Apply Position
Job Link Share

Job Description

The Infrastructure Technology Lead Analyst is a senior level position responsible for leading a variety of infrastructure-Storage related operations with respect to SDS storage and object storage operational and implementation of engineering configurations. The activities include the design, acquisition and deployment of hardware, software, and network infrastructure in coordination with the Technology team. The job required in-depth knowledge in supporting software defined storage (SDS/Object) and expected to provide follow-the-sun support model, day to day management of SDS/Object estate from availability, performance, risk and control, capacity management, monitoring, regulatory compliance and infra stability.

Job Responsibility

  • Create complex project and task plans related to operational initiatives such as version upgrades, service improvement plans, perform impact analyses, solve/work high impact problems/projects, and provide resolution to restore services
  • Provide follow the sun operational support model related to SDS and Cloud object storage
  • Provide Root Cause Analysis (RCA) post restoration of service
  • Design testing approaches, complex processes, reporting streams, and assist with the automation of repetitive tasks
  • Provide technical/strategic direction to team members
  • Review requirement documents, define hardware requirements and update processes and procedures as necessary
  • Ensure ongoing compliance with regulatory requirements
  • Responsible for applications dealing with the overall operating system
  • Conduct project related research
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and /or other team members
  • Appropriately assess risk when business decisions are made, demonstrating consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency

Requirements

  • 12-15 years of relevant experience in a Storage Operations role with sound knowledge of software defined storage and Cloud object storage
  • Proficient in software defined storage like Dell PowerFlex, power scale, and IBM cloud object storage, netapp StorageGrid solutions
  • Experience working in Financial Services or a large complex and/or global environment
  • Sound knowledge of RHEL operating system, VTM remediations/patch installation, firmware upgrades, and troubleshooting experience in a complex software defined storage estate and Cloud object storage estate
  • Design testing approaches, complex processes, reporting streams, and creating automation of repetitive tasks using shell scripting, Pearl scripting, C scripting, ansible and python scripts
  • Provide technical/strategic direction and act as advisor/coach to lower-level analysts
  • Perform hardware capacity forecasting, planning and utilization monitoring
  • To analyze and apply patches / code upgrade, enhancements and perform management tools upgrades
  • Apply new technology and processes to improve system operation, supportability, recoverability, availability and performance
  • Ensure compliance to Citigroup Information Technology Management Policies (CITMP) and Standards
  • Implement network configurations for Software defined backbone VLAN and collaborate with network team
  • Project Management experience or experience in working in complex datacenter project and technology refresh projects
  • Consistently demonstrates clear and concise written and verbal communication
  • Comprehensive knowledge of design metrics, analytics tools, benchmarking activities and related reporting to identify best practices
  • Demonstrated analytic/diagnostic skills
  • Ability to work in a matrix environment and partner with virtual teams
  • Ability to work independently, multi-task, and take ownership of various parts of a project or initiative
  • Ability to work under pressure and manage to tight deadlines or unexpected changes in expectations or requirements
  • Proven track record of operational process changes and improvement
  • Bachelor’s/University degree or equivalent with relevant working experience
  • Ability to develop automation and AI based tools using ansible and coding platform mentioned in the job description
  • Able to review code releases, code/version testing, implementation, post implementation checks, to ensure business applications are stable
  • Strong knowledge and expertise on operating system IBM AIX, Redhat, Windows and virtualization platforms
  • Ability to communicate technical concepts to non-technical audience
  • Ability to work with virtual and in-person teams, and work under pressure or to a deadline

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Cloud Infra Engineering Lead

8 matching positions

Senior Cloud Solution Architect - Cloud & AI Infra

We're seeking a senior Cloud Solution Architect (CSA) specializing in Azure Infr...
Location
Location
Germany , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, or related field OR several years of experience in cloud and infrastructure technologies, IT consulting/support, systems administration, network operations, or architecture
  • Demonstrated technical leadership, including executive presence, strong influencing skills, and proven experience leading end-to-end infrastructure or cloud transformation initiatives
  • Several years of experience in a customer-facing technical role
  • Strong experience with enterprise-scale cloud and hybrid infrastructure, including areas such as: Azure IaaS & PaaS foundations
  • Networking (VNets, connectivity, hybrid networking)
  • Identity & Access Management (Entra ID)
  • Security & Governance
  • Migration, modernization, and business continuity
Job Responsibility
Job Responsibility
  • Build trusted relationships with IT executives and senior technical leaders, acting as a strategic advisor for cloud infrastructure, hybrid, and security transformation
  • Lead architectural design sessions and guide the implementation of secure, scalable, and resilient Azure infrastructure solutions, leveraging Microsoft best practices and frameworks such as Cloud Adoption Framework (CAF) and Well-Architected Framework (WAF)
  • Drive technical excellence across Azure Infrastructure workloads, ensuring mission-critical systems are optimized for availability, security, performance, cost, and operational readiness
  • Own end-to-end technical delivery for a defined set of strategic accounts, aligning infrastructure strategy with customer success plans and business outcomes
  • Accelerate Azure consumption and customer outcomes by resolving complex technical blockers, delivering repeatable IP, and providing delivery oversight for key engagements
  • Support customers through migration and modernization journeys, including Windows & Linux workloads, networking, identity, security, business continuity, and disaster recovery
  • Collaborate with internal teams and partners to design impactful delivery proposals and support execution with clarity and confidence
  • Actively contribute to Microsoft's technical communities by mentoring peers, sharing best practices, and representing infrastructure thought leadership in internal and external forums
  • Maintain deep technical expertise and advanced certifications across Azure Infrastructure services such as Compute, Networking, Storage, Identity, Security, Governance, and Operations
  • Demonstrate a growth mindset by continuously developing skills, aligning with business priorities, and contributing to a culture of learning and excellence
  • Fulltime
Read More
Arrow Right

Cloud Solution Architect - Cloud & AI Infra

Are you excited about Microsoft Azure? Join our team as a Cloud Solution Archite...
Location
Location
Germany , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, or related field AND demonstrated experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • Demonstrated experience working in a customer-facing role (e.g., internal and/or external)
  • Demonstrated experience working on technical projects
  • Technical Certification in Cloud (e.g., Azure, Amazon Web Services, Google, security certifications)
Job Responsibility
Job Responsibility
  • Create business value by translating customer challenges into actionable solutions aligned to high ROI customer outcomes
  • Lead architectural design sessions and deliver secure, scalable, and resilient infrastructure solutions aligned to customer business goals using frameworks like CAF and WAF
  • Partner with technical and sales teams to identify opportunities and develop tailored solutions to drive expansion and business value realization
  • Drive migration and modernization initiatives, including committed proof of concept and production milestones, across infrastructure, data, SAP, and AI workloads
  • Ensure customer environments are optimized for health, resiliency, security, and performance—enabling production-scale AI use cases
  • Deliver repeatable IP and contribute to centralized IP development to accelerate deployment and achieve targeted outcomes
  • Identify and resolve technical blockers to accelerate go-live and ensure delivery excellence
  • Generate incremental pipeline from each engagement by driving next best actions and aligning with business priorities
  • Maintain technical intensity through continuous skilling and certifications in priority workloads such as Azure SQL, PostgreSQL, AKS, App Service, AVS, SAP (Native + RISE), Windows, Linux and Defender for Cloud
  • Engage in technical communities, share best practices, and contribute to knowledge reuse to accelerate customer transformation and success
  • Fulltime
Read More
Arrow Right

Cloud Solution Architect - Cloud & AI Infra

Are you excited about Microsoft Azure? Join our team as a Cloud Solution Archite...
Location
Location
Germany , Multiple Locations
Salary
Salary:
Not provided
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, or related field AND previous solid demonstrated experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting OR equivalent experience
  • Previous experience working in a customer-facing role (e.g., internal and/or external), defining architecture and influencing stakeholders/ advising on adoption strategy
  • Previous experience working on technical projects designing enterprise-scale solutions
  • Technical Certification in Cloud (e.g., Azure, Amazon Web Services, Google, security certifications)
Job Responsibility
Job Responsibility
  • Create business value by translating customer challenges into actionable solutions aligned to high ROI customer outcomes
  • Lead architectural design sessions and deliver secure, scalable, and resilient infrastructure solutions aligned to customer business goals using frameworks like CAF and WAF
  • Partner with technical and sales teams to identify opportunities and develop tailored solutions to drive expansion and business value realization
  • Drive migration and modernization initiatives, including committed proof of concept and production milestones, across infrastructure, data, SAP, and AI workloads
  • Ensure customer environments are optimized for health, resiliency, security, and performance—enabling production-scale AI use cases
  • Deliver repeatable IP and contribute to centralized IP development to accelerate deployment and achieve targeted outcomes
  • Identify and resolve technical blockers to accelerate go-live and ensure delivery excellence
  • Generate incremental pipeline from each engagement by driving next best actions and aligning with business priorities
  • Maintain technical intensity through continuous skilling and certifications in priority workloads such as Azure SQL, PostgreSQL, AKS, App Service, AVS, SAP (Native + RISE), Windows, Linux and Defender for Cloud
  • Engage in technical communities, share best practices, and contribute to knowledge reuse to accelerate customer transformation and success
  • Fulltime
Read More
Arrow Right

Sr. Cloud Solution Architect - Cloud & AI Infra

We’re looking for a Cloud Solution Architect (CSA) to help customers migrate, mo...
Location
Location
United States , Washington D.C.
Salary
Salary:
106400.00 - 203600.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science, Information Technology, Engineering, Business, Liberal Arts, or related field OR equivalent experience
  • 4+ years experience in cloud/infrastructure technologies, information technology (IT) consulting/support, systems administration, network operations, software development/support, technology solutions, practice development, architecture, and/or consulting
  • Active U.S. Government TOP SECRET Security Clearance
  • U.S. citizenship
  • Ability to pass Microsoft Cloud background check
Job Responsibility
Job Responsibility
  • Build trusted relationships with key customer IT decision makers to drive long-term cloud adoption and serve as the Voice of the Customer
  • Lead architectural design sessions and deliver secure, scalable, and resilient infrastructure solutions aligned to customer business goals using frameworks like CAF and WAF
  • Own the end-to-end technical delivery results, ensuring completeness and accuracy of consumption and customer success plans in collaboration with the CSAM
  • Drive migration and modernization initiatives, including committed proof of concept and production milestones, across infrastructure, data, SAP, and AI workloads
  • Ensure customer environments are optimized for health, resiliency, security, and performance—enabling production-scale AI use cases
  • Deliver repeatable IP and contribute to centralized IP development to accelerate deployment and achieve targeted outcomes
  • Identify and resolve technical blockers to accelerate go-live and ensure delivery excellence across key Factory engagements
  • Generate incremental pipeline from each engagement by driving next best actions and aligning with Unified Enterprise Support (ES) priorities
  • Maintain technical intensity through continuous skilling and certifications in priority workloads such as Azure SQL, PostgreSQL, AKS, App Service, AVS, SAP (Native + RISE), Windows, Linux and Defender for Cloud
  • Engage in technical communities, share best practices, and contribute to knowledge reuse to accelerate customer transformation and success
  • Fulltime
Read More
Arrow Right

Security Reliability Engineering Lead

This is a new, bootstrap team focused on applying strong Site Reliability Engine...
Location
Location
United States , San Francisco
Salary
Salary:
293000.00 - 385000.00 USD / Year
openai.com Logo
OpenAI
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10 or more years of experience operating and architecting mission critical infrastructure in high reliability environments
  • Have led the design and maturation of complex on prem, hybrid, or cloud integrated systems, setting durable architectural patterns used by multiple teams
  • Apply Site Reliability Engineering principles at scale, using observability, automation, and incident learnings to materially reduce risk and operational toil
  • Operate comfortably in ambiguity, making sound architectural decisions under pressure while staying close to technical detail
  • Influence cross functional partners across security, identity, network, and platform teams to land reliability improvements without direct authority
Job Responsibility
Job Responsibility
  • Set direction and establish strong foundations
  • Define and evolve infrastructure patterns for on prem and hybrid environments, including self hosted platforms, vendor supported systems, and lab environments
  • Establish standardized, production grade deployment and operational models that replace bespoke implementations
  • Partner with IT, Security, Identity, and Network teams to ensure infrastructure meets reliability, security, and access requirements by design
  • Design and mature the production architecture for IAM adjacent platforms such as Microsoft Entra using SRE principles
  • Establish common management rules and shared resources within Azure subscriptions to ensure consistent, policy aligned operations
  • Build, operate, and scale reliably
  • Own the full lifecycle of infrastructure systems, including deployment, upgrades, patching, recovery, and ongoing operations
  • Operate and harden shared infrastructure provisioned through Infra Terraform, ensuring repeatability, auditability, and safe change management
  • Design and implement infrastructure as code and configuration management to support shared services, identity adjacent systems, and endpoint platforms using tools like Chef, Ansible and Terraform
What we offer
What we offer
  • Medical, dental, and vision insurance for you and your family, with employer contributions to Health Savings Accounts
  • Pre-tax accounts for Health FSA, Dependent Care FSA, and commuter expenses (parking and transit)
  • 401(k) retirement plan with employer match
  • Paid parental leave (up to 24 weeks for birth parents and 20 weeks for non-birthing parents), plus paid medical and caregiver leave (up to 8 weeks)
  • Paid time off: flexible PTO for exempt employees and up to 15 days annually for non-exempt employees
  • 13+ paid company holidays, and multiple paid coordinated company office closures throughout the year for focus and recharge, plus paid sick or safe time (1 hour per 30 hours worked, or more, as required by applicable state or local law)
  • Mental health and wellness support
  • Employer-paid basic life and disability coverage
  • Annual learning and development stipend to fuel your professional growth
  • Daily meals in our offices, and meal delivery credits as eligible
  • Fulltime
Read More
Arrow Right

Senior Engineer – (Systems Engineering, Enterprise Infra & Platform Support)

The Senior Infrastructure & Platform Support Engineer provides end-to-end techni...
Location
Location
United States , Chevy Chase
Salary
Salary:
80000.00 USD / Year
geico.com Logo
Geico
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Experience leading engineering efforts or owning internal, enterprise-scale platforms and working directly with enterprise customers
  • Familiarity with enterprise application lifecycle (selection, deployment, user adoption, decommission, integration layers)
  • Strong background in enterprise infrastructure supporting Windows and Linux systems, including builds, configuration, hardening, and troubleshooting
  • Advanced skills with Windows Server, Active Directory, authentication protocols (Kerberos / LDAP / SAML / OAuth), and Azure AD/identity integrations
  • Solid Linux administration experience (Ubuntu, RHEL, or equivalent), with certifications preferred
  • Proficiency in automation and scripting (PowerShell, Bash, Python)
  • Strong understanding of networking fundamentals: TCP/IP, DNS, DHCP, routing, VPNs, firewalls, load balancers, VLANs, and secure connectivity
  • Hands-on experience with cloud platforms (Azure/AWS), hybrid environments, virtualization (vSphere/Hyper-V), and containers (Docker, Kubernetes)
  • Knowledge of monitoring and observability tools, such as Prometheus, Grafana, or equivalent solutions
  • Familiarity with database concepts, performance tuning, and integration of MySQL/PostgreSQL/SQL Server/Oracle with enterprise systems
Job Responsibility
Job Responsibility
  • Provide technical leadership to ensure strong engineering standards and operational excellence
  • Support, configure, and maintain both Linux and Windows server platforms, including application servers, integration components, and system services
  • Design and implement infrastructure solutions for workplace technologies including but not limited to digital mailroom, physical security & safety, and real estate facility management technology platforms—covering on-prem systems, hybrid setups, and SaaS applications
  • Build production-ready configurations emphasizing reliability, maintainability, scalability, and testability
  • Lead incident response, troubleshooting, root-cause analysis, and drive ongoing performance optimization
  • Execute DevOps activities including CI/CD pipeline management, automation scripting, monitoring setup, and Infrastructure as Code
  • Ensure platform observability through logging, alerting, dashboards, and automated health checks
  • Apply secure design practices, compliance controls, network segmentation, encryption, and access management
  • Manage platform lifecycle activities such as patching, upgrades, capacity planning, backups, disaster recovery and identifying opportunities for automation and standardization
  • Collaborate with cross-functional teams, vendors, and senior engineers, communicating clearly with technical and non-technical stakeholders
What we offer
What we offer
  • Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being
  • Financial benefits including market-competitive compensation
  • a 401K savings plan vested from day one that offers a 6% match
  • performance and recognition-based incentives
  • and tuition assistance
  • Access to additional benefits like mental healthcare as well as fertility and adoption assistance
  • Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year
  • Fulltime
Read More
Arrow Right

Senior DevOps Lead Data Engineering

We are seeking an experienced and highly skilled Senior DevOps Engineering Lead ...
Location
Location
India , Pune
Salary
Salary:
Not provided
https://www.citi.com/ Logo
Citi
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field
  • 7-10 years of hands-on experience in DevOps, Site Reliability Engineering (SRE), or a similar role
  • Proven expertise in designing and implementing DevOps leadership
  • In-depth experience with container orchestration platforms like AWS, Openshift ECS (or Kubernetes)
  • Strong practical experience with Elastic Stack (Elasticsearch, Kibana, Logstash, Beats) for transaction, logging and monitoring
  • Proficiency in scripting languages (Schell scripting, Python is a must), and strong Java development skills, particularly for tooling and automation
  • Demonstrated knowledge of microservices architecture principles and operational challenges
  • Familiarity with machine-to-machine authentication and authorization mechanisms
  • Must have knowledge of automation principles and practices
  • Experience with job scheduling tools like Autosys
Job Responsibility
Job Responsibility
  • CI/CD Pipeline Ownership: Design, implement, and maintain robust, scalable, and secure CI/CD pipeline architecture for microservices applications, ensuring continuous integration, delivery, and deployment
  • Infrastructure Planning & Management: Lead the design, provisioning, optimization and management of scalable data infrastructure (compute, storage, networking) across cloud , ECS and/or on-premise environments, specifically supporting a data mesh paradigm
  • Elastic Stack Expertise: Manage and optimize Elastic Stack (Elasticsearch, Kibana, Logstash, Beats) for centralized logging, monitoring, and analytics
  • Automation & Scripting: Design, Develop and maintain automation scripts and tools using Shell scripts, Python, Java, or other relevant languages to streamline operational tasks and improve efficiency
  • Infrastructure Procurement & Lifecycle: Oversee the end-to-end Solution (SLTN) process for infrastructure procurement, ensuring timely and compliant acquisition of resources
  • Capacity Estimation & Planning: Conduct thorough capacity planning and performance analysis for microservices and underlying infrastructure to ensure scalability and reliability
  • Access Management & Security: Design and implement secure machine-to-machine communication strategies and manage infrastructure access, adhering to security best practices
  • Microservices Operations: Provide operational expertise and support for highly distributed microservices architectures, including troubleshooting, performance tuning, and incident response
  • Governance & Observability: Implement and enforce data governance policies through automation, and establish comprehensive observability (monitoring, logging, alerting) for data pipelines and infrastructure
  • Mentorship & Best Practices: Mentor junior DevOps engineers, promote DevOps best practices (e.g., IaC, GitOps, observability), and foster a culture of continuous improvement
  • Fulltime
Read More
Arrow Right

Cloud Tech Lead

As a Tech Lead in our Cloud Infrastructure Services team, you drive how we desig...
Location
Location
Czech Republic , Prague
Salary
Salary:
Not provided
ataccama.com Logo
Ataccama
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Solid hands‑on experience with Kubernetes in production and at least one major cloud provider (AWS or Azure)
  • Strong skills in infrastructure as code (for example Terraform) and CI/CD
  • A track record of improving reliability, security, or observability of critical services
  • Experience leading technical work: designing solutions, breaking down projects, unblocking teammates
  • Clear and pragmatic communication with both engineers and non‑technical stakeholders
Job Responsibility
Job Responsibility
  • Lead technical delivery for selected CIS domains (for example networking, ORLOP, observability, EDGE infra, governance)
  • Shape infrastructure designs and implementation details with our Cloud Architect and Product / TPM
  • Improve reliability and operability of core services through better automation, monitoring, and runbooks
  • Support on‑call and incident response, turning learnings into lasting improvements
  • Coach engineers so the whole team can move faster with confidence
What we offer
What we offer
  • Long-Term Incentive Program
  • 2 sick days and 25 days of vacation, with the option to request additional Flexible Time-Off days when needed
  • The Global Family Support Program - a paid leave program to help all parents focus on the new addition to their family
  • Flexible working hours & hybrid work setup
  • Benefit Plus - flexible employee benefit platform (incl. Multisport card)
  • Annual package for mental health support
  • "Bring Your Friend" referral program
  • Shared company cards for free entrance to Prague Zoo & Botanical garden
  • Company bikes, longboards, e-scooters
  • Conference tickets to the best industry events of the year
  • Fulltime
Read More
Arrow Right