CrawlJobs Logo

SRE Consultant

United States, New Jersey 125088.00 - 156360.00 USD / Year · Job Posted March 25, 2026
Apply Position
Job Link Share

Job Description

The Site Reliability Engineering Consultant at NTT DATA will be responsible for developing and implementing software solutions in a complex environment. This role is part of a multi-year transformation journey that will require a successful candidate to establish best practices, motivate and promote a cultural shift that will ensure a successful adoption of Engineering Principles and Practices within Production Management.

Job Responsibility

  • Demonstrate an in-depth understanding of Software Development Lifecycle
  • Ability to operate in a global environment with on-/near-/off-shore matrix reporting structures
  • Operate into a highly regulated environment
  • Improve the service level the team provides to our end users
  • Drive Continuous Delivery and Automation efforts across the supported applications
  • Foster a culture that promotes transparency and innovation for increased team productivity
  • Coach members of the team and outside the immediate reporting line about the best practices
  • Implement the Agile Framework through one of its implementations like SCRUM or Kanban
  • Avidly communicate progress and project status across the organization

Requirements

  • Bachelors degree in computer science/mathematics/physics or related technical subject
  • 9+ years in a site reliability engineering related role
  • Proven hands-on expertise and the capability to demonstrate technical proficiency in Programming (Java, Python, or equivalent)
  • Containerization
  • Kubernetes
  • GitOps
  • High Availability Systems
  • Infrastructure as a code
  • Configuration Management
  • Observability (tools and implementation)
  • Hyperscale Systems
  • Middleware configuration
  • Relevant experience in a critical software development role with high business impact
  • Excellent engineering skills and senior architecture
  • Excellent working knowledge of key computer science concepts (networking, operating systems, virtualization, containerization, etc.)
  • Polyglot full-stack developer mentality
  • Excellent understanding of Software Engineering concepts like Software Development Life Cycle and GitOps
  • Excellent debugging and analytical skills
  • Experience of delivering software using Agile delivery methodologies is a must (SCRUM/Kanban)
  • Experience of senior stakeholder management
  • Consistently demonstrates clear and concise written and verbal communication skills

Nice to have

  • Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio
  • Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, Cloud Formation, etc.)
  • Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale
  • Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.)
  • Degree in computer science/mathematics/physics or related technical subject

What we offer

  • Medical, dental, and vision insurance with an employer contribution
  • Flexible spending or health savings account
  • Life and AD&D insurance
  • Short and long term disability coverage
  • Paid time off
  • Employee assistance
  • Participation in a 401k program with company match
  • Additional voluntary or legally-required benefits

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

SRE Consultant

8 matching positions

Staff/Senior DevOps Consultant (SRE)

We are seeking a highly skilled Senior Site Reliability Engineer to join our tea...
Location
Location
Pakistan , Karachi, Lahore, Islamabad
Salary
Salary:
Not provided
10pearls.com Logo
10Pearls
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of hands-on experience contributing to DevOps transformations and infrastructure evolution
  • Strong proficiency with Google Cloud Platform (GCP), including services such as Cloud Run, BigQuery, Cloud Storage, and networking
  • Solid experience with Infrastructure as Code (IaC) tools, preferably Terraform, for provisioning and managing GCP resources
  • Hands-on experience with containerization technologies, including Docker
  • Experience with GCP-specific monitoring and logging tools such as Cloud Monitoring, Cloud Logging, Prometheus, and Grafana
  • Working knowledge of at least one programming language, preferably TypeScript, JavaScript, Go, or Python
  • Experience using version control systems such as Git
  • Strong preference for working in agile, fast-paced environments
  • Proven ability to collaborate effectively across cross-functional teams
  • Strong teamwork, communication, and problem-solving skills
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable and secure cloud infrastructure on GCP and AWS/Azure platforms
  • Develop and implement CI/CD pipelines for automated build, testing, and deployment processes
  • Monitor, troubleshoot, and optimize cloud-based systems to ensure high availability, performance, and reliability
  • Configure and manage Argo CD installations for continuous delivery of applications to Kubernetes clusters
  • Collaborate with development and operations teams to streamline workflows and improve efficiency
  • Stay current with emerging cloud technologies and best practices, and provide recommendations for continuous improvement
  • Have a thorough process methodology and ability to communicate with all stakeholders
Read More
Arrow Right

Staff/Senior DevOps Consultant (SRE)

We are seeking a highly skilled Senior Site Reliability Engineer to join our tea...
Location
Location
Pakistan , Karachi; Lahore; Islamabad
Salary
Salary:
Not provided
10pearls.com Logo
10Pearls
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of hands-on experience contributing to DevOps transformations and infrastructure evolution
  • Strong proficiency with Google Cloud Platform (GCP), including services such as Cloud Run, BigQuery, Cloud Storage, and networking
  • Solid experience with Infrastructure as Code (IaC) tools, preferably Terraform, for provisioning and managing GCP resources
  • Hands-on experience with containerization technologies, including Docker
  • Experience with GCP-specific monitoring and logging tools such as Cloud Monitoring, Cloud Logging, Prometheus, and Grafana
  • Working knowledge of at least one programming language, preferably TypeScript, JavaScript, Go, or Python
  • Experience using version control systems such as Git
  • Strong preference for working in agile, fast-paced environments
  • Proven ability to collaborate effectively across cross-functional teams
  • Strong teamwork, communication, and problem-solving skills
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable and secure cloud infrastructure on GCP and AWS/Azure platforms
  • Develop and implement CI/CD pipelines for automated build, testing, and deployment processes
  • Monitor, troubleshoot, and optimize cloud-based systems to ensure high availability, performance, and reliability
  • Configure and manage Argo CD installations for continuous delivery of applications to Kubernetes clusters
  • Collaborate with development and operations teams to streamline workflows and improve efficiency
  • Stay current with emerging cloud technologies and best practices, and provide recommendations for continuous improvement
  • Have a thorough process methodology and ability to communicate with all stakeholders
Read More
Arrow Right

Staff/Senior DevOps Consultant (SRE)

We are seeking a highly skilled Senior Site Reliability Engineer to join our tea...
Location
Location
Pakistan , Karachi, Lahore, Islamabad
Salary
Salary:
Not provided
10pearls.com Logo
10Pearls
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 3+ years of hands-on experience contributing to DevOps transformations and infrastructure evolution
  • Strong proficiency with Google Cloud Platform (GCP), including services such as Cloud Run, BigQuery, Cloud Storage, and networking
  • Solid experience with Infrastructure as Code (IaC) tools, preferably Terraform, for provisioning and managing GCP resources
  • Hands-on experience with containerization technologies, including Docker
  • Experience with GCP-specific monitoring and logging tools such as Cloud Monitoring, Cloud Logging, Prometheus, and Grafana
  • Working knowledge of at least one programming language, preferably TypeScript, JavaScript, Go, or Python
  • Experience using version control systems such as Git
  • Strong preference for working in agile, fast-paced environments
  • Proven ability to collaborate effectively across cross-functional teams
  • Strong teamwork, communication, and problem-solving skills
Job Responsibility
Job Responsibility
  • Design, build, and maintain scalable and secure cloud infrastructure on GCP and AWS/Azure platforms
  • Develop and implement CI/CD pipelines for automated build, testing, and deployment processes
  • Monitor, troubleshoot, and optimize cloud-based systems to ensure high availability, performance, and reliability
  • Configure and manage Argo CD installations for continuous delivery of applications to Kubernetes clusters
  • Collaborate with development and operations teams to streamline workflows and improve efficiency
  • Stay current with emerging cloud technologies and best practices, and provide recommendations for continuous improvement
  • Have a thorough process methodology and ability to communicate with all stakeholders
Read More
Arrow Right

Technical Architect

Lead the design, modernization, and implementation of scalable, secure, and resi...
Location
Location
United States , Armonk
Salary
Salary:
247319.00 - 250000.00 USD / Year
nytimes.com Logo
The New York Times
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's degree or equivalent in Computer Science, Information Technology, Engineering or related and five (5) years of experience as a Consultant Architect, Virtualization Architect, Senior Cloud Architect or related
  • Five (5) years of experience must include utilizing Hybrid Cloud, AWS, Azure, Red Hat Linux, Terraform, Ansible, Python, VMware Cloud Foundation (VCF) Stack
Job Responsibility
Job Responsibility
  • Lead the design, modernization, and implementation of scalable, secure, and resilient hybrid cloud and containerized infrastructure platforms
  • Define and lead the technical architecture strategy for hybrid cloud, container orchestration (Kubernetes, RedHat OpenShift, VMware Tanzu), and virtualized environments (VMware, Nutanix, RedHat)
  • Architect secure and scalable infrastructure across private, public, and hybrid cloud ecosystems
  • Evaluate, design, and implement solutions for computing, storage, networking, identity, and availability zones across global regions
  • Design and implement Kubernetes, RedHat OpenShift clusters across multi-cloud and on-prem environments, including CI/CD integration, policy enforcement, and workload orchestration
  • Define governance, observability, and security patterns for containerized workloads
  • Lead Infrastructure-as-Code (IaC) initiatives using Terraform, Ansible, GitOps, GitHub, PowerShell, and Python
  • Enable self-service infrastructure capabilities through automation frameworks and developer platforms
  • Partner with DevSecOps, SRE, Infrastructure Operations, Security, and Datacenter Operation teams to scope, define, size, and execute application onboarding, modernization, and consolidation initiatives
  • Mentor engineering teams and influence enterprise architecture (EA) roadmaps
  • Fulltime
Read More
Arrow Right

Public Cloud Network Lead

Join us at Barclays as a Public Cloud Network Lead, to architect, implement and ...
Location
Location
United Kingdom , London; Glasgow
Salary
Salary:
Not provided
barclays.co.uk Logo
Barclays
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Multi-Cloud Network Architecture & Hybrid Connectivity – Lead enterprise-scale network design across AWS, Azure, and GCP, delivering hybrid connectivity, encrypted interconnects (MACsec/IPsec), circuit provider management, and legacy infrastructure remediation through Infrastructure as Code
  • Network Security & Compliance – Implement Zero Trust segmentation, deploy cloud-native firewall controls, and ensure compliance with PCI-DSS, DORA, and internal governance frameworks
  • Strategic Planning, Consultancy & Stakeholder Engagement – Define cloud network strategy, evaluate emerging technologies, produce ADRs and HLD/LLD designs, lead Landing Zone design, and influence senior stakeholders on risk, strategy, and cost optimisation
  • Operational Excellence & Incident Response – Own incident escalation, SLA/SLO monitoring, flow analysis, and SRE enablement to drive network operational excellence
  • Automation, IaC & DevOps Practices – Build reusable Terraform, CloudFormation, and Bicep IaC with CI/CD pipelines and Python/Bash automation for standardised network provisioning
Job Responsibility
Job Responsibility
  • architect, implement and operate enterprise-grade multi-cloud network infrastructure at scale for Barclays
  • design secure, high-performance hybrid and multi-cloud architectures connecting thousands of cloud accounts across global regions to Barclays' on-premises infrastructure
  • work horizontally across GTIS Networks, SRE, DevOps, Product, and senior leadership to deliver strategic initiatives and resolve complex technical debt
  • mentor engineers and serving as the escalation point for critical network incidents
  • Build Engineering: Development, delivery, and maintenance of high-quality infrastructure solutions to fulfil business requirements
  • Incident Management: Monitoring of IT infrastructure and system performance to measure, identify, address, and resolve any potential issues, vulnerabilities, or outages
  • Automation: Development and implementation of automated tasks and processes to improve efficiency and reduce manual intervention
  • Security: Implementation of a secure configuration and measures to protect infrastructure against cyber-attacks, vulnerabilities, and other security threats
  • Teamwork: Cross-functional collaboration with product managers, architects, and other engineers to define IT Infrastructure requirements, devise solutions, and ensure seamless integration and alignment with business objectives
  • Learning: Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth
What we offer
What we offer
  • Competitive holiday allowance
  • Life assurance
  • Private medical care
  • Pension contribution
  • Fulltime
Read More
Arrow Right

Staff Engineer, Site Reliability Engineer

OnStar is a cornerstone of General Motors' connected services—bringing safety, s...
Location
Location
Ireland , Dublin
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 8+ years in SRE, DevOps, or systems engineering, including experience managing or mentoring high-impact teams
  • Track record of building and maintaining high-scale, cloud-native systems (preferably AWS, GCP, or Azure)
  • Expertise in container orchestration and deployment strategies using Kubernetes and CI/CD pipelines
  • Proficiency in Python, Go, or Java, with strong code review and readability standards
  • Experience leading cross-functional infrastructure projects, configuration strategy, or organizational tooling initiatives
  • Ability to think and act under pressure
  • Strong communication skills
Job Responsibility
Job Responsibility
  • Lead the design and implementation of scalable, fault-tolerant, and observable infrastructure supporting OnStar mobile and web experiences, in-vehicle services, and the backend platforms and integrations that power them
  • Champion configuration management, infrastructure refactoring, and testing frameworks to strengthen system resilience
  • Partner across SRE, development, and product teams to improve service reliability, deployment safety, and incident response practices
  • Drive internal consultation and strategic planning on reliability standards for new OnStar capabilities, customer-facing releases, and platform initiatives
  • Define and evolve observability strategy using tools such as Prometheus, Grafana, and Datadog, with automated alerting and actionable SLO dashboards
  • Own and improve on-call practices, manage blameless postmortems, and guide root cause analysis to eliminate recurring failures
  • Mentor engineers and help shape a high-performance culture rooted in extreme ownership and operational excellence
  • Support compliance and privacy-driven engineering initiatives across connected services, with potential crossover into areas like data retention and safety certification tooling
  • Fulltime
Read More
Arrow Right

Systems Operations Senior Manager

Wells Fargo is seeking a Systems Operations Senior Manager to lead production st...
Location
Location
United States , CHARLOTTE
Salary
Salary:
Not provided
https://www.wellsfargo.com/ Logo
Wells Fargo
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 7+ years of Systems Engineering and Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 3+ years of SRE management or leadership experience
  • 2+ years of experience in a financial crimes production environment
Job Responsibility
Job Responsibility
  • Manage and develop teams of analysts, associates, and less experienced managers in roles that provide technical services and support for the relevant supported systems
  • Engage and influence stakeholders, internal partners, and peers in order to engineer projects, identify new products and solutions, and research solutions for existing systems
  • Identify and recommend opportunities for administration and maintenance of the remote monitoring and management system, as well as the periodic system review
  • Perform network assessments, security audits, and system enhancement consultations
  • Determine appropriate strategy and actions of Systems Operations team to meet moderate to high risk deliverables
  • Interpret and develop policies and procedures, and understand compliance and risk management requirements for supported system area
  • Provide implementation support for key risk initiatives
  • Collaborate with and influence all levels of professionals, analysts, or associates
  • Ensure the Systems Operations team communicates with customers to keep them informed of incident progress, and notify them of impending changes or agreed outages
  • Manage allocation of people and financial resources for Systems Operations
  • Fulltime
Read More
Arrow Right

Site Reliability Engineer

Location
Location
South Africa , Johannesburg
Salary
Salary:
Not provided
nintex.com Logo
Nintex
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • You provide guidance on infrastructure architecture and contribute to high-quality and successful product releases.
  • You contribute to your team and domain through successfully leading and consistently delivering on projects of ambiguous scope, high complexity, and critical business impact.
  • You contribute to relevant guilds, practice forums and other initiatives to improve Nintex’s DevOps and SRE discipline.
  • You have an in-depth understanding of distributed systems architecture, as well as monitoring and observability practices and tools.
  • You quickly resolve priority infrastructure issues and help other technical team members or Product Managers understand how to avoid them in the future.
  • You provide detailed estimates for work items you propose or assigned.
  • You assist in decision-making around tooling, automation practices, and testing solutions.
  • You stay up-to-date with technology trends and use this knowledge help your team and the broader Engineering practice.
  • You run Nintex infrastructure with IaC tools (as Terraform) and GitHub Actions for automation, containerize our environments (Kubernetes) and leverage cloud technologies to meet our goals
  • You build monitoring that alerts on symptoms rather than outages using tools like Prometheus, Grafana, Alertmanager and PagerDuty
Job Responsibility
Job Responsibility
  • You are highly skilled and sufficiently experienced in Nintex DevOps tools and processes to own a long-term program or technology such as Kubernetes, etc.
  • You write scripts, tools and utilities that support and integrate with delivery pipelines and you integrate telemetry where appropriate.
  • You are called into incidents and bring trusted knowledge in your platform domain.
  • You debug and fix infrastructure issues on production environments quickly using the relevant tools and guidelines to prevent recurrence.
  • You build, promote and support infrastructure patterns and practices within Nintex.
  • You provide coaching/mentoring to other Engineers on the team
  • You lead or contribute to post-mortems for incidents, including root cause analysis and identification of preventative and remedial actions.
  • You continuously monitor our platform performance and take immediate action to improve it
  • You review and advise on appropriate design patterns to solve automation and infrastructure problems without creating technical debt.
  • You design and build complex infrastructure components for distributed systems as Kubernetes.
What we offer
What we offer
  • Global Gratitude and Recharge Days
  • Flexible, paid time off policy
  • Employee wellness programs and counseling resources
  • Meaningful peer recognition and awards
  • Paid parental leave
  • Invention/patenting assistance
  • Community impact, paid volunteer time, and opportunities
  • Intercultural learning and celebration
  • Multiple tools through which to learn and grow, and an incredible global community
Read More
Arrow Right