Site Reliability Engineering Consultant

NTT DATA

Location:
United States, New Jersey

Contract Type:
Not provided

Salary:

125088.00 - 156360.00 USD / Year

Job Description:

The Site Reliability Engineering Consultant will be responsible for developing and implementing software solutions in a complex, multi-disciplinary environment. The role requires a comprehensive understanding of the software development lifecycle, excellent engineering skills, and the ability to operate in a global environment. The candidate will drive continuous delivery and automation efforts while coaching team members on best practices.

Job Responsibility:

  • Demonstrate an in-depth understanding of Software Development Lifecycle and how it integrates within the overall technology landscape to deliver scalable, reliable and resilient applications
  • Ability to operate in a global environment with on-/near-/off-shore matrix reporting structures
  • Operate in a highly regulated environment that requires in-depth understanding of the regulatory requirements and the industry implications for our technologies
  • Improve the service level the team provides to our end users, which includes maximizing operational efficiencies, strengthening incident management, problem management and knowledge sharing practices
  • Drive Continuous Delivery and Automation efforts across the supported applications by means of Root Cause Analysis reviews, Knowledge management, Performance tuning, and user training
  • Foster a culture that promotes transparency and innovation for increased team productivity
  • Coach team members and those outside the immediate reporting line on best practices, and recognize anti-patterns so that they are quickly addressed
  • Implement the Agile Framework through one of its implementations like SCRUM or Kanban and ensure it integrates with overall organization processes
  • Avidly communicate progress and project status across the organization and ensure that stakeholders are managed appropriately throughout the execution period

Requirements:

  • Relevant experience in a critical software development role with high business impact
  • Excellent engineering skills and senior-level architecture experience
  • Excellent working knowledge of key computer science concepts (networking, operating systems, virtualization, containerization, etc.)
  • Polyglot full-stack developer mentality
  • Excellent understanding of Software Engineering concepts like Software Development Life Cycle and GitOps
  • Excellent debugging and analytical skills
  • Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio, is highly desirable
  • Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, CloudFormation, etc.) is highly desirable
  • Experience of delivering software using Agile delivery methodologies is a must (SCRUM/Kanban)
  • Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale is desirable
  • Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.) is desirable
  • Degree in computer science/mathematics/physics or related technical subject is desirable
  • Experience of senior stakeholder management
  • Consistently demonstrates clear and concise written and verbal communication skills
  • Bachelor's degree in computer science/mathematics/physics or related technical subject
  • 9+ years in a site reliability engineering-related role with proven hands-on expertise and the capability to demonstrate technical proficiency in the following:
  • Programming (Java, Python, or equivalent)
  • Containerization
  • Kubernetes
  • GitOps
  • High Availability Systems
  • Infrastructure as Code
  • Configuration Management
  • Observability (tools and implementation)
  • Hyperscale Systems
  • Middleware configuration

Nice to have:

  • Operational experience of deploying and running services at scale on top of Docker/Kubernetes stack and a service mesh, like Istio
  • Operational experience with orchestration tools for CI/CD and Infrastructure-as-Code tooling (Terraform, Cloud Formation, etc.)
  • Operational experience of using middleware technologies (MQ, Apache Kafka, etc.) to run services at scale
  • Strong experience with end-to-end observability stacks (Datadog, AppDynamics, Dynatrace, etc.)
  • Degree in computer science/mathematics/physics or related technical subject

What we offer:

  • medical, dental, and vision insurance with an employer contribution
  • flexible spending or health savings account
  • life and AD&D insurance
  • short and long term disability coverage
  • paid time off
  • employee assistance
  • participation in a 401k program with company match

Additional Information:

Job Posted:
April 23, 2026

Employment Type:
Full-time
Work Type:
Hybrid work

Similar Jobs for Site Reliability Engineering Consultant

Operations Consultant

The goal of an Operations Consultant is to help implement and integrate complex ...
Location:
United States, Indianapolis
Salary:
Not provided
Quantumlytix
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s and/or Master’s degree in Industrial Engineering
  • Excellent analytical and proactive problem solving skills
  • Ability to communicate with clarity
  • Excellent written and communication skills
  • Ability to interact professionally with employees, managers and top executives
  • Eagerness to contribute in team setting
  • Fully versed in MS Excel, MS PowerPoint, MS Outlook and MS Word.
  • Required fluency in English, both written and spoken
  • Technical writing ability required
  • Flexibility to travel 100% to client sites in the US and internationally
Job Responsibility:
  • Assist client companies in implementing and integrating global change initiatives by working with all levels within the client's company.
  • Coordinate quality control objectives and activities to resolve production problems, maximize product reliability, and minimize cost.
  • Develop manufacturing methods, labor utilization standards, and cost analysis systems to promote efficient staff and facility utilization.
  • Apply statistical methods and perform mathematical calculations to determine manufacturing processes, staff requirements, and production standards.
  • Study operations sequence, material flow, functional statements, organization charts, and project information to determine worker functions and responsibilities and collect data.
  • Analyze statistical data and product specifications to determine standards and establish quality and reliability objectives of finished product.
  • Draft and design layout of facilities, equipment, materials, and workspace to illustrate maximum efficiency, using drafting tools and computer, simulation modeling and 3D design.
  • Confer with vendors, staff, and management personnel regarding purchases, procedures, product specifications, manufacturing capabilities, and project status.
  • Review production schedules, engineering specifications, orders, and related information to obtain knowledge of manufacturing methods, procedures, and activities.
  • Evaluate precision and accuracy of production and testing equipment and engineering drawings to formulate corrective action plan.
Employment Type:
Full-time

Technical Customer Support Engineer

We are seeking a Technical Support Engineer who is fluent in English to join our...
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Technical breadth and depth in ClickHouse open-source or ClickHouse Cloud, or in domains relevant to ClickHouse, such as: SQL databases, OLAP, cloud-native SaaS, distributed systems
  • Previous technical experience in roles such as Support Engineer, Consultant, Database Administrator, Site Reliability Engineer, Solutions Engineer, Software Engineer, and/or Systems Engineer
  • Be present and available according to the scheduling required to deliver high-quality 24x7 Support in a global, distributed environment
  • Strong written and verbal English communication skills and the ability to work fully remote with reliable connectivity
  • A mindset of teamwork, global engagement, empathy, and solving challenging problems
  • A sense of adventure and urgency in building the most scalable, high-performing, largest, and fastest databases on the planet
  • The ability to build trusted relationships with colleagues, customers, and partners
  • You are self-driven, curious, and eager to learn and grow continuously
Job Responsibility:
  • Supporting and guiding our ClickHouse users, customers, and prospects via cases, chat, Slack, community, and meetings
  • Develop solutions based on ClickHouse Cloud and ClickHouse open-source that can be shared with our users, community, and customers via documentation, knowledge base, blogs, meetups, webinars, and training
  • Work closely with our global Support Services, Engineering, Go to Market, and Product Management teams to help define functionality required by users and customers
  • Assist with mentoring, training, and sharing your knowledge with colleagues, users, and customers
  • You will deliver excellent customer service as a first-line technical engineer and representative of ClickHouse. Our Support Engineers provide professional response, on-call coverage, and guidance within the required Service Level Agreements ("SLAs") on technical cases that are opened via a ticketing system, email, Slack, chat, and/or phone
  • You will build strong, trusted relationships with colleagues, customers, and partners
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Technical Customer Support Engineer

We are currently growing our support team at ClickHouse who provides excellent s...
Location:
United Kingdom
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Technical breadth and depth in ClickHouse open-source or ClickHouse Cloud, or in domains relevant to ClickHouse, such as: SQL databases, OLAP, cloud-native SaaS, distributed systems
  • Previous technical experience in roles such as Support Engineer, Consultant, Database Administrator, Site Reliability Engineer, Solutions Engineer, Software Engineer, and/or Systems Engineer
  • Be present and available according to the scheduling required to deliver high-quality 24x7 Support in a global, distributed environment
  • Strong written and verbal English communication skills and the ability to work fully remote with reliable connectivity
  • A mindset of teamwork, global engagement, empathy, and solving challenging problems
  • A sense of adventure and urgency in building the most scalable, high-performing, largest, and fastest databases on the planet
  • The ability to build trusted relationships with colleagues, customers, and partners
  • You are self-driven, curious, and eager to continuously learn and grow
Job Responsibility:
  • Supporting and guiding our ClickHouse users, customers, and prospects via cases, chat, Slack, community, and meetings
  • Develop solutions based on ClickHouse Cloud and ClickHouse open-source that can be shared with our users, community, and customers via documentation, knowledge base, blogs, meetups, webinars, and training
  • Work closely with our global Support Services, Engineering, Go to Market, and Product Management teams to help define functionality required by users and customers
  • Assist with mentoring, training, and sharing your knowledge with colleagues, users, and customers
  • You will deliver excellent customer service as a first-line technical engineer and representative of ClickHouse. Our Support Engineers provide professional response, on-call coverage, and guidance within the required Service Level Agreements ("SLAs") on technical cases that are opened via a ticketing system, email, Slack, chat, and/or phone
  • You will build strong, trusted relationships with colleagues, customers, and partners
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites

Technical Customer Support Engineer

We are currently growing our support team at ClickHouse who provides excellent s...
Location:
United States
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Technical breadth and depth in ClickHouse open-source or ClickHouse Cloud, or in domains relevant to ClickHouse, such as: SQL databases, OLAP, cloud-native SaaS, distributed systems
  • Previous technical experience in roles such as Support Engineer, Consultant, Database Administrator, Site Reliability Engineer, Solutions Engineer, Software Engineer, and/or Systems Engineer
  • Be present and available according to the scheduling required to deliver high-quality 24x7 Support in a global, distributed environment
  • Strong written and verbal English and German communication skills and the ability to work fully remote with reliable connectivity
  • A mindset of teamwork, global engagement, empathy, and solving challenging problems
  • A sense of adventure and urgency in building the most scalable, high-performing, largest, and fastest databases on the planet
  • The ability to build trusted relationships with colleagues, customers, and partners
  • You are self-driven, curious, and eager to continuously learn and grow
Job Responsibility:
  • Supporting and guiding our ClickHouse users, customers, and prospects via cases, chat, Slack, community, and meetings
  • Develop solutions based on ClickHouse Cloud and ClickHouse open-source that can be shared with our users, community, and customers via documentation, knowledge base, blogs, meetups, webinars, and training
  • Work closely with our global Support Services, Engineering, Go to Market, and Product Management teams to help define functionality required by users and customers
  • Assist with mentoring, training, and sharing your knowledge with colleagues, users, and customers
  • You will deliver excellent customer service as a first-line technical engineer and representative of ClickHouse. Our Support Engineers provide professional response, on-call coverage, and guidance within the required Service Level Agreements ("SLAs") on technical cases that are opened via a ticketing system, email, Slack, chat, and/or phone
  • You will build strong, trusted relationships with colleagues, customers, and partners
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Employment Type:
Full-time

Technical Customer Support Engineer

We are seeking a Technical Support Engineer to join our global Support Engineeri...
Location:
France
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Technical breadth and depth in ClickHouse open-source or ClickHouse Cloud, or in domains relevant to ClickHouse, such as: SQL databases, OLAP, cloud-native SaaS, distributed systems
  • Previous technical experience in roles such as Support Engineer, Consultant, Database Administrator, Site Reliability Engineer, Solutions Engineer, Software Engineer, and/or Systems Engineer
  • Be present and available according to the scheduling required to deliver high-quality 24x7 Support in a global, distributed environment
  • Strong written and verbal English and French communication skills and the ability to work fully remote with reliable connectivity
  • A mindset of teamwork, global engagement, empathy, and solving challenging problems
  • A sense of adventure and urgency in building the most scalable, high-performing, largest, and fastest databases on the planet
  • The ability to build trusted relationships with colleagues, customers, and partners
  • You are self-driven, curious, and eager to continuously learn and grow
Job Responsibility:
  • Supporting and guiding our ClickHouse users, customers, and prospects via cases, chat, Slack, community, and meetings
  • Develop solutions based on ClickHouse Cloud and ClickHouse open-source that can be shared with our users, community, and customers via documentation, knowledge base, blogs, meetups, webinars, and training
  • Work closely with our global Support Services, Engineering, Go to Market, and Product Management teams to help define functionality required by users and customers
  • Assist with mentoring, training, and sharing your knowledge with colleagues, users, and customers
  • You will deliver excellent customer service as a first-line technical engineer and representative of ClickHouse. Our Support Engineers provide professional response, on-call coverage, and guidance within the required Service Level Agreements ("SLAs") on technical cases that are opened via a ticketing system, email, Slack, chat, and/or phone
  • You will build strong, trusted relationships with colleagues, customers, and partners
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Employment Type:
Full-time

Technical Customer Support Engineer - APJ

We are currently growing our support team at ClickHouse who provides excellent s...
Location:
Australia
Salary:
Not provided
ClickHouse
Expiration Date:
Until further notice
Requirements:
  • Technical breadth and depth in ClickHouse open-source or ClickHouse Cloud, or in domains relevant to ClickHouse, such as: SQL databases, OLAP, cloud-native SaaS, distributed systems
  • Previous technical experience in roles such as Support Engineer, Consultant, Database Administrator, Site Reliability Engineer, Solutions Engineer, Software Engineer, and/or Systems Engineer
  • Be present and available according to the scheduling required to deliver high-quality 24x7 Support in a global, distributed environment
  • Strong written and verbal English and German communication skills and the ability to work fully remote with reliable connectivity
  • A mindset of teamwork, global engagement, empathy, and solving challenging problems
  • A sense of adventure and urgency in building the most scalable, high-performing, largest, and fastest databases on the planet
  • The ability to build trusted relationships with colleagues, customers, and partners
  • You are self-driven, curious, and eager to continuously learn and grow
Job Responsibility:
  • Supporting and guiding our ClickHouse users, customers, and prospects via cases, chat, Slack, community, and meetings
  • Develop solutions based on ClickHouse Cloud and ClickHouse open-source that can be shared with our users, community, and customers via documentation, knowledge base, blogs, meetups, webinars, and training
  • Work closely with our global Support Services, Engineering, Go to Market, and Product Management teams to help define functionality required by users and customers
  • Assist with mentoring, training, and sharing your knowledge with colleagues, users, and customers
  • You will deliver excellent customer service as a first-line technical engineer and representative of ClickHouse. Our Support Engineers provide professional response, on-call coverage, and guidance within the required Service Level Agreements ("SLAs") on technical cases that are opened via a ticketing system, email, Slack, chat, and/or phone
  • You will build strong, trusted relationships with colleagues, customers, and partners
What we offer:
  • Flexible work environment - ClickHouse is a globally distributed company and remote-friendly. We currently operate in 20 countries
  • Healthcare - Employer contributions towards your healthcare
  • Equity in the company - Every new team member who joins our company receives stock options
  • Time off - Flexible time off in the US, generous entitlement in other countries
  • A $500 Home office setup if you’re a remote employee
  • Global Gatherings – We believe in the power of in-person connection and offer opportunities to engage with colleagues at company-wide offsites
Employment Type:
Full-time

Site Reliability Engineer

As a Site Reliability Engineer (SRE), you will be a key player in ensuring our p...
Location:
Portugal, Lisboa
Salary:
Not provided
Tekever
Expiration Date:
Until further notice
Requirements:
  • Bachelor’s degree in Computer Science, Information Technology, Engineering, or a related field
  • 3+ years of experience in Site Reliability Engineering, DevOps, or a related software/systems engineering role
  • Proficiency in one or more programming languages such as Python, Go, or Bash for automation and tooling
  • Deep understanding of Linux/Unix operating systems and networking fundamentals (TCP/IP, DNS, HTTP, load balancing)
  • Experience with cloud platforms such as AWS, Azure, or Google Cloud, with a focus on Google Cloud
  • Strong knowledge of CI/CD tools like Jenkins, GitLab CI, or CircleCI
  • Strong hands-on experience operating Kubernetes in production, including troubleshooting of networking, storage, scheduling, autoscaling, and stateful workloads
  • Experience with Infrastructure as Code (IaC) tools such as Terraform and Ansible
  • Understanding of version control systems (e.g., Git) and of CI/CD principles and tools (e.g., GitLab CI, Jenkins)
  • Knowledge of monitoring, logging and tracing tools (e.g., Prometheus, Grafana, ELK stack)
Job Responsibility:
  • Design, build, and maintain highly available, scalable infrastructure for distributed and stateful workloads, supporting real-time data ingestion, AI inference pipelines, and hybrid cloud/edge deployment
  • Automate repetitive manual tasks, infrastructure provisioning, and operational workflows to reduce toil and improve system efficiency
  • Implement and manage robust monitoring, logging, and alerting solutions to proactively detect and address issues
  • Define and track Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
  • Participate in an on-call rotation to respond to production incidents
  • Lead blameless post-mortem analyses for incidents in complex distributed systems, identifying root causes, systemic weaknesses, and implementing long-term preventative measures
  • Manage and provision cloud and on-premise infrastructure using IaC principles and tools like Terraform and Ansible
  • Conduct performance analysis, system tuning, and capacity planning to ensure our services meet performance and cost-efficiency goals
  • Develop, test, and maintain disaster recovery plans and business continuity strategies to ensure service resilience
  • Work closely with software development teams to consult on system design, platform choices, and reliability best practices for new features and services
What we offer:
  • An excellent work environment and an opportunity to create a real impact in the world
  • A truly high-tech, state-of-the-art engineering company with flat structure and no politics
  • Working with the very latest technologies in Data & AI, including Edge AI, Swarming - both within our software platforms and within our embedded on-board systems
  • Flexible work arrangements
  • Professional development opportunities
  • Collaborative and inclusive work environment
  • Salary compatible with the level of proven experience
Employment Type:
Full-time

Staff+ Product Security Engineer

Verkada is transforming how organizations protect their people and places with a...
Location:
United States, San Mateo
Salary:
200000.00 - 300000.00 USD / Year
Verkada
Expiration Date:
Until further notice
Requirements:
  • Bachelor of Science in Computer Science degree or equivalent
  • Strong experience with AWS, GCP or other cloud service provider
  • 7 - 10+ years of experience as a security engineer, software engineer, site reliability engineer, or security consultant
  • Understanding of security weaknesses, exploits, attacks and mitigations
  • Experience and enthusiasm for learning about new security products, features, and strategies
  • Coding ability
  • Excellent collaborative skills
  • Outstanding written and verbal communication
  • Experience with most of the following: Security Development Lifecycle, Threat Modeling, Architecture Analysis, Technical Design Review, Security Code Review, Open Policy Agent, SIEM
Job Responsibility:
  • Help bake security into our applications throughout the software development lifecycle
  • Evangelize software security best practices through training and information sharing
  • Partner closely with engineering and product teams to improve the security of Verkada’s products and exceed customers’ expectations
  • Explore innovative solutions to enable Verkada business instead of “Security says No”
  • Collaborate with other engineering leaders to define, communicate, and execute on goals, priorities and process
  • Set up security tooling and secure defaults to ensure software security best practices
  • Perform architecture analysis, threat modeling and technical design reviews of sensitive features and infrastructure
  • Create and operate a bug bounty program
  • Triage and recommend solutions for security bugs from tools, third party assessments and bug bounties
  • Collaborate with the CISO and security team to grow the broader Verkada security program
What we offer:
  • Healthcare programs that can be tailored to meet the personal health and financial well-being needs - Premiums are 100% covered for the employee under at least one plan and 80% for family premiums under all plans
  • Nationwide medical, vision and dental coverage
  • Health Saving Account (HSA) with annual employer contributions and Flexible Spending Account (FSA) with tax saving options
  • Expanded mental health support
  • Paid parental leave policy & fertility benefits
  • Time off to relax and recharge through our paid holidays, firmwide extended holidays, flexible PTO and personal sick time
  • Professional development stipend
  • Fertility Stipend
  • Wellness/fitness benefits
  • Healthy lunches provided daily
Employment Type:
Full-time