Senior Staff Site Reliability Engineer Job at Palo Alto Networks (Tel Aviv)

FX Applications Support Senior Analyst

This hybrid role involves working as part of the FX Applications Support team to...

Location

Australia , Sydney

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5-8 years experience in an Application Support role
experience installing, configuring or supporting business applications
experience with some programming languages and willingness/ability to learn
advanced execution capabilities and ability to adjust quickly to changes and re-prioritization
effective written and verbal communications including ability to explain technical issues in simple terms that non-IT staff can understand
demonstrated analytical skills
issue tracking and reporting using tools
knowledge/experience of problem management tools
good all-round technical skills
ability to effectively share information with other support team members and with other technology teams

Job Responsibility

provides technical and business support for users of Citi applications
maintains application systems running in daily operations
manages, maintains and supports applications and their environments
performs start-of-day checks, continuous monitoring, and regional handovers
performs same day risk reconciliations
develops and maintains technical support documentation
assesses risk and impact and escalates in a timely manner
ensures storage and archiving procedures are functioning correctly
participates in application releases, from development to post-implementation analysis
identifies risks, vulnerabilities and security issues

What we offer

rewarding work
supportive environment
clear opportunities for progression
exciting company benefits

Fulltime

FX Applications Support Senior Analyst

As an FX Application Support Analyst, you will play a key role in running and ma...

Location

Australia , Sydney

Salary:

Not provided

Citi

Expiration Date

Until further notice

Requirements

5-8 years’ experience in an Application Support role
experience installing, configuring or supporting business applications
experience with some programming languages and willingness/ability to learn
advanced execution capabilities and ability to adjust quickly to changes and re-prioritization
effective written and verbal communications including ability to explain technical issues in simple terms that non-IT staff can understand
demonstrated analytical skills
issue tracking and reporting using tools
knowledge/experience of problem management tools
good all-round technical skills
ability to effectively share information with other support team members and with other technology teams

Job Responsibility

provides technical and business support for users of Citi Applications
maintains application systems that have completed development stage and are running in daily operations
manages, maintains and supports applications and their operating environments, focusing on stability, quality and functionality
start of day checks, continuous monitoring, and regional handover
perform same day risk reconciliations
develop and maintain technical support documentation
identifies ways to maximize potential of applications used
assess risk and impact of production issues and escalate to business and technology management
ensures storage and archiving procedures are in place and functioning correctly
formulates and defines scope and objectives for complex application enhancements and problem resolution

What we offer

rewarding work in a supportive environment
clear opportunities for progression
exciting company benefits
diverse team of professionals
global network of people, data and relationships

Fulltime

Staff Site Reliability Engineer

Our Site Reliability Engineering team is growing, and we are looking for a highl...

Location

Finland , Helsinki

Salary:

Not provided

AlphaSense

Expiration Date

Until further notice

Requirements

8+ years of experience in Site Reliability Engineering, DevOps, or a similar role
at least 3+ of those years operating in a Senior+ SRE position
Strong background in running production SaaS systems at scale
Proficiency in at least one programming/scripting language (Python, Go, or similar)
Hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes
Deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing)
Experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK)
Familiarity with advanced observability (OTEL, continuous profiling)
Proven incident management experience, including leading high-severity incidents and postmortems
Strong troubleshooting skills across the full stack

Job Responsibility

Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services
Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention
Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards
Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements
Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively
Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing

Staff Site Reliability Engineer

Our Site Reliability Engineering team is growing, and we are looking for a highl...

Location

India , Bengaluru

Salary:

Not provided

AlphaSense

Expiration Date

Until further notice

Requirements

8+ years of experience in Site Reliability Engineering, DevOps, or a similar role
At least 3+ of those years operating in a Senior+ SRE position
Strong background in running production SaaS systems at scale
Proficiency in at least one programming/scripting language (Python, Go, or similar)
Hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes
Deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing)
Experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK)
Familiarity with advanced observability (OTEL, continuous profiling)
Proven incident management experience, including leading high-severity incidents and postmortems
Strong troubleshooting skills across the full stack

Job Responsibility

Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a “You Build It, You Run It” culture
Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention
Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards
Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements
Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively
Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing

Staff Site Reliability Engineer

Our Site Reliability Engineering team is growing, and we are looking for a highl...

Location

India , Delhi

Salary:

Not provided

AlphaSense

Expiration Date

Until further notice

Requirements

8+ years of experience in Site Reliability Engineering, DevOps, or a similar role
at least 3+ of those years operating in a Senior+ SRE position
strong background in running production SaaS systems at scale
proficiency in at least one programming/scripting language (Python, Go, or similar)
hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes
deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing)
experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK)
familiarity with advanced observability (OTEL, continuous profiling)
proven incident management experience, including leading high-severity incidents and postmortems
strong troubleshooting skills across the full stack

Job Responsibility

Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a “You Build It, You Run It” culture
Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention
Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards
Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements
Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively
Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing

Staff Site Reliability Engineer

Our Site Reliability Engineering team is growing, and we are looking for a highl...

Location

India , Pune

Salary:

Not provided

AlphaSense

Expiration Date

Until further notice

Requirements

8+ years of experience in Site Reliability Engineering, DevOps, or a similar role
at least 3+ of those years operating in a Senior+ SRE position
strong background in running production SaaS systems at scale
proficiency in at least one programming/scripting language (Python, Go, or similar)
hands-on expertise with cloud platforms (AWS, GCP, or Azure) and Kubernetes
deep understanding of networking fundamentals (TCP/IP, DNS, HTTP/S, load balancing)
experience with monitoring & alerting (Prometheus, Grafana, Datadog, ELK)
familiarity with advanced observability (OTEL, continuous profiling)
proven incident management experience, including leading high-severity incidents and postmortems
strong troubleshooting skills across the full stack

Job Responsibility

Architect Reliability Paved Paths: Build frameworks and self-service tooling that let teams own the reliability of their services in a “You Build It, You Run It” culture
Lead AI-Driven Reliability: Drive our AIOps strategy — automating diagnostics, remediation, and proactive failure prevention
Champion Reliability Culture: Embed SRE practices across engineering via design reviews, production readiness, and operational standards
Incident Leadership: Act as Incident Commander during critical events, modeling operational excellence, and ensuring blameless postmortems lead to lasting improvements
Advance Observability: Deliver end-to-end monitoring, tracing, and profiling (Prometheus, Grafana, OTEL, Continuous Profiling) to optimize performance proactively
Mentor & Multiply: Elevate engineers across SRE and product teams through mentorship, technical guidance, and knowledge sharing

Staff Site Reliability Engineer

As a Staff Site Reliability Engineer, you will be a technical leader and strateg...

Location

Singapore; Australia , Singapore; Melbourne

Salary:

Not provided

Airwallex

Expiration Date

Until further notice

Requirements

10+ years of experience in SRE, DevOps, or infrastructure engineering roles, with progressive responsibility
Proven ability to lead SRE strategy and execution for large-scale, complex, cross-functional projects
Deep expertise with cloud platforms (AWS/GCP), Kubernetes, container orchestration, observability, and incident response frameworks
Strong experience supporting production systems with stringent high availability, compliance, and security requirements
Demonstrated leadership in mentoring and growing technical teams
Excellent collaboration and communication skills, able to influence stakeholders at all levels
Degree in Computer Science or related field

Job Responsibility

Drive the strategic vision and roadmap for Site Reliability Engineering at Airwallex, aligned with business objectives and product goals
Architect and oversee the implementation of highly scalable, secure, and resilient cloud infrastructure for new services and platform-wide initiatives
Lead and mentor senior engineers and cross-functional teams in reliability engineering best practices, automation, and incident management
Champion and evolve operational excellence through advanced observability, SLO management, runbooks, and proactive risk mitigation
Lead incident response for high-severity incidents, facilitating post-mortems and driving continuous improvements
Collaborate closely with Product, Engineering, Security, and DevOps leadership to ensure compliance, resilience, and alignment across functions
Influence and shape engineering culture around reliability, scalability, and DevOps principles across multiple teams
Advocate for innovation in tooling, automation, and infrastructure to improve developer productivity and service uptime

Fulltime

New