SRE Observability SME

Canada, Toronto · Job Posted May 05, 2026

Job Description

Our financial client in Toronto is seeking a hands-on SRE Observability SME to provide day-one expertise in improving system reliability, performance, and incident response across complex distributed environments. This is a hybrid, embedded role working closely with engineering teams to drive observability best practices.

Job Responsibility

  • Provide hands-on SRE and observability expertise across applications and infrastructure
  • Implement and optimize monitoring, alerting, and observability frameworks
  • Troubleshoot complex performance and reliability issues using metrics, events, logs, and traces (MELT)
  • Design and build advanced dashboards and visualization solutions
  • Guide teams on SRE best practices and reliability improvements
  • Support incident response, root cause analysis, and remediation
  • Develop creative observability solutions for systems with limited visibility

Requirements

  • Strong hands-on experience with Dynatrace (DQL, dashboards, Grail, ActiveGate, plugins, workflows, BizEvents)
  • Deep expertise in APM and observability tools (Dynatrace or similar)
  • Advanced troubleshooting across distributed, multi-tier environments
  • Strong understanding of SRE principles (Google SRE framework)
  • Experience with AWS observability (CloudWatch, Application Signals, metrics, logs, traces, Lambda, API Gateway)
  • Development experience with Python, AWS Lambda, ECS, Azure Functions
  • Knowledge of OpenTelemetry (OTEL)
  • Experience with AI-based system monitoring concepts
  • Strong dashboard design (UI/UX for observability)

Nice to have

  • Experience monitoring complex systems (e.g., IBM DataPower)
  • Background in financial services or large-scale enterprise environments

Similar Jobs for SRE Observability SME

8 matching positions

New

Applications Support Senior Analyst

Location: India, Pune
Salary: Not provided
Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 6 plus years experience in an Application Support role
  • Practical problem solving and strategic thinking skills
  • Demonstrated leadership, interpersonal skills and relationship building skills
  • Service oriented attitude
  • Ability to work in a fast-paced environment
  • Experience working or leading requirement gathering efforts for multiple large development projects at one-time
  • Proficient using basic technical tools and systems
  • Good interpersonal and communication skills
  • Good all-round technical skills
  • Effectively share information with other support team members and with other technology teams
Job Responsibility
  • Provides technical and business support for users of Citi Applications
  • Partner with multiple technology teams to ensure appropriate integration of functions to meet goals
  • Perform Incident/Outage Management
  • Investigation of incidents reported across a range of applications within our Custody Digital Assets and Settlements applications
  • Engagement with ITIL processes including Major Incident management, problem management, change management etc
  • Operate independently to identify process bottlenecks and proactively drive improvements
  • Experience in performing resiliency activities such as disaster recovery coordination
  • Confidently handle outage communication with stakeholders in Business and Operations
  • Actively participate in defining and implementing application onboarding guidelines and standards
  • Acts as SME to senior stakeholders and/or other team members
  • Fulltime

Solution Architect

BNEW RCE Lab Operations is part of Business Area Networks. We provide lab infras...
Location: India, Bangalore
Salary: Not provided
Ericsson (ericsson.com)
Expiration Date: Until further notice
Requirements
  • Experience designing cloud-native system architectures
  • Hands-on knowledge of Golang, Svelte, Docker/K8s, Cortex, tracing, Postgres/Cassandra/Kafka
  • Experience with CI/CD pipelines and automation
  • Strong security mindset
  • Familiarity with observability tools (e.g., Grafana, Prometheus, Loki) is a plus
  • Ericsson experience is a bonus
  • Proactive problem-solver
  • Comfortable as an internal SME
  • Able to communicate with technical and non-technical audiences
Job Responsibility
  • Provide observability services to R&D and Lab Operations
  • Lead a global team
  • Refine and extend the architecture of COS (Central Observability System)
  • Drive key architectural decisions
  • Define COS architecture principles, standards, and patterns
  • Translate stakeholder needs into target architecture and roadmap
  • Explore how AI can add value
  • Promote observability by design
  • Own cloud-native microservice architecture
  • Enable AI in observability/telemetry
  • Fulltime

Senior Software Engineer

We are seeking a Senior Software Engineer to design, build, and operate high-thr...
Location: United States, Dallas
Salary: Not provided
Aquent (aquent.com)
Expiration Date: Until further notice
Requirements
  • 8+ years of strong Java expertise with experience building production-grade services
  • 8+ years of hands-on API development experience, including RESTful services and API security
  • Cloud deployment experience on GCP and/or equivalent cloud platforms
  • 2+ years of solid experience with access management protocols: OAuth 2.0, OpenID Connect (OIDC), SAML
  • Proven experience building and operating high-criticality systems with strict SLAs
  • Strong understanding of distributed systems, concurrency, performance optimization, and resiliency
  • Experience with observability (metrics, logs, traces) and production troubleshooting
  • Proficiency in building server-side applications (API SME) using C# and .NET technologies
  • Solution design and implementation experience for high-availability, high-throughput, highly scalable applications
  • Good understanding of the latest System Architecture and Development Standards and Guidelines
Job Responsibility
  • Lead solution development and delivery of the organization’s software products to QA, UAT, and Production
  • Manage day-to-day activities and promote Agile software development practices within the team
  • Collaborate with product owners and key stakeholders in Project Management, Business, QA, and Technology Operations to ensure timely and budget-friendly software project delivery
  • Work with Scrum Master and product owner to provide development sizing and cost analysis estimates
  • Collaborate with the product owner and team members in story decomposition, feature design, and task prioritization
  • Utilize automated software testing tools and frameworks, including test-driven development, to meet software quality standards
  • Support Single Sign-On (SSO) integration efforts to connect systems both internally and externally to Schwab
  • Assist the release manager in assembling releases and improving the release process
  • Help resolve needs and roadblocks identified by team members with the Scrum Master
  • Ensure the coordination of individual team deliverables to achieve product releases
  • Fulltime

Devops Senior Programmer - Assistant Vice President

The Applications Development Senior Programmer Analyst is an intermediate level ...
Location: India, Pune
Salary: Not provided
Citi (https://www.citi.com/)
Expiration Date: Until further notice
Requirements
  • 5-8 years of relevant experience
  • Experience in systems analysis and programming of software applications
  • Experience in managing and implementing successful projects
  • Working knowledge of consulting/project management techniques/methods
  • Ability to work under pressure and manage deadlines or unexpected changes in expectations or requirements
  • Bachelor’s degree/University degree or equivalent experience
  • Expert-level container orchestration and cloud-native technologies
  • Advanced infrastructure automation and IaC practices
  • Strong security engineering and DevSecOps implementation
  • Proficient in scripting languages
Job Responsibility
  • Conduct tasks related to feasibility studies, time and cost estimates, IT planning, risk technology, applications development, model development, and establish and implement new or revised applications systems and programs to meet specific business needs or user areas
  • Monitor and control all phases of development process and analysis, design, construction, testing, and implementation as well as provide user and operational support on applications to business users
  • Utilize in-depth specialty knowledge of applications development to analyze complex problems/issues, provide evaluation of business process, system process, and industry standards, and make evaluative judgement
  • Recommend and develop security measures in post implementation analysis of business usage to ensure successful system design and functionality
  • Consult with users/clients and other technology groups on issues, recommend advanced programming solutions, and install and assist customer exposure systems
  • Ensure essential procedures are followed and help define operating standards and processes
  • Serve as advisor or coach to new or lower level analysts
  • Has the ability to operate with a limited level of direct supervision
  • Can exercise independence of judgement and autonomy
  • Acts as SME to senior stakeholders and/or other team members
  • Fulltime

Managed Services Operations Manager

The Managed Services organization ensures the reliability, stability, and operat...
Location: France, Paris
Salary: Not provided
Ledger (https://www.ledger.com)
Expiration Date: Until further notice
Requirements
  • Senior experience in production tech operations, SRE, or platform operations
  • Proven leadership of tech operations teams with accountability for service stability
  • Strong experience with cloud-native environments (AWS, Kubernetes, observability tooling)
  • Hands-on incident management and tech incident leadership experience
  • Strong communication, ownership mindset, and ability to work cross-functionally
Job Responsibility
  • Own and lead the MS Operations function, including people management, priorities, and operational accountability
  • Provide technical leadership on operational architecture, tooling, automation, and incident response
  • Own day-to-day operational stability, operational readiness, and change validation for Ledger services
  • Define and own the on-call operating model
  • Act as operational lead during major incidents
  • Act as the SME for operational observability, defining requirements and validating monitoring and alerting effectiveness
  • Drive continuous improvement, reducing operational debt and improving reliability and efficiency
  • Own and track key operational KPIs (MTTR, incident recurrence, on-call health)
  • Represent Managed Services Operations in cross-team operational and governance discussions
  • Identify, assess, and escalate operational risks (capacity, tooling, skills, process gaps), with clear mitigation proposals to the Managed Services Director
What we offer
  • Flexible work options - Our hybrid policy allows employees to work from home up to 3 times per week
  • Health & Wellness support - Health and Life Insurance
  • Financial growth opportunities - Employees can become shareholders in Ledger as well as other financial benefits depending on your country of work
  • Commuter allowance - Ledger offers a commuter allowance to contribute to your preferred means of transportation
  • Learning & Development - A comprehensive suite of training solutions providing a personalised learning experience for every employee
  • Fulltime
New

IT Training Lead

The IT Training Lead will drive technology learning and user adoption across the...
Location: United States, Delray Beach
Salary: Not provided
Robert Half (https://www.roberthalf.com)
Expiration Date: Until further notice
Requirements
  • Experience in IT training, instructional design, technical enablement, or learning and development
  • Strong knowledge of Microsoft 365
  • Excellent communication, facilitation, and content development skills
  • Ability to translate technical concepts into practical, user-friendly training.
Job Responsibility
  • Design, develop, and deliver IT training programs in instructor-led, virtual, and self-paced formats
  • Take the lead in the Microsoft Copilot and AI training strategy, including onboarding, advanced use cases, responsible AI usage, and ongoing enablement
  • Partner with IT leadership to support new technology rollouts, system upgrades, and digital transformation initiatives
  • Create and maintain training content, including videos, guides, tutorials, and job aids
  • Identify skill gaps and develop targeted learning solutions to improve adoption and productivity
  • Gather feedback and measure training effectiveness to continuously improve programs.
New

K Kitchen Representative

The position includes, but is not limited to, the following essential job duties...
Location: United States, New Albany
Salary: Not provided
Circle K (https://www.circlek.com)
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity
  • Keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan
New

K Kitchen Representative

Location: United States, Decatur
Salary: Not provided
Circle K (https://www.circlek.com)
Expiration Date: Until further notice
Requirements
  • Excellent communication skills
  • Team player who can work well with others or independently
  • Acts with integrity
  • Keeps commitments
  • Contagious positive attitude
  • Focuses on achieving results while having fun
  • Frequently bend, twist at waist, kneel, squat, stand, and walk
  • Occasionally climb and descend ladders
  • Tolerate extreme cold and hot temperatures and work in and around fryers, ovens, grills, coolers, freezers, sharp objects, and loud noises
  • Reach, grasp, and manipulate objects with hands for entire shift, including reaching for objects overhead
Job Responsibility
  • Provides excellent guest service in a fast and friendly manner
  • Maintains a clean restaurant environment by cleaning and performing general housekeeping duties
  • Prepares and serves food items in accordance with all Brand, Company, and health department regulations
  • Ensures product quality, food safety, and operational standards are met
  • Keeps accurate cash, sales, and inventory control records
  • Follows all government laws and safety codes
  • Completes reports on all incidents following our 5-minute rule policy
  • Lives our Company values: One Team, Do the Right Thing, Takes Ownership, Play to Win
What we offer
  • Medical, Dental, Vision, Term Life and AD&D plans
  • Flexible spending and health savings accounts (FT)
  • Vacation paid time off
  • Company holidays paid at time and a half
  • Matching 401(k)
  • Tuition Reimbursement
  • Stock Purchase Plan
  • Employee Discount Program
  • Discount Meal Benefit
  • Wellness Plan