CrawlJobs Logo

Sr. Software Engineer, Observability

India, Bengaluru · Job Posted December 29, 2025
Apply Position
Job Link Share

Job Description

As a Sr. Software Engineer in Observability, you’ll be responsible for our metrics and log collection platform. You’ll work closely with other Infrastructure engineers to determine resource usage and requirements. You’ll also help create tooling, libraries, and documentation that enable other engineers to instrument their own projects. In addition, you’ll keep our team aware of trends in the larger observability/monitoring industry.

Job Responsibility

  • Develop and improve instrumentation for monitoring and logging the health and availability of services
  • Develop and maintain the observability stack within Dialpad engineering
  • Define best practices and standards around making systems and services measurable and work with various teams to get those best practices applied
  • Create tools and libraries for other engineering teams to enable them to build self-monitoring capabilities
  • Create and own internal documentation used by the other engineering teams
  • Stay up-to-date with the latest trends in observability, logging, monitoring, and cloud technologies
  • Collaborate with different engineering teams to integrate observability practices into their workflows
  • Participate in a rotating on-call within the larger Infrastructure Engineering division.

Requirements

  • Background in both Systems and/or Software Engineering
  • Experience in designing, automating, maintaining, and optimizing observability platforms (logging, metrics, and tracing)
  • Experience with configuration management tools such as Ansible, Terraform, etc.
  • Experience with Public Cloud environments such as GCP, AWS, etc.
  • Familiarity with languages such as Python, Go, Rust, etc.

Nice to have

  • Previous direct experience with Grafana, Loki, Prometheus
  • Experience with Linux
  • Experience with Kubernetes (including GKE/EKS) and building containerized applications
  • Undergraduate degree in Computer Science or Engineering.

What we offer

  • Competitive benefits and perks
  • Robust training program
  • Inclusive office environment
  • Recognized Great Place to Work culture.

Looking for more opportunities?

Search for other job offers that match your skills and interests.

Similar Jobs for

Sr. Software Engineer, Observability

8 matching positions

Sr. Software Engineer - QA / Test Automation Engineer

Location
Location
India , Gurgaon
Salary
Salary:
Not provided
https://www.randstad.com Logo
Randstad
Expiration Date
July 09, 2026
Flip Icon
Requirements
Requirements
  • 8+ years of experience in QA automation, SDET, or software engineering roles focused on test automation for distributed or cloud-based systems
  • Strong understanding of QA methodologies, test design, and systems validation
  • Proficiency in .NET 8/C#, Node.js, Python, or TypeScript for automation scripting
  • Hands-on experience with Selenium, Playwright, Cypress, REST API automation, and integration testing frameworks
  • Experience running tests in AWS environments with strong understanding of CI/CD pipelines using Azure DevOps
  • Familiarity with IaC, containerized test execution, and observability tools
  • Experience testing SQL Server 2022, Snowflake, PostgreSQL data flows
  • Ability to validate ETL pipelines, schema changes, and data quality through automation
  • Expertise in automated testing (unit, integration, contract, E2E, regression)
  • Familiarity with blue/green and canary release testing
Job Responsibility
Job Responsibility
  • Contribute to the design of scalable, maintainable QA automation frameworks for API, UI, integration, and performance testing
  • Implement automated test scenarios across microservices, APIs, data workflows, and distributed systems
  • Participate in design discussions to ensure testability, document risks, and propose automation strategies aligned with engineering standards
  • Produce clean, reusable, and maintainable automation scripts following best practices
  • Implement unit, integration, contract, and E2E tests integrated with CI/CD pipelines
  • Conduct root-cause analysis for defects and drive preventive quality improvements
  • Perform debugging, reliability analysis, and optimization of automation suites
  • Own test execution pipelines from development through deployment and monitoring
  • Create automated dashboards, alerts, and quality signals to validate release readiness
  • Collaborate in production issue investigations by building automated repros and validation scripts
  • Fulltime
Read More
Arrow Right
New

Sr. Software Engineer

We are looking for a Sr. Software Engineer to help shape modern cloud platforms ...
Location
Location
United States , Jacksonville
Salary
Salary:
Not provided
https://www.roberthalf.com Logo
Robert Half
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 10+ years of progressive experience in software engineering, including ownership of complex technical initiatives
  • Strong expertise in C#, .NET, ASP.NET, JavaScript, and React.js for building modern full-stack applications
  • Advanced experience designing cloud-based and serverless solutions, including event-driven architectures and distributed systems
  • Hands-on knowledge of CI/CD practices, infrastructure as code, Git-based workflows, and modern engineering delivery standards
  • Solid understanding of web application performance, browser behavior, and front-end architecture considerations
  • Experience working with relational databases, data pipelines, or large-scale data platforms
  • Knowledge of security concepts such as OAuth2, OpenID Connect, cryptography, and secure software design
  • Proven ability to work independently in ambiguous environments while influencing stakeholders and guiding technical direction
Job Responsibility
Job Responsibility
  • Design and deliver large-scale serverless applications using cloud-native services, event-driven patterns, and distributed system principles
  • Define technical architecture for new platforms, reusable services, and core components that support long-term scalability and operational stability
  • Build and improve infrastructure automation, deployment pipelines, and release processes to enable efficient and dependable software delivery
  • Act as a senior technical authority for cloud and serverless engineering, providing guidance on architecture decisions and implementation approaches
  • Lead design reviews, resolve complex production issues, and drive improvements in reliability, observability, and operational performance
  • Collaborate with product, security, DevOps, and business partners to align engineering solutions with strategic objectives and risk controls
  • Establish testing approaches that strengthen quality across automated, integration, and end-to-end validation efforts
  • Mentor engineers and technical leaders by sharing best practices, shaping engineering patterns, and supporting career development
  • Promote secure development, system hardening, and proactive mitigation of technical and operational risks
  • Influence roadmaps and technical strategy through clear communication, sound engineering judgment, and data-informed recommendations
What we offer
What we offer
  • Medical, vision, dental, and life and disability insurance
  • 401(k) plan
  • Fulltime
Read More
Arrow Right
New

Sr Software Engineer

Commercial Engineering & AI (CEAI) partners closely with stakeholders to acceler...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • BS in CS or equivalent + 5+ years of software engineering (or MS + 3 / PhD + 1).
  • Experience with at least one of C#, TypeScript, Python
  • comfortable across all three.
  • Demonstrated ability to own and ship significant features or architectural components end to end.
  • 1+ year shipping LLM-based or agent-based systems in production, including hands-on experience with evals, observability, and debugging.
  • Production experience with one or more major agent stacks such as Microsoft 365 Agents SDK, AutoGen, Magentic-One, LangGraph, OpenAI Agents SDK, Anthropic SDK with MCP, or Semantic Kernel.
  • Collaboration across teams: you can align with partners and move work forward together.
  • Proficiency in AI-native development working within Agent Harnesses (GitHub Copilot CLI, Coding Agents), authoring Markdown specs/ADRs and YAML configs as Agent-consumable inputs, orchestrating multi-step Agentic workflows across the SDLC, and reviewing Agent-generated code and PRs with production-grade rigor.
  • Experience shipping quickly with agentic tools.
Job Responsibility
Job Responsibility
  • Design and build major components of our agentic sales platform: orchestration, tools and skills, grounding, evals and observability, and model routing.
  • Own components end to end, from prototype to production, including the harder judgment calls within your area.
  • Partner with AI Foundry, Microsoft Research, Substrate, and the Copilot organization to use shared primitives like agent SDKs, eval harnesses, content safety, and telemetry.
  • Contribute to the eval and Responsible AI bar for shipping agents in the Sales surface, with a focus on production-grade quality.
  • Help raise the agent-engineering bar through code review, design review, and mentoring peers.
  • Bring strong agentic patterns into the team's work and share what you learn.
What we offer
What we offer
  • Certain roles may be eligible for benefits and other compensation.
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer I

We are seeking a highly skilled and motivated Java Technical Leader to lead and ...
Location
Location
Viet Nam , Ho Chi Minh
Salary
Salary:
Not provided
yum.com Logo
Yum!
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Strong expertise in Java, including OOP, multithreading, concurrency, collections, and performance tuning
  • Hands-on experience with Spring / Spring Boot, Spring Data, Dependency Injection (DI), and transaction management
  • Solid experience designing and building RESTful APIs and distributed microservices architectures
  • Experience with SQL databases (e.g., PostgreSQL) and NoSQL solutions (e.g., MongoDB, DynamoDB)
  • Strong understanding of data modeling, query optimization, and backend performance tuning
  • Hands-on experience with AWS (e.g., EKS, S3, RDS, Lambda)
  • Experience with CI/CD pipelines and modern DevOps practices
  • Familiarity with monitoring and logging tools such as DataDog for metrics, logs, and traces
  • Proficient with GitLab for version control and collaboration
  • Experience working in Agile / Scrum environments
Job Responsibility
Job Responsibility
  • Lead, mentor, and coach a team of Software Engineers, supporting their technical growth and career development
  • Act as a hands-on technical leader, setting coding standards, architectural principles, and best practices
  • Foster a collaborative, inclusive, and learning-oriented team culture with strong ownership and accountability
  • Guide engineers on effective and responsible use of AI tools to enhance daily engineering work (e.g., design, coding, debugging, testing, documentation)
  • Proactively identify, troubleshoot, and resolve complex performance or scalability issues
  • Work closely with DevOps/SRE teams to improve CI/CD pipelines, deployment reliability, and runtime stability
  • Drive improvements in system observability, monitoring, and alerting
  • Ensure high standards of code quality through design reviews, code reviews, automated testing, and documentation
  • Design, develop, and review robust Java-based systems using Spring ecosystem and microservices architecture
  • Work effectively with cross-functional and international teams (e.g., UK, US)
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer - Vehicle Order Fulfillment

The Vehicle Order Fulfillment team owns and advances a portfolio of business-vit...
Location
Location
United States , Austin
Salary
Salary:
Not provided
gm.com Logo
General Motors
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or Master’s degree in Computer Science, Software Engineering, or related field
  • 7+ years of software engineering experience with a heavy focus on Java
  • 3+ years mentoring peers and driving technical initiatives
  • Hands-on experience with React, Quarkus, Hibernate, REST APIs, microservices, and design patterns
  • Proficiency with CI/CD tools (e.g., GitHub Actions), automated testing, and agile methodologies
  • Deep understanding of software engineering principles and modern system design
  • Strong debugging and optimization skills
  • Excellent verbal and written communication skills with the ability to simplify complex topics
  • Exceptional planning capabilities to translate features into user stories, prioritize them for execution, and effectively communicate the details of these user stories to the team
Job Responsibility
Job Responsibility
  • Develop and effectively communicate a clear technical vision that aligns with the requirements and goals for our products
  • Collaborate with architects in shaping technology decisions and crafting innovative strategies
  • Proactively evaluate new technologies and drive strategic innovation initiatives
  • Lead and implement end-to-end design, development, and delivery of enterprise-grade applications with a focus on React for UI and a Java, Quarkus, and microservices architecture
  • Ensure that all development work adheres to best practices for software craftsmanship, including SOLID principles, TDD, and clean architecture
  • Drive consistency and code reuse through the use of shared libraries, utilities, and component-based development
  • Spearhead the adoption of modern infrastructure and deployment strategies using Quarkus, Docker, and Kubernetes within a RedHat OpenShift environment
  • Create and maintain robust CI/CD pipelines to automate code integration, testing, and deployment using tools like GitHub Actions
  • Ensure the security, reliability, and observability of applications through logging, monitoring, and incident response planning
  • Foster a collaborative, inclusive, and high-performance engineering culture that emphasizes learning, accountability, and innovation
What we offer
What we offer
  • Relocation benefits (may be eligible)
  • Total Rewards resources (benefits overview)
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer -CTJ- Poly

Microsoft has an exciting opportunity for a Senior Software Engineer in the Micr...
Location
Location
United States , Redmond
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Candidates must be able to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: The successful candidate must have an active U.S. Government Top Secret Clearance with access to Sensitive Compartmented Information (SCI) based on a Single Scope Background Investigation (SSBI) with Polygraph. Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. Failure to maintain or obtain the appropriate U.S. Government clearance and/or customer screening requirements may result in employment action up to and including termination
  • This position requires successful verification of the stated security clearance to meet federal government customer requirements. You will be asked to provide clearance verification information prior to an offer of employment
  • This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter
  • This position requires verification of U.S. citizenship due to citizenship-based legal restrictions. Specifically, this position supports United States federal, state, and/or local United States government agency customer and is subject to certain citizenship-based restrictions where required or permitted by applicable law. To meet this legal requirement, citizenship will be verified via a valid passport, or other approved documents, or verified US government Clearance
Job Responsibility
Job Responsibility
  • Acts as a Designated Responsible Individual (DRI) for service components, owning availability, reliability, and operational health
  • Participates in on-call rotations, responding to incidents by assessing impact, troubleshooting issues, mitigating customer impact, and driving resolution
  • Leads or contributes to root cause analysis (RCA) and postmortems, ensuring learnings translate into systemic improvements
  • Uses existing tools and develops new capabilities to troubleshoot issues affecting availability, performance, security, and efficiency
  • Leverages telemetry and monitoring to identify trends, detect anomalies, and proactively improve service health
  • Drives improvements in observability, alerting, and diagnostics
  • Leads and contributes to architecture and design discussions for components of Power Platform services
  • Identifies dependencies across teams and incorporates them into design specifications and execution plans
  • Ensures systems meet performance, scalability, security, and compliance requirements, especially within air‑gapped constraints
  • Develops and improves CI/CD pipelines and deployment systems, enabling safe, repeatable, and automated releases
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer

Roku is changing how the world watches TV. Roku is the #1 TV streaming platform ...
Location
Location
United States , San Jose
Salary
Salary:
244900.00 - 321100.00 USD / Year
roku.com Logo
Roku
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's or master's degree in Computer Science, Computer Engineering, Electrical Engineering, Data Science, or a related technical field
  • 2+ years of experience in software engineering, AI/ML engineering, backend development, or adjacent domains, with strong software engineering fundamentals and the ability to build production-grade systems
  • Strong proficiency in Python, plus experience with C/C++ or another systems language
  • Hands-on experience with LLM-based systems, including prompt design, retrieval, tool use, memory handling, and agent orchestration patterns
  • Experience building and maintaining RAG pipelines, agent frameworks, MCP servers or equivalent function-calling architectures, and conversational interfaces
  • Familiarity with cloud platforms, REST APIs, containerization, and modern deployment environments
  • Experience with observability, evaluation, experimentation, and feedback loops for AI systems in production
  • Ability to work independently, manage ambiguity, move quickly, and deliver incrementally in a fast-paced environment
  • Excellent communication skills, sound engineering judgment, and a collaborative working style
Job Responsibility
Job Responsibility
  • Architect, develop, and deploy AI agents and copilots for Roku TV use cases, integrating them with internal systems, tools, and services
  • Own end-to-end agentic systems from concept to production, including model selection, prompt and context design, retrieval strategies, backend services, and conversational interfaces
  • Design and implement single-agent and multi-agent orchestration patterns, including handoffs, delegation, and cooperative task execution
  • Build scalable RAG and context pipelines that provide high-quality grounding for AI systems and keep them aligned with evolving data sources and business logic
  • Implement tool-calling, function-calling, and MCP-style integrations so agents can safely take actions and interact with the systems around them
  • Create reusable agent templates, modular components, and paved-path patterns that accelerate adoption across teams and use cases
  • Establish strong evaluation, observability, and monitoring for conversation quality, task success rate, latency, cost, and overall system performance
  • Build safeguards that improve production readiness and reliability, including testing pipelines, controlled rollouts, drift detection, and mechanisms that prevent error amplification in multi-step workflows
  • Prototype quickly, run experiments, and translate successful ideas into durable, scalable software solutions
  • Partner closely with engineering, product, QA, infrastructure, and cross-functional teams to deliver meaningful business and customer outcomes
What we offer
What we offer
  • Health insurance
  • equity awards
  • life insurance
  • disability benefits
  • parental leave
  • wellness benefits
  • paid time off
  • global access to mental health and financial wellness support and resources
  • healthcare (medical, dental, and vision)
  • life
  • Fulltime
Read More
Arrow Right

Sr. Software Engineer - Azure Storage

Are you passionate about shaping the future of cloud storage services? Join the ...
Location
Location
United States , Multiple Locations
Salary
Salary:
119800.00 - 234700.00 USD / Year
https://www.microsoft.com/ Logo
Microsoft Corporation
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Job Responsibility
Job Responsibility
  • Collaborates with appropriate stakeholders to determine user requirements for a scenario
  • Drives identification of dependencies and the development of design documents for a product, application, service, or platform
  • Creates, implements, optimizes, debugs, refactors, and reuses code to establish and improve performance and maintainability, effectiveness, and return on investment (ROI)
  • Leverages subject-matter expertise of product features and partners with appropriate stakeholders (e.g., project managers) to drive a workgroup's project plans, release plans, and work items
  • Acts as a Designated Responsible Individual (DRI) and guides other engineers by developing and following the playbook, working on call to monitor system/product/service for degradation, downtime, or interruptions, alerting stakeholders about status and initiates actions to restore system/product/service for simple and complex problems when appropriate
  • Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale
  • Fulltime
Read More
Arrow Right