This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
LogicMonitor® is the AI-first hybrid observability platform powering the next generation of digital infrastructure. LogicMonitor delivers complete visibility and actionable intelligence across on-premises, cloud, and edge environments. By anticipating issues before they strike, optimizing resources in real time, and enabling faster, smarter decisions, LogicMonitor helps IT and business leaders protect margins, accelerate innovation, and deliver exceptional digital experiences without compromise. We are seeking a talented and experienced Site Reliability Engineer (SRE) to help ensure the uptime and reliability of our mission-critical systems. In this high-impact role, you’ll automate and streamline operational tasks, continuously looking for ways to improve performance, efficiency, and scalability. You’ll work closely with developers to provide infrastructure-focused feedback that enhances product performance within the LM environment. This is a unique opportunity to sharpen your SRE skill set and become an invaluable member for the core LM Operations team.
Job Responsibility:
Maintain uptime of LogicMonitor’s SaaS-based platform and implement technical and process improvements to enhance system reliability
Ensure the security and stability of the production environment through proactive monitoring and risk mitigation strategies
Design, deploy, and manage scalable infrastructure and system integrations to support business growth and technical innovation
Write code to automate infrastructure maintenance, deployments, and routine operational tasks to increase efficiency and reduce manual effort
Partner closely with development teams to support and influence operational architecture and design changes
Lead cross-functional, technically complex projects, driving execution and alignment across teams
Act as a strategic technical resource across the organization, developing and delivering presentations for internal teams, customers, and external conferences
Mentor junior team members, fostering growth, knowledge sharing, and operational excellence
Set a high standard for documentation and runbook quality, leading by example to promote clarity, consistency, and operational readiness
Requirements:
3+ years of experience in a Linux engineering role, preferably in a SaaS-based company
Solid understanding of Linux system administration in distributed environments
Experience with configuration management tools such as Chef, Puppet, or Ansible
Experience with virtualization and container technologies (e.g., Docker, Kubernetes)