The Observability Sr. Systems Engineer role will define, implement, govern, optimize, and monitor solutions to enhance Marriott's observability platform. The role will collaborate with architects from engineering, application, and enterprise/solution teams to develop, implement, and support logging, monitoring, reporting and automation for infrastructure and application services. This role serves as a subject-matter expert performing research, analysis, design, creation, and implementation of observability systems/solutions to meet current and future requirements across the enterprise.
Job Responsibility
Build and maintain the DTT SOX Compliance solution in Dynatrace and GitHub
Design, implement, and maintain high-performance and scalable observability solutions for Kubernetes – EKS/ACK, DocumentDB, EC2 and other data sources in a complex enterprise environment
Collaborate with cross-functional teams to gather requirements, architect solutions, and deploy logging and monitoring solutions that align with business needs (incl. DTT SOX Compliance)
Leverage in-depth knowledge of AWS, Azure and Alibaba Cloud technologies, including IaaS, PaaS, and SaaS, to architect and manage deployments of logging and monitoring tools
Enable streamlined operational processes and efficient management of the Dynatrace infrastructure using scripting and automation
Responsible for infrastructure-as-code development and configuration management
Lead optimization efforts for the observability platform and explore alternative solutions using other automation technologies such as Cribl
Onboard data sources from various IT infrastructure and application components into observability tools (Dynatrace/Grail, Cribl)
Provide technical leadership, oversight, governance and direction for services related to Marriott solution delivery
Determine customer requirements and work with sourced resources to develop solutions
Provide and present status, analysis and reporting to internal stakeholders, Senior Leadership and Executive Management
Lead analysis of the current environment for deficiencies and provide solutions
Identify opportunities to enhance the service delivery, operations and continual service improvement processes
Creates and enhances administrative, operational and technical policies and procedures, adopting best practice guidelines, standards and procedures for employees, contractors and vendor engagements
Manages daily infrastructure operations to ensure the availability SLA is met for storage services
Interfaces with stakeholders to establish requirements and formulate priorities for infrastructure projects
Leads/assists in configuration management
Works in a concerted effort with application development and engineering teams to resolve complex issues
Provides oversight, collaboration, provisioning, management and maintenance of technology products and service alternatives that improve the production services environment
Responsible for the establishment and continuous development of monitoring and alerting for all production environments
Develops internal processes and training to ensure team members have the skills needed and tools to support the production environment and deliver on project commitments
Provides consultation for routine and complex systems development
Facilitates achievement of expected deliverables and obligations of Services Providers
Coordinates with Product and Architecture & Development teams for deployment and production support activities
Manages and implements work and projects as assigned
Generates and provides accurate and timely results in the form of reports, presentations, etc.
Analyzes information and evaluates results to choose the best solution and solve problems
Provides timely, accurate, and detailed status reports as requested
Understands and meets the needs of key stakeholders
Develops specific goals and plans to prioritize, organize, and accomplish work
Determines priorities, schedules, plans and necessary resources to ensure completion of any projects on schedule
Collaborates with internal partners and stakeholders to support business/initiative strategies
Communicates concepts in a clear and persuasive manner that is easy to understand
Demonstrates an understanding of business priorities
Manages time effectively and conducts activities in an organized manner
Presents ideas, expectations and information in a concise, organized manner
Performs other reasonable duties as assigned by manager
Requirements
Undergraduate degree in engineering or computer science discipline and/or equivalent experience/certification
7+ years’ experience in information technology with hands-on technical/engineering roles, including:
5+ years’ experience using at least one of the following: JavaScript, TypeScript
5+ years' experience developing applications in AWS
5+ years' experience developing and supporting Java applications
3+ years' experience using application deployment tools, including at least two of the following: Git, Harness, Terraform and NPM (Node Package Manager)
Experience using Dynatrace Query Language (DQL) and/or Splunk Processing Language (SPL) to build dashboards, reports and alerts to meet customer requirements
Experience in integrating observability tools with other ITOps solutions (ServiceNow, BigPanda, ReadyAPI, etc.)
Nice to have
Dynatrace, Splunk, Cribl, HashiCorp or other application certifications
Strong scripting experience in at least one of the following: PowerShell, Python
Strong knowledge of emerging tools, software, applications, and AI solutions for attaining best-in-class IT technology across the enterprise
Experience in building scalable pipelines for collecting, processing, and analyzing metrics, logs, and traces
Experience in establishing and implementing Observability best practices to standardize, monitor and control usage/performance of solutions
Excellent verbal and written communication skills for a wide range of audiences including executives, business stakeholders and IT teams
Demonstrated experience delivering technology solutions in a fast-paced, deadline-driven enterprise environment
Excellent problem-solving skills
Ability to work independently and as part of a cross-functional team
Excellent understanding of change management, testing requirements, techniques, and tools to ensure quality and high availability of systems
Strong attention to detail with ability to operate effectively across multiple priorities