This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
The Senior Network Observability Engineer, Network Reliability Engineering (NRE) is the subject matter expert in designing and implementing the Network Observability strategy and platforms for the next-gen operations and engineering for all Marriott International (MI) networks including the Property Networks, Datacenter, Corporate and Client Networks and multi-cloud environments into a proactive, telemetry-driven ecosystem. This role will work closely and collaboratively with a matrix team of expert network architects and engineers to drive adoption of SRE practices and operating models across all network product towers, and around globe.
Job Responsibility:
Designing and implementing the Network Observability strategy and platforms for the next-gen operations and engineering for all Marriott International (MI) networks including the Property Networks, Datacenter, Corporate and Client Networks and multi-cloud environments into a proactive, telemetry-driven ecosystem
Work closely and collaboratively with a matrix team of expert network architects and engineers to drive adoption of SRE practices and operating models across all network product towers, and around globe
Architect solutions that leverage AI/ML-driven insights, real-time telemetry, and automation frameworks to predict, prevent, and resolve network issues before they impact business operations
Collaborate with network architects, product owners, and global operations teams to define and enforce observability standards, build automation pipelines, and deliver actionable intelligence across thousands of properties worldwide
Overcome the limitations of traditional monitoring and implement granular instrumentation, distributed tracing, and anomaly detection at scale
Requirements:
Bachelor’s degree in computer science, Network Engineering, or related discipline
advanced certifications (CCNP, AWS and Azure Networking Specialty) strongly preferred
8+ years of progressive experience in network observability, telemetry engineering, and performance optimization for large-scale, mission-critical environments
Proven expertise in collecting, processing, and correlating telemetry data (NetFlow, IPFIX, SNMP, streaming telemetry) to enable predictive analytics and proactive incident prevention
Hands-on experience with enterprise-grade observability, Saas and Security platforms, including Selector.ai, NetScout, NetBrain, ThousandEyes, BigPanda, and other AI/ML-driven monitoring solutions
Demonstrated ability to install, configure, and optimize observability tools, integrate APIs, and build automation workflows for anomaly detection and remediation
Strong proficiency in administration of network tools and policy enforcement, including role-based access control and compliance frameworks
Expertise in developing observability requirements, architecture designs, and implementation roadmaps, ensuring alignment with SRE principles and Agile delivery models
Deep understanding of foundational networking protocols and technologies (ARP, TCP/IP, UDP, DHCP, DNS, NAT) and advanced routing protocols (OSPF, BGP)
Hands on experience with Palo, Prisma, and SDWAN Strata Cloud Manager, Including routing and switching platforms (Cisco, Juniper, HP/Aruba)
Demonstrated experience in delivering written documents, including detailed network solutions and architecture diagrams
Experience with one or more Cloud Computing platforms (Amazon AWS, Microsoft Azure, Google Cloud Platform)
Experience in Agile and DevOps practices, including sprint planning, backlog grooming, and embedding observability into CI/CD pipelines
Ability to design custom dashboards, KPIs, and alerting strategies for real-time visibility and executive reporting
Nice to have:
Advanced Degree (MS, PhD) in Computer Science, Network Engineering, or MBA with a technology focus
Experience managing network observability tools in hospitality or global enterprise environments
Proficiency in leveraging public APIs for automation and integration with observability platforms
Strong ability to collaborate across cross-functional teams in multiple time zones, driving alignment and execution
Demonstrated experience in researching emerging technologies, standards, and trends and translating them into actionable roadmaps
Deep knowledge of next-generation observability tools and frameworks, including Selector.ai, NetScout, NetBrain, ThousandEyes, and AI Ops platforms
Proven ability to design and implement automation for network instrumentation and monitoring, using scripting languages (Python, REST APIs)
Excellent problem-solving skills, capable of working independently and leading outcomes for distributed teams
Strong understanding of change management, testing methodologies, and high-availability strategies for critical platforms
Ability to manage multiple priorities effectively, with exceptional attention to detail
Track record of driving transformation in network technologies and observability practices through data-driven continuous improvement
Experience improving reliability, performance, and agility of complex enterprise networks
Expertise in network infrastructure automation, instrumentation, and emerging observability technologies
Strong influencing and leadership skills to overcome barriers and drive organizational change
Exceptional verbal and written communication skills, including executive-level presentations and technical documentation