CrawlJobs Logo

Monitoring Engineer / Infrastructure Engineer

outsource-uk.co.uk Logo

Outsource UK

Location Icon

Location:
United Kingdom , Hemel Hempstead

Category Icon

Job Type Icon

Contract Type:
Not provided

Salary Icon

Salary:

Not provided

Job Description:

We are seeking a Monitoring Engineer to lead day-to-day technical operations within a Windows infrastructure environment. Reporting into the Head of Operating Systems, this role combines hands-on technical delivery with leadership responsibilities across a specialist infrastructure team. You will play a key part in shaping monitoring capability, improving operational resilience, and supporting both project delivery and live service within a highly governed environment. The role requires strong expertise in enterprise monitoring tools and infrastructure architecture, with the ability to influence technical direction, support decision making, and ensure operational standards are consistently met.

Job Responsibility:

  • Lead and mentor the infrastructure monitoring team, supporting development of SME capability and operational maturity
  • Own and contribute to solution design, estimation, high and low-level design, and implementation activities under Project Manager guidance
  • Ensure adherence to SLAs, responding, resolving, or escalating issues appropriately within defined thresholds
  • Develop and maintain operational and end-user documentation, ensuring consistency and compliance with standards
  • Support pre-sales and solution scoping activities where required
  • Work closely with Architects and Solution Designers to assess options and provide technical recommendations
  • Accurately estimate effort, cost, and delivery timelines for implementation tasks
  • Ensure all team activity is fully documented in line with governance and operational standards
  • Provide regular progress updates to Project Management to support delivery tracking and planning

Requirements:

  • Strong enterprise infrastructure background with extensive operational experience
  • Proven experience leading infrastructure or technical teams within structured delivery environments
  • Deep technical expertise in monitoring and infrastructure tooling, including: Microsoft System Center Operations Manager (SCOM), PRTG Network Monitor
  • Experience in network device monitoring and dashboard configuration
  • Strong fault finding, diagnosis, and resolution skills across complex infrastructure environments
  • Experience with virtualised environments, enterprise storage, file/print services, and hardware evaluation
  • Strong understanding of service management and working within SLA-driven environments
  • Experience working within governed frameworks and structured delivery methodologies
  • Project leadership experience within structured methodologies such as PRINCE2 or Project Management Institute (PMI) approaches
  • Diploma or equivalent in Computer Science or related discipline

Nice to have:

  • Experience working in customer-facing environments and understanding business impact of technical issues
  • Strong documentation skills for both end-user and operational audiences
  • Accreditation at Microsoft Certified Systems Engineer (MCSE) level or equivalent
  • Knowledge of ITIL Foundation principles and service management best practice
  • Experience with enterprise messaging, thin client environments, or virtualization platforms

Additional Information:

Job Posted:
April 23, 2026

Employment Type:
Fulltime
Work Type:
On-site work
Job Link Share:

Looking for more opportunities? Search for other job offers that match your skills and interests.

Briefcase Icon

Similar Jobs for Monitoring Engineer / Infrastructure Engineer

NOC Monitoring Engineer-Infrastructure Management

Project Description: Experience in Service Desk Management and Voice Support wit...
Location
Location
India , Noida
Salary
Salary:
Not provided
https://www.soprasteria.com Logo
Sopra Steria
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Excellent customer service skills with a high level of focus on quality
  • First Point of Contact: Serve as the initial point of contact for end-users seeking technical assistance via phone, email, or in-person
  • Monitor IT infrastructure and applications using tools like CTLM, Centreon, Zabbix, and ITSM platforms (ServiceNow, SMAX)
  • Handle incidents, alerts, and requests via ticketing tools, providing first-line fixes or escalating to L2/L3 as needed
  • Log and document incidents, generate reports, and track recurring issues to identify trends and potential problems
  • Maintain ITIL-compliant procedures and update service documents, SOPs, and KB articles
  • Develop dashboards for uptime, availability, and utilization metrics for infrastructure and applications
  • Collaborate with the monitoring team, manage shifts, and provide training and documentation support
  • Maintain and Update Service Documents and SOP
  • Responsible for identifying potential problems and/or trends of repetitive Incidents
Job Responsibility
Job Responsibility
  • Monitor the events or alerts on monitoring tools, perform initial investigation, and raise with the Support team
What we offer
What we offer
  • Commitment to fighting against all forms of discrimination
  • Inclusive and respectful work environment
  • Open to people with disabilities
Read More
Arrow Right

NOC Monitoring Engineer

Sopra Steria, a major Tech player in Europe recognized for its consulting, digit...
Location
Location
India , Chennai
Salary
Salary:
Not provided
https://www.soprasteria.com Logo
Sopra Steria
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Knowledge of common monitoring tools and ability to interpret their metrics and ITSM tool
  • understanding of key network monitoring protocols including SNMP, NetFlow, WMI, syslog, etc. and devices routers, palo alto, checkpoint etc.
  • knowledge in routing, switching and TCP/IP troubleshooting techniques
  • basic knowledge of the OSI model, diagnostic skills
  • knowledge of Linux , Windows, command line and system diagnostics
Job Responsibility
Job Responsibility
  • Team Member supporting NOC operations for continuous monitoring of Global IT Infrastructure (24x7) and services
  • proactive alerting and notifying stakeholders during critical incidents and escalations
  • contributing towards ensuring IT uptime, and documentation
  • validating for desired performance and health status of all monitoring tools deployed in NOC
  • generating reports related to availability, performance and capacity bottle necks at desired intervals as per operational requirements and on need basis
  • logging of incidents and events based on appropriate categories using ticketing tool and assigning it to the appropriate stakeholders for resolution
  • notifying incidents events using appropriate communication channels
  • provide appropriate inputs to stakeholders during major incidents
  • participate in all review forums to enhance process and procedure involving NOC operations
What we offer
What we offer
  • Inclusive and respectful work environment
  • commitment against all forms of discrimination
  • open positions for people with disabilities
  • Fulltime
Read More
Arrow Right

NOC Monitoring Engineer

Experience in Service Desk Management and Voice Support with strong communicatio...
Location
Location
India , Noida
Salary
Salary:
Not provided
https://www.soprasteria.com Logo
Sopra Steria
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Excellent customer service skills with a high level of focus on quality
  • Serve as the initial point of contact for end-users seeking technical assistance via phone, email, or in-person
  • Monitor IT infrastructure and applications using tools like CTLM, Centreon, Zabbix, and ITSM platforms (ServiceNow, SMAX)
  • Handle incidents, alerts, and requests via ticketing tools, providing first-line fixes or escalating to L2/L3 as needed
  • Log and document incidents, generate reports, and track recurring issues to identify trends and potential problems
  • Maintain ITIL-compliant procedures and update service documents, SOPs, and KB articles
  • Develop dashboards for uptime, availability, and utilization metrics for infrastructure and applications
  • Collaborate with the monitoring team, manage shifts, and provide training and documentation support
  • Maintain and Update Service Documents and SOP
  • Responsible for identifying potential problems and/or trends of repetitive Incidents
Job Responsibility
Job Responsibility
  • Monitor the events or alerts on monitoring tools
  • perform initial investigation
  • raise with the Support team.
What we offer
What we offer
  • Inclusive work environment
  • Respect for all differences
  • Positions open to people with disabilities.
  • Fulltime
Read More
Arrow Right

Engineering Manager, Infrastructure

As an Engineering Manager for the Infrastructure team, you’ll lead the engineers...
Location
Location
Canada; United States
Salary
Salary:
195000.00 - 285000.00 USD / Year
apollo.io Logo
Apollo.io
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of hands-on software or infrastructure engineering experience
  • 2+ years of experience leading teams of senior and staff-level engineers in platform, SRE, or infrastructure domains
  • Proven ability to design and operate large-scale distributed systems in cloud environments (preferably GCP or AWS)
  • Expertise with Kubernetes, Docker, Terraform, Ubuntu, and CI/CD pipelines
  • Familiarity with observability tools (Grafana, Prometheus, ELK, Datadog, NewRelic) and performance tuning
  • Strong grounding in networking, security, and reliability principles
  • Experience managing infrastructure costs, availability SLAs, and high-throughput systems at scale
Job Responsibility
Job Responsibility
  • Lead, coach, and grow a distributed team of high-impact Infrastructure Engineers
  • Partner with senior engineering leadership on strategic initiatives such as cloud migration, infrastructure scaling, platform reliability, and cost efficiency
  • Define and implement modern operational excellence practices, including SLOs, error budgets, incident reviews, and performance monitoring
  • Guide technical decision-making across key areas like Kubernetes, GCP, observability, networking, CI/CD, and IaC (Terraform, Ansible)
  • Collaborate with AI, Data, and Product Engineering teams to ensure infrastructure scalability for ML and AI-native workloads
  • Run effective 1:1s, career development conversations, and quarterly performance reviews
  • Support recruiting efforts to attract top engineering talent across time zones
What we offer
What we offer
  • Equity
  • Company bonus or sales commissions/bonuses
  • 401(k) plan
  • At least 10 paid holidays per year
  • Flex PTO
  • Parental leave
  • Employee assistance program and wellbeing benefits
  • Global travel coverage
  • Life/AD&D/STD/LTD insurance
  • FSA/HSA and medical, dental, and vision benefits
  • Fulltime
Read More
Arrow Right

Sr. Infrastructure Security Engineer

As a Sr. Infrastructure Security Engineer, you will be responsible for protectin...
Location
Location
United States , West Point
Salary
Salary:
84410.00 - 129987.00 USD / Year
haeaus.com Logo
Hyundai AutoEver America
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Bachelor’s degree in computer science, Information Systems, or related field, or equivalent experience and certifications
  • Ability to script using Python
  • 7+ years of experience in Security Engineering, including planning and operations
  • Advanced knowledge of security technologies in medium to complex computing environments
  • Hands-on experience with multiple enterprise security technologies (e.g., firewalls, VPNs, intrusion detection/prevention, endpoint security)
  • Strong understanding of server/network architecture and core networking concepts (e.g., routing, DNS, DHCP)
Job Responsibility
Job Responsibility
  • Design and Deploy Security Solutions: Build, test, and implement new security technologies, including creating operational manuals and runbooks
  • Operate and Optimize Security Systems: Maintain and improve existing security tools such as DLP, Antivirus, IPS/IDS, and Endpoint Protection, while automating monitoring and enforcement processes
  • Conduct Risk Assessments and Incident Response: Lead or support technical risk evaluations and respond to security incidents, ensuring thorough remediation and reporting
  • Collaborate and Advise: Work with internal and external stakeholders to identify security needs, recommend solutions, and stay current with evolving technologies
  • Monitor and Report: Continuously monitor infrastructure for threats, produce security reports for senior leadership, and implement changes following established procedures
  • Fulltime
Read More
Arrow Right

Senior Software Engineer - ML Infrastructure

We build simple yet innovative consumer products and developer APIs that shape h...
Location
Location
United States , San Francisco
Salary
Salary:
180000.00 - 270000.00 USD / Year
plaid.com Logo
Plaid
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5+ years of industry experience as a software engineer, with strong focus on ML/AI infrastructure or large-scale distributed systems
  • Hands-on expertise in building and operating ML platforms (e.g., feature stores, data pipelines, training/inference frameworks)
  • Proven experience delivering reliable and scalable infrastructure in production
  • Solid understanding of ML Ops concepts and tooling, as well as best practices for observability, security, and reliability
  • Strong communication skills and ability to collaborate across teams
Job Responsibility
Job Responsibility
  • Design and implement large-scale ML infrastructure, including feature stores, pipelines, deployment tooling, and inference systems
  • Drive the rollout of Plaid’s next-generation feature store to improve reliability and velocity of model development
  • Help define and evangelize an ML Ops “golden path” for secure, scalable model training, deployment, and monitoring
  • Ensure operational excellence of ML pipelines and services, including reliability, scalability, performance, and cost efficiency
  • Collaborate with ML product teams to understand requirements and deliver solutions that accelerate experimentation and iteration
  • Contribute to technical strategy and architecture discussions within the team
  • Mentor and support other engineers through code reviews, design discussions, and technical guidance
What we offer
What we offer
  • medical, dental, vision, and 401(k)
  • Fulltime
Read More
Arrow Right

Infrastructure Engineer

The Infrastructure Engineer is responsible for building and maintaining secure, ...
Location
Location
United States , Dallas
Salary
Salary:
Not provided
zazz.io Logo
Zazz
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • 5–8 years in infrastructure engineering in MSP or enterprise environments
  • Deep expertise in Windows Server administration (AD, DNS, DHCP, GPO) and Hyper-V virtualization
  • Proven track record with Acronis Cyber Protect and Axcient backup deployments, policy management, and restore/DR testing
  • Strong experience configuring NinjaOne monitoring, alerting, and automated workflows for server infrastructure
  • Experience with Defender for Servers, vulnerability management, and basic server hardening practices
  • Familiarity with hybrid cloud integrations, Azure AD Connect, and file sync solutions
  • Solid PowerShell scripting skills for automation and operational efficiency
  • Strong documentation discipline, change control, and structured troubleshooting approach
  • Understanding of compliance-aligned backup/DR practices (HIPAA, SOC 2, GDPR)
Job Responsibility
Job Responsibility
  • Configure, tune, and manage NinjaOne monitoring policies for servers, virtualization hosts, and critical infrastructure services
  • Implement proactive alerting, service health checks, and event log monitoring with escalation workflows
  • Build PowerShell scripts and NinjaOne workflows for automated remediation, scheduled tasks, and routine maintenance
  • Manage server grouping, tagging, and baseline policy assignment in RMM for structured management
  • Administer and maintain Windows Server environments (2016–2022) — including Active Directory, DNS, DHCP, GPO, Certificate Services, and Print Services
  • Manage Hyper-V virtualization: virtual machine provisioning, snapshots, replication, and failover clustering where applicable
  • Apply server hardening baselines (Defender for Servers, ASR rules, local policies) in coordination with the Security team
  • Configure Acronis Cyber Protect for server-level patching and anti-malware where appropriate
  • Deploy and manage Acronis Cyber Protect and Axcient for server, endpoint, and SaaS (M365) backups across client environments
  • Define and apply backup retention policies, encryption standards, and job schedules aligned with compliance requirements (HIPAA, SOC 2, GDPR)
What we offer
What we offer
  • Thriving work environment
  • Opportunities for continuous learning
  • Chance to work with some of the best minds in the industry
  • Fulltime
Read More
Arrow Right

Senior Infrastructure Engineer

We are seeking a skilled and proactive individual to play a key role in supporti...
Location
Location
United Kingdom , Manchester
Salary
Salary:
Not provided
ans.co.uk Logo
ANS Group
Expiration Date
Until further notice
Flip Icon
Requirements
Requirements
  • Exposure to secure architecture design and implementation
  • Experience with the deployment and management Carbon Black or other EDR solutions across cloud infrastructure
  • Significant previous experience as an infrastructure engineer working on a large scale enterprise or multi-tenant environment
  • VMware 7.0+
  • Significant experience troubleshooting and analysing complex failures
  • Operational experience of NSX 3.0+
  • Scripting abilities in Powershell and PowerCLI
  • Experience with Cisco UCS or other enterprise blade systems
  • Significant Experience with Storage Technologies (HPE 3PAR, Nimble, Dell Compellent)
  • Experience with FC storage networking
Job Responsibility
Job Responsibility
  • Work to ensure conformity to public sector infrastructure requirements are met
  • Work in conjunction with our SoC team to develop and maintain platform security baselines
  • Monitor, diagnose and resolve significant problems within the ANS infrastructure
  • Be an escalation point for team members and the support teams offering technical expertise in virtualization, compute hardware and storage
  • Collaborate and work with other technical teams to provide industry leading support to our customers
  • Responsible for creating high quality documentation
  • Proactively work to identify areas of improvement in the platform
  • Effectively deliver project milestones
  • Responsible for the generation of LLD from HLD
  • Ensure our infrastructure is up to date by planning & performing patching and firmware upgrades
What we offer
What we offer
  • 25 days’ holiday, plus you can buy up to 5 more days
  • Birthday off
  • An extra celebration day
  • 5 days’ additional holiday in the year you get married
  • 5 volunteer days
  • Private health insurance
  • Pension contribution match and 4 x life assurance
  • Flexible working and work from anywhere for up to 30 days per year
  • Maternity: 16 weeks’ full pay
  • Paternity: 3 weeks’ full pay
  • Fulltime
Read More
Arrow Right