A healthy IT system is vital, and having the right software and hardware is only part of the puzzle.
Available separately, or as part of our IT Support packages, our Server & Network Monitoring service provides you with total peace of mind.
As part of our ongoing commitment to create an optimum IT environment for our customers, our Server & Network Monitoring comes as a fully managed service. This means potential issues are picked up automatically and reviewed by one of our IT experts. This can pre-empt major issues and save your business considerable down-time.
Because we’re based in London, we’re available same-day for emergency visits in and around the City. So if we do notice any urgent issues that can’t be solved remotely, we’ll be at your office quick as lightning to help ensure all business-critical functions continue to operate.
What’s our monitoring process?
To ensure the fastest response to any issues flagged by our automated monitoring systems, all new alerts are collated onto a central dashboard in our offices, which is monitored continuously by a dedicated pair of monitoring engineers. From here, tickets are logged into our helpdesk where based on colour coding, and for any critical alerts (e.g. server offline) there is also an audible alarm and an email notification sent to a wider pool of engineers including the Helpdesk Manager.
Other items can be monitored through the use of SNMP traps. Or, through scripts that the monitoring software can run at predetermined intervals. There is a large database of scripts to which we have access. We can either customise these scripts or write fresh ones to enable the monitoring of virtually any aspect of your server and other network objects including printers, switches, routers and firewalls.
The Key Parameters Explained
Why do we monitor so many key parameters? See below an explanation for each of the most commonly monitored parameters.
If free space on a disk drops below an agreed threshold, action can be taken before it causes a problem to either clean up the disk or increase the disk size. We continue to monitor disks for 5 working days after the free space issue is back above the threshold before closing off the ticket.
If a server goes offline, or reports an unexpected shutdown event. These are treated as critical and dealt with immediately.
If a server (or other device such as SAN/DAS, or NAS) reports a hardware component failure or predicted failure, e.g. hard disk, power supply, fan, or ambient temperature too high.
If the UPS (Uninterruptible Power Supply) detects that the incoming power supply has failed, connectivity to the software agent on the servers is lost, or if the battery needs replacing
Where there are multiple internet connections with automated failover between them managed by the firewall, or multiple firewalls set up in High Availability mode, we receive alerts if an internet line or device goes down.
Antivirus software can be set to be automatically deployed to all and any new machines that join the network, we receive notifications into a central console if any endpoint reports an infection or application error.
Servers will be patched and rebooted monthly with all critical updates, with automated reboots taking place overnight on a prescribed schedule on agreed dates to avoid disruption
Windows Event Logs
Understanding what does and does not indicate a genuine problem in the Windows Event Logs has a reputation of being somewhat of a black art.
Over the last decade, through our experience we have built up a library of the specific combinations of events and scenarios which actually require action, so our monitoring systems will only flag up events which indicate failures or potential failures, and present them in a format which is colour-coded according to severity, so that appropriate action and priority can be performed by our helpdesk.