Recently we rolled out a new release of LogicMonitor. Among the many improvements and fixes that users saw, there were also some backend changes to the Linux systems that store monitoring data.
The rollout went smoothly and no alerts were triggered – but it was pretty easy to see that something had changed: Read more »
A customer contacted us this week and said “Hey, one of my filesystems that was being monitored by LogicMonitor disappeared after I grew it.” Turns out the filesystem in question was now a bit over 2 terabytes.
Some poking around showed that the file system was being filtered out of discovery, because net-snmp was reporting a size for it (via 1.3.6.1.2.1.25.2.3.1.5, hrStorageSize) of -1982127408. Yes, that’s a negative value.
The hrStorageSize object is defined as Integer32 – so it’s really a signed integer. Go above 2147483647 allocation units, and you’ll be in negative territory (as the first bit will be interpreted as the sign).
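To make the wraparound concrete, here is a minimal Python sketch of how a 32-bit count turns negative when it is read as a signed Integer32. The helper name `as_signed32` is made up for illustration; it is not part of net-snmp or LogicMonitor.

```python
def as_signed32(value: int) -> int:
    """Interpret the low 32 bits of value as a two's-complement Integer32."""
    value &= 0xFFFFFFFF  # keep only the 32 bits that fit on the wire
    # If the top bit is set, the signed reading is the value minus 2**32.
    return value - 0x100000000 if value >= 0x80000000 else value


print(as_signed32(2147483647))  # largest count that still reads as positive
print(as_signed32(2147483648))  # one more unit, and the sign bit is set
print(as_signed32(2312839888))  # the unit count behind the customer's report
```

The last call shows that 2312839888 allocation units is the count that comes back as -1982127408, the value net-snmp reported.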
So, instead of disk Usage (as a percentage) being calculated:
we can change the formula LogicMonitor uses to calculate the percentage of disk space to:
which takes account of the fact that anything above 2147483647 will be reported as a negative number, and corrects for it.
In English, the above formula says:
We use a similar formula in the graphing definition of the Linux Disk Usage datasource, although there the values are also multiplied by the size of the Allocation Units, so you get an accurate representation of the size of the file system:
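The correction described above can be sketched in Python as follows. This is an illustration of the idea, not LogicMonitor's actual datasource expression; the function names, the sample `used` value, and the 1024-byte allocation unit are assumptions chosen for the example.

```python
UINT32_SPAN = 2 ** 32  # number of distinct values a 32-bit counter can hold


def as_unsigned32(value: int) -> int:
    """Undo the signed reading: negative values get 2**32 added back."""
    return value + UINT32_SPAN if value < 0 else value


def percent_used(size_units: int, used_units: int) -> float:
    """Percentage used, correcting both raw values for the sign first."""
    size = as_unsigned32(size_units)
    used = as_unsigned32(used_units)
    if size == 0:
        return 0.0  # avoid dividing by zero for empty or pseudo filesystems
    return 100.0 * used / size


def size_in_bytes(size_units: int, allocation_unit_bytes: int) -> int:
    """Filesystem size in bytes: corrected unit count times the unit size."""
    return as_unsigned32(size_units) * allocation_unit_bytes


# Size as reported in the post; the used count and unit size are hypothetical.
reported_size = -1982127408
hypothetical_used = 1156419944
print(percent_used(reported_size, hypothetical_used))
print(size_in_bytes(reported_size, 1024))
```

Note that this only recovers counts below 2^32 allocation units; beyond that the 32-bit value wraps completely and the original count cannot be reconstructed from it.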
We’ve updated LogicMonitor and its core datasource repository, so now all customers will be able to avoid this problem if they deploy terabyte-size filesystems.
This adjustment can be used for other values reported as signed integers when you don’t want them treated as signed. So, for everyone running into this issue – you don’t need to update net-snmp (which a lot of people seem to be calling for) or define a new MIB object. Just configure your monitoring and graphing systems to correct for the sign, as above.
And if your monitoring systems can’t, well – you can always switch to LogicMonitor.
Tags: linux monitoring
By which I mean that, unless you are a kernel developer or some other individual with esoteric purposes, having Linux up and running is not the point of your servers. Your servers are there to DO something, whether that’s to serve web pages, answer database requests, or provide the best hosted monitoring service.
So…what do you monitor? Read more »
Tags: linux monitoring