A customer contacted our support recently, wondering why his Linux servers showed high memory utilization. We’ve talked a fair bit about monitoring Linux memory, and what it means. But this case was a little different. The customer showed that according to free, top, and LogicMonitor graphs, most of his server’s memory was in use – and not as file system cache.
However, the odd thing is that top and ps showed that the sum of all processes RSS (resident segment size) was consuming very little memory. So where was the memory going?
Can LogicMonitor, a developer of datacenter server monitoring, where we monitor everything in all sorts of ways, have an undetected customer affecting issue? Yes. We just had an issue that was reported by some trial customers, before our techops team was aware of it. Even worse, after techops thought they had addressed the issue, the customers were still affected. How? Read more »
Last month, LogicMonitor rolled out its Favorites From Around the Web. We know you are busy beyond belief. It’s tough to keep up with the latest tech articles, blog posts, ebooks, podcasts and hilarious, time-consuming cat videos (no shame, we get it). Check out our favorite posts from around the web for September:
How Cedexis Deploys Puppet Enterprise and LogicMonitor Jointly to Support its Global Operations
Founded in 2009, Cedexis is building a faster Web. Cedexis offers visibility and control of Web performance through its community-based monitoring & analysis solution, Cedexis Radar, and its global traffic management platform, Cedexis Openmix.
Ops in the Cloud
Deploying their technology strictly in a cloud environment, Cedexis’ TechOps team follows a simple rule: “Never touch hardware.” Cedexis manages its dynamic host deployments globally across a range of managed hosting and cloud providers. To ensure uniformity across datacenters, Cedexis configures new machines identically via configuration automation tools in order to prepare each with a “blueprint” to take the Cedexis code.
Read more »
“Excuse me. Did you just say that I could learn something of actual value from a marketing person?”
Yes, I did.
In case you haven’t noticed, marketing has undergone a phenomenal transformation in the last decade. Marketing has implemented complex automation platforms, like Marketo and Eloqua, that can sift through a sea of prospect data and use predictive analytics to pick out those most likely to purchase. Web analytics, too, have gone mainstream, largely due to Google giving away the functionality for free. Consequently, marketing can better quantify the ROI of what they spend. And, according to Gartner Research, 81% of companies with revenue of more than $500M have a Chief Marketing Technology Officer, and that number is expected to grow another 8% next year. By 2017, Gartner predicts that Chief Marketing Officers will be spending more on technology than CIOs!
So what does that have to do with IT monitoring?
Read more »
Too busy to keep up with what’s happening on the Web? Never fear. Starting this month, LogicMonitor will begin posting our favorite tech articles, blog posts, ebooks, videos, podcasts, cat pictures and more every month. Our favorites from August:
For those of you using our MongoDB monitoring, there’s an update for replication monitoring.
There’s a few improvements over the prior datasource: it deals with authentication better; removes some assumptions about whether members of a replica set are running on common ports, etc. Most of the data points being monitored are standard, and don’t need much comment. (We find all the members; monitor their health, state, uptime, etc). Read more »
As we’ve often preached – too many alerts are just as bad as missing alerts. You don’t want your team to become inured to alerts, so they don’t take action on those indicating outages.
For those of you using Campfire as your team collaboration tool, you now have another way to help manage your infrastructure and server monitoring alerts, and ensure every alert is reacted to appropriately. LogicMonitor now integrates with Campfire using the Campfire API.
How does this integration help you react to alerts appropriately, and ensure your teams don’t suffer from alert overload? Read more »
[Originally appeared April 30, 2014 in the Packet Pushers online community, written by Steve Francis, Founder and Chief Product Officer with LogicMonitor. This is a continuation of the 'SNMP traps' blog post previously posted on the LogicMonitor blog.]
LogicMonitor is a SaaS-based IT infrastructure monitoring company, monitoring the performance, capacity and availability of thousands of different kinds of devices and applications for thousands of customers. Where possible, we don’t rely on SNMP traps – and neither should you.
Firstly, consider what a trap is – a single UDP datagram, sent from a device to notify you that something is going wrong with that device. Now, UDP (User Datagram Protocol) packets are, unlike TCP, not acknowledged, and not retransmitted if they get lost on the network and don’t arrive, since the sender has no way of knowing if it arrived or not. So, a trap is a single, unreliable notification, sent from a device at the exact time that a UDP packet is least likely to make it to the management station – as, by definition, something is going wrong. The thing going wrong may be causing spanning tree to re-compute, or routing protocols to reconverge, or interface buffers to reset due to a switchover to redundant power supply. Not the time to rely on a single packet to tell you about critical events. Traps are just not a reliable means to tell you of things that can critically affect your infrastructure – this is the main reason to avoid them if possible.
Performance monitoring for all your infrastructure & applications. In minutes, not hours.
Questions? Call Us!
(888) 415-6442 or +1 (805)-617-3884