A customer contacted our support recently, wondering why his Linux servers showed high memory utilization. We’ve talked a fair bit about monitoring Linux memory, and what it means. But this case was a little different. The customer showed that according to free, top, and LogicMonitor graphs, most of his server’s memory was in use – and not as file system cache.
However, the odd thing is that top and ps showed that the sum of all processes RSS (resident segment size) was consuming very little memory. So where was the memory going?
Can LogicMonitor, a developer of datacenter server monitoring, where we monitor everything in all sorts of ways, have an undetected customer affecting issue? Yes. We just had an issue that was reported by some trial customers, before our techops team was aware of it. Even worse, after techops thought they had addressed the issue, the customers were still affected. How? Read more »
Could this happen to you?
Someone in your company makes an erroneous entry in DNS. After a short time, some customers begin receiving ‘Server Not Found’ reports when trying to access your site. Email doesn’t seem to be going through for some users. Help tickets start trickling in.
As your TechOps team attempts to troubleshoot, the error silently propagates through the Internet. The trickle of isolated tickets turns into a flash flood. Executives begin urgently texting to find out what is happening.
Eventually, someone on your team combs through the DNS file and catches the mistake. Instead of entering “.com” in the middle of the night, John must have fat fingered in “.con.” The error is fixed! However, because your DNS is cached it could be a couple of days before service is fully restored for all users.
Customers and executives demand a root cause analysis. “How could this have happened? Why wasn’t it caught earlier? What are you doing to prevent this ever happening again?”
No one can deny the importance of DNS in the Internet age. And to help you keep on top of it, LogicMonitor, maker of the popular automated IT performance monitoring platform, has just released its first free tool, the DNS Change Tracker™ as a free hosted tool. In the near term, we plan to release this tool’s source code on GitHub so that everyone can make it even better.
What it does: Read more »
Last month, LogicMonitor rolled out its Favorites From Around the Web. We know you are busy beyond belief. It’s tough to keep up with the latest tech articles, blog posts, ebooks, podcasts and hilarious, time-consuming cat videos (no shame, we get it). Check out our favorite posts from around the web for September:
No one likes to talk about outages. They’re horrible to experience as an employee and they take a heavy toll in customer confidence and future revenue. But they do happen. Even publicly traded tech powerhouses, such as eBay and Microsoft, who have more technical resources than you’ll ever have, fall prey to outages. And when they do, they are closed for business, much to the chagrin of their shareholders and executive teams.
It’s not so much a question of whether an outage will occur in your company but when. The secret to surviving them is to get better at handling them and learning from the mistakes of others. Nobody is perfect all the time (my current employer, LogicMonitor, included) but I hope by talking about these mistakes, we can all begin the hard work required to avoid them in the future.
An outage occurs. A barrage of emails is fired to the Tech Ops team from Customer Support. Executives begin demanding updates every five minutes. Tech team members all run to their separate monitoring tools to see what data they can dredge up, often only seeing a part of the problem. Mass confusion ensues as groups point their fingers at each other and Sys Admins are unsure whether to respond to the text from their boss demanding an update or to continue to troubleshoot and apply a possible fix. Marketing (“We’re getting trashed on social media! We need to send a mass email and do a blog post telling people what is happening!”) and Legal (“Don’t admit liability!”) jump in to help craft a public-facing response. Cats begin mating with dogs and the world explodes.
Read more »
How Cedexis Deploys Puppet Enterprise and LogicMonitor Jointly to Support its Global Operations
Founded in 2009, Cedexis is building a faster Web. Cedexis offers visibility and control of Web performance through its community-based monitoring & analysis solution, Cedexis Radar, and its global traffic management platform, Cedexis Openmix.
Ops in the Cloud
Deploying their technology strictly in a cloud environment, Cedexis’ TechOps team follows a simple rule: “Never touch hardware.” Cedexis manages its dynamic host deployments globally across a range of managed hosting and cloud providers. To ensure uniformity across datacenters, Cedexis configures new machines identically via configuration automation tools in order to prepare each with a “blueprint” to take the Cedexis code.
Read more »
“Excuse me. Did you just say that I could learn something of actual value from a marketing person?”
Yes, I did.
In case you haven’t noticed, marketing has undergone a phenomenal transformation in the last decade. Marketing has implemented complex automation platforms, like Marketo and Eloqua, that can sift through a sea of prospect data and use predictive analytics to pick out those most likely to purchase. Web analytics, too, have gone mainstream, largely due to Google giving away the functionality for free. Consequently, marketing can better quantify the ROI of what they spend. And, according to Gartner Research, 81% of companies with revenue of more than $500M have a Chief Marketing Technology Officer, and that number is expected to grow another 8% next year. By 2017, Gartner predicts that Chief Marketing Officers will be spending more on technology than CIOs!
So what does that have to do with IT monitoring?
Read more »
Too busy to keep up with what’s happening on the Web? Never fear. Starting this month, LogicMonitor will begin posting our favorite tech articles, blog posts, ebooks, videos, podcasts, cat pictures and more every month. Our favorites from August:
Tech conferences are great venues to get better at your job. You may set off to learn a new technology, hear from thought leaders about your industry, network with new and fabulous people in your field, and possibly find a new job! Don’t tell your boss, given she just begrudgingly signed off on your travel to San Francisco for VMWorld! Tech tradeshows can be overwhelming. The jetlag, milling about in large crowds, sleep deprivation and the technology onslaught are all factors working against you. Here are some tips to ensure that you get the most bang for your buck: Read more »
Performance monitoring for all your infrastructure & applications. In minutes, not hours.
Questions? Call Us!
(888) 415-6442 or +1 (805)-617-3884