<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>LogicMonitor Blog</title>
	<atom:link href="http://blog.logicmonitor.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.logicmonitor.com</link>
	<description>Interesting issues in datacenter monitoring</description>
	<lastBuildDate>Tue, 31 Aug 2010 23:14:18 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Linux Monitoring is dead.</title>
		<link>http://blog.logicmonitor.com/2010/08/31/linux-monitoring-is-dead/</link>
		<comments>http://blog.logicmonitor.com/2010/08/31/linux-monitoring-is-dead/#comments</comments>
		<pubDate>Tue, 31 Aug 2010 23:13:24 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[linux monitoring]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=173</guid>
		<description><![CDATA[Long live Linux monitoring.
By which I mean that, unless you are a kernel developer or some other individual with esoteric purposes, having Linux up and running is not the point of your servers.  Your servers are there to DO something, whether that&#8217;s to serve web pages, answer database requests, or provide the best hosted monitoring [...]]]></description>
			<content:encoded><![CDATA[<h2><strong>Long live Linux monitoring.</strong></h2>
<p>By which I mean that, unless you are a kernel developer or some other individual with esoteric purposes, having Linux up and running is not the point of your servers.  Your servers are there to DO something, whether that&#8217;s to serve web pages, answer database requests, or provide the best hosted monitoring service.</p>
<p>So&#8230;what do you monitor?<span id="more-173"></span></p>
<p>Well, in the case of a server whose main point is to be part of a cluster of Apache web servers &#8211; <a href="http://www.logicmonitor.com/monitoring/applications/web-server-monitoring/apache/">monitor Apache</a>. Check that it&#8217;s running; returning the content you expect it to return; the number of busy server threads, throughput per second, etc.</p>
<p>Congratulations, you&#8217;ve monitored the most important thing about that server. But that doesn&#8217;t mean you are done. This is where the <a href="http://www.logicmonitor.com/monitoring/servers/linux/">Linux monitoring</a> comes in. If your server is to serve http requests with Apache, it needs CPU resources. It needs memory. It definitely should not be swapping. If there are any issues to resolve, that will be much easier if NTP has synch&#8217;d the server&#8217;s time.  You&#8217;ll want to know about hard drive failures; power supply issues, temperature.  You&#8217;ll want to know if the SSL certificate for your web server is approaching expiration (and you&#8217;ll want to know that a few weeks in advance.) You&#8217;ll want to know if file systems are filling up; if physical disks are approaching their maximum in terms of IO operations per second, increasing CPU wait time. If interfaces are having errors, drops or discards.  If the mail sent by the server is bouncing, being delivered, or rejected.</p>
<p>All these aspects are important to monitor so that you will know if your server faces any issues that will prevent it from fulfilling its destiny &#8211; of serving web pages.</p>
<p>And of course as this server is part of a cluster, you want to graph its throughput with the other servers, so you can see overall performance, and also compare different servers to each other:</p>
<p><img class="alignnone size-full wp-image-174" title="LinuxMonitoring" src="http://blog.logicmonitor.com/wp-content/uploads/2010/08/LinuxMonitoring.png" alt="Linux Monitoring" width="575" height="264" /></p>
<p>And this is just for one server, with one application. If your server runs a java webserver, or a database, or memcached &#8211; there&#8217;s suddenly a lot more things to monitor, just to get &#8220;Linux Monitoring&#8221;.</p>
<p>So what&#8217;s the point? I&#8217;ve said it before, and will say it again, but automation is essential. If your monitoring is cumbersome to get new servers monitored completely, guess what &#8211; you won&#8217;t monitor them completely. Which will increase your outages, and increase time to resolution. The more you monitor, the quicker you can resolve issues, as issues are often not where you&#8217;d expect.</p>
<p>A quick query shows that a typical server in our lab has 382 datapoints monitored on it after a default install, just by entering its hostname into LogicMonitor. If we had to configure the monitoring ourselves, the number would more likely approach 1 for lab machines (&#8220;does it ping?&#8221;), especially as we fire up new virtual machines in the lab regularly. But that would mean our developers would miss all sorts of MySQL and Tomcat metrics that they routinely use to improve the next release of our product. And our Ops team would not have good history of performance metrics, like average request service time, so they can identify when issues were introduced on the QA servers. And QA would be held up as our QA systems would not be as highly available.</p>
<p>So for us, automation has both decreased work, and increased productivity in all sorts of unexpected ways. How has it benefited you? Or how could it?</p>
<p><strong>LogicMonitor automates your monitoring setup. <a href="http://www.logicmonitor.com/contact/try-it/"><img class="alignnone size-full wp-image-179" style="border-style: none;" title="Linux-monitoring-button" src="http://blog.logicmonitor.com/wp-content/uploads/2010/08/Linux-monitoring-button.png" alt="Linux Monitoring Trial" width="238" height="60" /></a></strong><a href="http://www.logicmonitor.com/about/contact/try-it/"><br />
</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F08%2F31%2Flinux-monitoring-is-dead%2F&amp;linkname=Linux%20Monitoring%20is%20dead."><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/08/31/linux-monitoring-is-dead/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VMware monitoring webinar to watch</title>
		<link>http://blog.logicmonitor.com/2010/08/21/vmware-monitoring-webinar-to-watch/</link>
		<comments>http://blog.logicmonitor.com/2010/08/21/vmware-monitoring-webinar-to-watch/#comments</comments>
		<pubDate>Sun, 22 Aug 2010 00:19:05 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=165</guid>
		<description><![CDATA[Just a quick note to say that our recording of the webinar we gave on virtualization monitoring VMware monitoring (and XenServer Monitoring) is up on LogicMonitor.com.
It&#8217;s a quick (about 10 minutes) look at why it&#8217;s particularly important to have a unified monitoring system that covers virtualization infrastructure, the guest OS&#8217;s and the applications on the [...]]]></description>
			<content:encoded><![CDATA[<p>Just a quick note to say that our recording of the <a href="http://www.logicmonitor.com/learn-more/worlds-shortest-webinars/">webinar </a>we gave on virtualization monitoring <a href="http://www.logicmonitor.com/monitoring/virtualization/vsphere-monitoring/">VMware monitoring</a> (and XenServer Monitoring) is up on LogicMonitor.com.</p>
<p>It&#8217;s a quick (about 10 minutes) look at why it&#8217;s particularly important to have a unified monitoring system that covers virtualization infrastructure, the guest OS&#8217;s and the applications on the guest OSs, and the storage, all in one monitoring system &#8211; otherwise you can end up with people without sufficient information chasing all sorts of problems that would be immediately identifiable with a unified system.</p>
<p>Check it out, and if you have any questions, feel free to email us at info@ logicmonitor.com</p>
<p>There were quite a few questions at the end of the webinar, but I didn&#8217;t include them in the recording, else we would have had to post it somewhere else other than the Worlds Shortest Webinars. <img src='http://blog.logicmonitor.com/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F08%2F21%2Fvmware-monitoring-webinar-to-watch%2F&amp;linkname=VMware%20monitoring%20webinar%20to%20watch"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/08/21/vmware-monitoring-webinar-to-watch/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>3 Simple steps to Apache Monitoring</title>
		<link>http://blog.logicmonitor.com/2010/08/13/3-simple-steps-to-apache-monitoring/</link>
		<comments>http://blog.logicmonitor.com/2010/08/13/3-simple-steps-to-apache-monitoring/#comments</comments>
		<pubDate>Fri, 13 Aug 2010 20:04:37 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=143</guid>
		<description><![CDATA[If you&#8217;re reading this, you know you should be monitoring your Apache web servers. (You want to know if they are approaching limits of configured server workers; you want to know how many requests you are serving; you want to ensure availability, etc).  Fortunately, enabling Apache monitoring is quite simple.
Make sure you are loading the [...]]]></description>
			<content:encoded><![CDATA[<p>If you&#8217;re reading this, you know you should be monitoring your Apache web servers. (You want to know if they are approaching limits of configured server workers; you want to know how many requests you are serving; you want to ensure availability, etc).  Fortunately, enabling Apache monitoring is quite simple.</p>
<p><strong>Make sure you are loading the mod_status module.</strong></p>
<p>If you are using a version of Apache that was installed by your OS&#8217;s package manager, there are OS specific ways to enable modules.</p>
<p>For Ubuntu/Debian:</p>
<pre class="programlisting">/usr/sbin/a2enmod status</pre>
<p>For Redhat/Centos: Just uncomment the line:</p>
<pre class="programlisting">LoadModule status_module modules/mod_status.so</pre>
<p>in /etc/httpd/conf/httpd.conf<br />
For Suse derivatives:<br />
add &#8220;status&#8221; to the list of modules on the  line starting with APACHE_MODULES= in /etc/sysconfig/apache2</p>
<p><strong>Configure the Mod_status module</strong></p>
<p>You want the following to be loaded in your apache configuration files.</p>
<pre class="programlisting">ExtendedStatus On
&lt;Location /server-status&gt;
 SetHandler server-status
 Order deny,allow
 Deny from all
#Add LogicMonitor agent addresses here
 Allow from localhost 192.168.10.10
&lt;/Location&gt;</pre>
<p>Where you set that configuration also changes depending on your Linux distribution.<br />
/etc/apache2/mods-available/status.conf on Ubuntu/Debian<br />
/etc/httpd/conf/httpd.conf on Redhat/CentOs<br />
/etc/apache2/mod_status.conf on OpenSuse/SLES<br />
Finally, restart apache using your OS startup script ( /etc/init.d/httpd restart or /etc/init.d/apache2 restart). Note that using the OS startup script is often necessary to allow the OS specific script files to assemble the final apache config. Sending apache signals, or using apache2ctl, does not do this.</p>
<p><strong>3. Watch the monitoring happen.</strong></p>
<p>If you are using LogicMonitor&#8217;s <a href="http://www.logicmonitor.com/monitoring/applications/web-server-monitoring/apache/">Apache monitoring</a>, then you&#8217;re done.  LogicMonitor will automatically detect the Apache web server, and apply appropriate monitoring and alerting, as well as alerting and graphing on the rest of the system, so you can correlate CPU, interface and disk load to Apache load.</p>
<p>One thing you may want to customize is your dashboards &#8211; add a widget that collects all Apache requests/second, from all hosts, or all production hosts, and aggregates them into a single graph.  Using LogicMonitor&#8217;s flexible graphs, the graph will automatically include new servers as you add them.</p>
<p><img class="alignnone size-full wp-image-154" title="ApacheRequests" src="http://blog.logicmonitor.com/wp-content/uploads/2010/08/ApacheRequests.png" alt="Apache Requests per second" width="579" height="274" /></p>
<p><strong>Want to make your Apache monitoring simple?  Check out LogicMonitor for  monitoring that automates your monitoring setup. </strong><a href="http://www.logicmonitor.com/about/contact/try-it/">Free Trial</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F08%2F13%2F3-simple-steps-to-apache-monitoring%2F&amp;linkname=3%20Simple%20steps%20to%20Apache%20Monitoring"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/08/13/3-simple-steps-to-apache-monitoring/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Is your network monitoring software deadly?</title>
		<link>http://blog.logicmonitor.com/2010/07/27/is-your-monitoring-deadly/</link>
		<comments>http://blog.logicmonitor.com/2010/07/27/is-your-monitoring-deadly/#comments</comments>
		<pubDate>Tue, 27 Jul 2010 20:02:48 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[deadly monitoring]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=130</guid>
		<description><![CDATA[So maybe the consequences of an outage in your infrastructure will not be as calamitous as the BP oil spill in the Gulf, but the effect on your enterprise may feel like it.  Which is why we could all use B.P. as a case study in how not to treat your monitoring.
News reports detail allegations [...]]]></description>
			<content:encoded><![CDATA[<p>So maybe the consequences of an outage in your infrastructure will not be as calamitous as the BP oil spill in the Gulf, but the effect on your enterprise may feel like it.  Which is why we could all use B.P. as a case study in how not to treat your monitoring.</p>
<p><a href="http://articles.latimes.com/2010/jul/23/news/la-oil-spill-disabled-alarms-20100723">News reports</a> detail allegations that some of the alarms on the failed rig were &#8220;inhibited&#8221;, because &#8220;they did not want people to wake up at 3 a.m. due to false alarms&#8221;.  The rig&#8217;s operator responded, &#8220;Repeated false alarms increase risk and decrease rig safety&#8221;.<br />
<span id="more-130"></span><br />
We&#8217;ve said it <a href="http://blog.logicmonitor.com/2009/11/06/top-i-t-datacenter-monitoring-mistakes-part-3-in-a-series/">before</a> and we&#8217;ll probably say it again, but alert overload is dangerous, no matter what the occupation. Even if your field of responsibility is for ensuring the performance and availability of network <a href="http://www.logicmonitor.com/monitoring/networking-equipment/switch-monitoring/">switch monitoring</a>, you need to ensure all alerts are meaningful.</p>
<p>This requires two things:</p>
<ul>
<li><a href="http://blog.logicmonitor.com/2010/06/11/processes-for-data-center-monitoring/">processes</a> that are followed (reviewing alerts, reports on top alert sources, etc)</li>
<li>tools that allow you tune and route your alerts.  (One area I&#8217;d include in this is powerful alert calculations &#8211; it is often helpful to be able to express alerts such as &#8220;Warn if discards is more than 0.5% of traffic &#8211; but only if traffic is greater than 1 Mbps inbound&#8221;.)</li>
</ul>
<p>If your monitoring system can&#8217;t do this kind of alerting, and  easily  let you tune it on group, host or instance level, you&#8217;re either wasting  time dealing with false alerts, or have &#8220;inhibited&#8221; the alarms on your own oil rig.  And we&#8217;ve all seen where that leads&#8230;.</p>
<p><strong>Want to make your monitoring less lethal?  Check out LogicMonitor for  monitoring that automates your monitoring setup, and lets you only get  the alerts that matter.</strong><a href="http://www.logicmonitor.com/about/contact/try-it/"> Free Trial</a></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F07%2F27%2Fis-your-monitoring-deadly%2F&amp;linkname=Is%20your%20network%20monitoring%20software%20deadly%3F"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/07/27/is-your-monitoring-deadly/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>World Cup Soccer Monitoring</title>
		<link>http://blog.logicmonitor.com/2010/07/01/world-cup-soccer-monitoring/</link>
		<comments>http://blog.logicmonitor.com/2010/07/01/world-cup-soccer-monitoring/#comments</comments>
		<pubDate>Fri, 02 Jul 2010 05:05:27 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=127</guid>
		<description><![CDATA[You (or at least I) wouldn&#8217;t think world cup fever would affect the more mundane world of data center monitoring. Unless you work for Twitter, or some other web presence overloaded by unprecedented surges in network traffic due to people checking in on, and commenting about, the World Cup.  Twitter found that having a network [...]]]></description>
			<content:encoded><![CDATA[<p>You (or at least I) wouldn&#8217;t think world cup fever would affect the more mundane world of data center monitoring. Unless you work for Twitter, or some other web presence overloaded by unprecedented surges in network traffic due to people checking in on, and commenting about, the World Cup.  Twitter <a href="http://engineering.twitter.com/2010/06/perfect-stormof-whales.html">found</a> that having a network that &#8220;wasn&#8217;t appropriately being monitored&#8221; supporting two critical components was not a good idea, and led&#8230;to an outage (or several fail whales.)</p>
<p>Twitter is addressing issues, including their monitoring. (As we&#8217;ve noted before, if monitoring did not alert about an incident at the earliest possible time, the incident should never be closed until the monitoring is improved to warn of the issue in advance.)</p>
<p>In our view, the best monitoring approach is to have your monitoring software assume everything should be monitored to production levels, automatically, and not need your Ops team to tell it.</p>
<p>Repel your own flying whales by using <a href="http://www.logicmonitor.com/features/automated-configuration/">data center monitoring</a> automation as much as possible.  Let the humans go play soccer, or blow their vuvuzelas!</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F07%2F01%2Fworld-cup-soccer-monitoring%2F&amp;linkname=World%20Cup%20Soccer%20Monitoring"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/07/01/world-cup-soccer-monitoring/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Processes for Data Center Monitoring</title>
		<link>http://blog.logicmonitor.com/2010/06/11/processes-for-data-center-monitoring/</link>
		<comments>http://blog.logicmonitor.com/2010/06/11/processes-for-data-center-monitoring/#comments</comments>
		<pubDate>Fri, 11 Jun 2010 18:03:42 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=117</guid>
		<description><![CDATA[

Even with the best datacenter monitoring system in place, whether an implementation succeeds depends largely on the processes adopted around monitoring.
The ideal is a monitoring system that is comprehensive (alerting on all conditions that need attention) and noise free (NOT alerting to any conditions that do not need attention.)  A noisy alert system is almost as [...]]]></description>
			<content:encoded><![CDATA[<div>
<div>
<p>Even with the <a href="http://www.logicmonitor.com/">best datacenter monitoring</a> system in place, whether an implementation succeeds depends largely on the processes adopted around monitoring.</p>
<p>The ideal is a monitoring system that is comprehensive (alerting on all conditions that need attention) and noise free (NOT alerting to any conditions that do not need attention.)  A noisy alert system is almost as bad as no monitoring &#8211; it will train people to ignore their alerts.</p>
<p>These goals are diametrically opposed, and further complicated by the fact that different users of the monitoring have different criteria for what needs attention or is noise.  However, with appropriate alert escalations and routing, and some good processes in place, you can approach this state.</p>
<p>What processes can help?</p>
<h3>Use Scheduled Down Time</h3>
<p>This is probably the most important process to enforce.  If someone is going to be working on a system, schedule downtime!  Prevent the alerts going out in the first place. If you have regular maintenance windows for sets of hosts, set up the scheduled downtime to recur automatically.  If there are processes that will trigger alerts periodically (such as CPU alerts triggered by disk scrubs on NetApp filers), schedule recurring downtime for just that alert.</p>
<h3>Get Rid of unneeded alerts</h3>
<p>For every alert received, an assessment should be made during the initial deployment of the monitoring &#8211; was the alert needed?  If so, acknowledge the alert and go fix the problem. If not, how general can the removal of the alert be made?  If the alert is regarding buffer discards on a switch, but only on the port where a 1Gbps network links to a 100Mbps uplink, discards are normal and expected, so the threshold should be adjusted just for this one specific port.  If the alert is regarding swap space used on a QA system, and QA regularly engages in stress tests that would trigger this, you should disable the alert or adjust the threshold on the QA group. (For resources shared on systems, such as storage array volumes, you should add a filter to disable the QA alerts, or set different thresholds.)</p>
<h3>Send alerts to the right place</h3>
<p>For every valid alert received, make sure it&#8217;s going to the all the right people, only the right people, by the right methods, and is escalating at appropriate periods. (An example of inappropriate escalation would be sending warning alerts on production systems via email, instead of pager &#8211; but escalate them every 5 minutes.  If three warnings occur at 1.00 am, and no one checks the email until 8.00 am, everyone will have 250 email alerts cluttering their inbox.)  The set of people that want to know about network retransmissions are probably different from the set that wants to know about a commerce site being completely down.</p>
<h3>Review</h3>
<p>We recommend a weekly review, especially during the initial roll out of a monitoring system, to ensure the above points:</p>
<ul>
<li>every alert was valid.<img class="alignright" title="Clue Stick" src="http://upload.wikimedia.org/wikipedia/commons/1/10/2_By_4_Clue_Stick.jpg" alt="Clue Stick" width="200" height="190" /> If not, consensus is arrived at as to how and at what level the alert needs to be tuned. If alerts were the result of scheduled staff actions, but no one told the monitoring about the scheduled down time, liberal use of the clue stick (or stronger action) is recommended.</li>
<li>every alert was delivered to all, and only, the correct recipients.</li>
<li>the escalations for each alert were valid and appropriate.</li>
<li>don&#8217;t close any incident that was not alerted on,  until the alerts to detect it have been created.</li>
</ul>
</div>
<p>Putting processes will likely be the key to your monitoring deployment succeeding or failing, and is another area that a <a href="http://www.logicmonitor.com/">SaaS monitoring service</a> is likely to be superior to a premise based system.  A SaaS service is invested in your continual use and satisfaction of monitoring &#8211; they don&#8217;t get the money up front.  At LogicMonitor, at least, we help our customers with the whole implementation, including processes.</p>
</div>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F06%2F11%2Fprocesses-for-data-center-monitoring%2F&amp;linkname=Processes%20for%20Data%20Center%20Monitoring"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/06/11/processes-for-data-center-monitoring/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>LogicMonitor&#8217;s Hierarchy of data center Monitoring</title>
		<link>http://blog.logicmonitor.com/2010/05/15/logicmonitors-hierarchy-of-monitoring/</link>
		<comments>http://blog.logicmonitor.com/2010/05/15/logicmonitors-hierarchy-of-monitoring/#comments</comments>
		<pubDate>Sat, 15 May 2010 14:18:29 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=114</guid>
		<description><![CDATA[So we sometimes get asked about our tagline, Monitoring that Matters.
Doesn&#8217;t all monitoring matter?  Well, yes it does.  But to badly paraphrase George Orwell, all monitoring matters, but some monitorings are more mattering than others.
What makes monitoring matter?  It&#8217;s whatever monitoring reduces your outages, reduces issues of unacceptable performance, or reduces time to resolution of [...]]]></description>
			<content:encoded><![CDATA[<p>So we sometimes get asked about our tagline, <em>Monitoring that Matters.</em></p>
<p>Doesn&#8217;t all monitoring matter?  Well, yes it does.  But to badly paraphrase George Orwell, all monitoring matters, but some monitorings are more mattering than others.</p>
<p>What makes monitoring matter?  It&#8217;s whatever monitoring <em>reduces your outages, reduces issues of unacceptable performance, or reduces time to resolution of these issues and outages.<br />
</em></p>
<p>We like to think of monitoring as similar to Maslow&#8217;s hierarchy &#8211; there are base levels that must be met first, but the higher up you go, the better.</p>
<ol>
<li>The base level of monitoring is &#8220;is my host/site alive?&#8221;  Everyone needs this (but not everyone has it.) That gives you reactive monitoring.</li>
<li>&#8220;Is my host going to keep working in the near term?&#8221;  This means alerts about disks filling up, or swap space and memory used.  This helps with reducing some outages.</li>
<li>&#8220;How is my host performing?&#8221; Things like CPU load, and rate of swapping.  Alerts on these metrics warn of impending performance issues, that can be addressed.</li>
<li>&#8220;How is my application performing?&#8221;  A measurement of representational application performance. This may be things such as database transaction time, time for a web site to process a request, or even, for a storage array, latency of write requests. It&#8217;s really a more fundamental level of monitoring than level 3 &#8211; an alert about CPU load, in level 3, may not indicate anything amiss &#8211; in the case of <a href="http://www.logicmonitor.com/monitoring/monitoring-netapp-filers/">NetApp monitoring</a>, it could be a weekly raid scrub, and request latency, which is what really matters, is not affected at all. However, we rank it higher than level 3, simply as it&#8217;s easier (and more common) to monitor general purpose metrics (such as CPU) than application specific performance metrics (such as database transaction times.)</li>
<li>&#8220;Why is my application performing as it is?&#8221; This is where monitoring really starts to matter.  The more data you collect and trend, the better your monitoring system will be at helping you quickly identify and resolve issues. (Of course, the more data you graph, the better your monitoring system need to be at presenting and organizing that data in a meaningful way. It&#8217;s easy to show 4 graphs of a system, but when you may have 120 different graphs for one system, and you need to quickly scan them to see correlations &#8211; the UI challenges get more interesting.) In a database, an alert about response time is clearly significant, but doesn&#8217;t tell you what the cause is.  But if the monitoring system can quickly show you your database&#8217;s sequential table scans jumped after the last software release, or that latency of a storage array volume is high because another volume sharing the same physical disks is experiencing an abnormal rate of IO operations, you will be able to resolve your issues <strong>much</strong> quicker.</li>
<li>&#8220;How do I fix my issue?&#8221; This is the peak level, where the monitoring system not only shows all the data, but presents directions on how to resolve any issues.  LogicMonitor can do this in some cases (for example, using data from the number of select operations, query cache hits and cache prunes due to low memory to recommend enlarging, reducing or disabling the MySQL query cache).  But this is a much harder issue to generalize, especially across systems that interact. But we&#8217;re always improving.</li>
</ol>
<p>How far up the hierarchy is your monitoring?  If you don&#8217;t have a wealth of data about all aspects of your system, that you can trend in real time and look at historically, it&#8217;s almost certain that your outages and performance issues are both too frequent and too long.</p>
<p><em><br />
</em></p>
<p><em><br />
</em></p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F05%2F15%2Flogicmonitors-hierarchy-of-monitoring%2F&amp;linkname=LogicMonitor%26%238217%3Bs%20Hierarchy%20of%20data%20center%20Monitoring"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/05/15/logicmonitors-hierarchy-of-monitoring/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Are free monitoring tools like Nagios really free?</title>
		<link>http://blog.logicmonitor.com/2010/05/02/are-free-monitoring-tools-like-nagios-really-free/</link>
		<comments>http://blog.logicmonitor.com/2010/05/02/are-free-monitoring-tools-like-nagios-really-free/#comments</comments>
		<pubDate>Sun, 02 May 2010 23:29:10 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=111</guid>
		<description><![CDATA[IT people by nature are supposed to be gurus. They’re supposed to be able to build things from scratch. This expectation certainly applies to data center monitoring, where a common practice is to rely on open source monitoring tools such as Nagios. But when you consider the value of your time, these free tools can [...]]]></description>
			<content:encoded><![CDATA[<p>IT people by nature are supposed to be gurus. They’re supposed to be able to build things from scratch. This expectation certainly applies to data center monitoring, where a common practice is to rely on open source monitoring tools such as Nagios. But when you consider the value of your time, these free tools can quickly wind up being far more costly than commercial tools.  For instance, we did a survey and found that some system admins had spent over 100 hours to get their open source monitoring solution to do what they wanted. Further, there was ongoing work to try to keep the system up to date with frequent changes in their datacenter, and even then they only had, for the most part, coarse level monitoring (for example, monitoring only the CPU load of a load balancer, instead of monitoring the state of all the hundreds of VIPs on the load balancer.)</p>
<p>When the only alternatives were costly enterprise-class monitoring solutions, sweating it out with open source was understandable. But now that there are affordable tools that automate configuration and give you everything you need in 30 minutes, insisting on building your own doesn&#8217;t seem wise (especially in this era of understaffed data centers.)  At the root of this DIY mentality is pride. With so many open source options available, Techies probably feel some sense of shame or embarrassment going to an IT director and asking for tools that cost money.</p>
<p>I&#8217;d suggest a better source of pride is being able to spend time on tasks that add value to the enterprise &#8211; writing Puppet scripts that automate machine and software deployments, and so greatly reduce the time to spin up machines; investigate cloud usage options; correlate resource expenses with revenue per business unit.  There are a lot of things that should be done in any enterprise, that are not because of lack of time.  A good systems administrator&#8217;s time is very valuable &#8211; much more valuable than going through a MIB to figure out which item is important to monitor.</p>
<p>And no matter how good a systems administrator you are, monitoring is not going to be your top priority (nor should it be).  You&#8217;ll get monitoring going &#8220;good enough&#8221; &#8211; but there will be lots of cases that it failed to alert on, when a comprehensive monitoring system would have.  Then after every outage, you&#8217;ll have to go back and extend the monitoring, adding in metrics that could have helped predict the specific case.</p>
<p>So given the cost of your time; the more in depth monitoring that you get immediately with LogicMonitor (a typical Nagios implementation may monitor 10 metrics on a linux host; a typical LogicMonitor deployment will monitor over 100); and the opportunity cost of the things you could be doing to add value with your time, if you weren&#8217;t configuring monitoring, then why not use an automated monitoring tool such as LogicMonitor that makes you a better system administrator, and doesn’t require a Fortune 500 budget to implement?</p>
<p>If you’d rather skip the tedious work, but want the peace of mind knowing that your infrastructure is properly monitored, and that you will be alerted of any issues early, it’s perfectly okay to go the automation route.  You’ll feel a sense of satisfaction in preventing an outage, whether you wrote the code or not.  And your CFO may even thank you for spending the money.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F05%2F02%2Fare-free-monitoring-tools-like-nagios-really-free%2F&amp;linkname=Are%20free%20monitoring%20tools%20like%20Nagios%20really%20free%3F"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/05/02/are-free-monitoring-tools-like-nagios-really-free/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The MySQL conference and Open Source</title>
		<link>http://blog.logicmonitor.com/2010/04/15/the-mysql-conference-and-open-source/</link>
		<comments>http://blog.logicmonitor.com/2010/04/15/the-mysql-conference-and-open-source/#comments</comments>
		<pubDate>Fri, 16 Apr 2010 03:04:14 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[MySQL]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=109</guid>
		<description><![CDATA[Some of the LogicMonitor team were just up at the MySQL 2010 conference in San Jose.  A good event &#8211; it was my first time there. LogicMonitor was being seen for the first time by a lot of DBAs.   We generally got a great response &#8211; especially once it was realised we [...]]]></description>
			<content:encoded><![CDATA[<p>Some of the LogicMonitor team were just up at the MySQL 2010 conference in San Jose.  A good event &#8211; it was my first time there. LogicMonitor was being seen for the first time by a lot of DBAs.   We generally got a great response &#8211; especially once it was realised we bring not only powerful <a href="http://www.logicmonitor.com/monitoring/database-monitoring/mysql-monitoring-and-optimization/">MySQL monitoring</a>, and automate any set up &#8211; but that we apply the same powerful, configuration free monitoring to the operating system, hardware, network, back end storage, etc, etc. (A surprising number of people seemed to expect that anything powerful must be a single product solution.)</p>
<p>One common observation that came up was that the conference was much smaller than previous years, both in attendees and exhibitors.  Some people lamented the new ownership of MySQL by Oracle, and the ensuing &#8220;corporatization&#8221;. I didn&#8217;t really see much evidence of that in the conference sessions or topics, however.  From the other exhibitors, however, the feeling was the set of attendees was smaller, but a better qualified set. (From the point of view of people trying to sell them .. something, but also from attendees who wanted to talk to other attendees that had tackled real problems.)  Most attendees did seem to be from not-seat-of-pants enterprises, but real companies with revenue models, and even revenue.  They also had real problems to solve, and were interested in products/solutions that addressed those problems, even if they cost money, as they realised that the costs of not addressing them were generally much higher. (In revenue costs, in staff time, in diverting resources from strategic development to solving already solved problems&#8230;)</p>
<p>There was still quite a lot of attendees who&#8217;s first question to us at the LogicMonitor booth was &#8220;Is it open source&#8221;?  But the fact that we are not was not a disqualification from them spending time with us &#8211; it just meant that they wanted to know what value we added over open source solutions, that would make it worth giving us money.  Something we could easily answer to (almost) everyone&#8217;s satisfaction, after we explained the time savings, the more comprehensive monitoring, the cross platform monitoring, the automation, the auto-aggregation and classification features, etc.</p>
<p>Perhaps MySQL being purchased by Sun, then Oracle, has to some extent meant that the young innovators that gathered around it have gone on to more Open pastures, but it may also mean that the solid base of users remaining and working with it will solve more real world problems, generate more revenue, and also be willing to spend more money to solve their own problems. Which may just lure back more of the wild innovators, once they see the opportunities are actually better than they were &#8211; cool things can still be done, but now people will pay them &#8211; which makes the doing of cool things sustainable, not just done for love. Nothing wrong with doing cool things for love AND money.</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F04%2F15%2Fthe-mysql-conference-and-open-source%2F&amp;linkname=The%20MySQL%20conference%20and%20Open%20Source"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/04/15/the-mysql-conference-and-open-source/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is SaaS Security the best solution for your business?</title>
		<link>http://blog.logicmonitor.com/2010/02/26/is-saas-security-stopping-you-from-adopting-the-best-solution-for-your-business/</link>
		<comments>http://blog.logicmonitor.com/2010/02/26/is-saas-security-stopping-you-from-adopting-the-best-solution-for-your-business/#comments</comments>
		<pubDate>Fri, 26 Feb 2010 18:08:56 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[security]]></category>

		<guid isPermaLink="false">http://blog.logicmonitor.com/?p=93</guid>
		<description><![CDATA[Having worked in SaaS companies for a long time (going back to when they were called ASPs), I&#8217;ve heard a lot of companies not adopt SaaS solutions due to &#8220;security concerns&#8221;.  This attitude has generated a quite a few blog posts recently, so thought I&#8217;d add my 2 cents.
The people involved in SaaS think security [...]]]></description>
			<content:encoded><![CDATA[<p>Having worked in SaaS companies for a long time (going back to when they were called ASPs), I&#8217;ve heard a lot of companies not adopt SaaS solutions due to &#8220;security concerns&#8221;.  This attitude has generated a quite a few blog posts recently, so thought I&#8217;d add my 2 cents.</p>
<p>The people involved in SaaS think security is often <em>better</em> in SaaS systems that premise based systems.<br />
Justin Pirie at &#8220;The Week in SaaS&#8221; (an essential blog for those in SaaS, I think), put it <a href="http://www.justinpirie.com/2010/02/twis15-salesforce-on-saas-network-effects-and-ecosystem/">this way</a>:</p>
<div class="quote">something struck me- 46% of people surveyed were not moving to the cloud because of security.<br />
This is bonkers! Just because it’s behind your firewall does not make it secure.</div>
<p>Reuvan Cohen at <a href="http://www.elasticvapor.com/2010/02/why-cloud-computing-is-more-secure.html">Elastic Vapor</a> summarizes his view:</p>
<div class="quote">the new reality is that cloud computing is in a lot of ways more secure simply because people are actually spending time looking at the potential problems beforehand. </div>
<p>So what&#8217;s my opinion?  Having managed IT operations for a variety of companies, and worked in SaaS companies, I think I can share a realistic view.<br />
There are (simplistically) two aspects to application security &#8211; physical security and application level security.</p>
<p>If you are a small company, and have premise based applications, you probably don&#8217;t care much about application level security. The company is small, and everyone will be trusted to some degree.  The fact that the application is behind a firewall, with no access from outside, does provide fairly good security.  The SaaS advantage here is that small companies do not usually have physically secure premise based servers.  They are typically in a small server room (or closet), without much in the way of alarms, 24 hour guards, and all the other touted features of datacenters.    And if you can physically access a server, you can get to the data on the server.  As a friend of mine, the head of sales at a <a href="http://www.appfolio.com/">SaaS property management software</a> company, puts it &#8220;No Fortune 500 company would consider putting their servers in your SMB server room.  Yet they do have them in the same datacenter as our SaaS servers.&#8221;</p>
<p>As a company grows, they will typically get on top of physical security, but then application security raises it&#8217;s head, as security sensitive applications will now be restricted to a subset of employees.  Many premise based applications (especially open source ones or internally developed ones, it seems) are written without any access control designed in.  And once a company reaches any size, the premise based application will need to be accessed by people outside the firewall (remote offices, teleworkers, etc). How is that access to be granted securely, without undermining the whole security premise of &#8220;Well, it doesn&#8217;t matter if it&#8217;s not terribly secure, as no one can access it.&#8221;</p>
<p>Yes, you can put reverse proxy firewalls or SSL VPNs to provide some sort of remote access, but now the &#8220;simple&#8221; choice of premise based software for security is getting more and more complicated (and expensive).</p>
<p>So I think the consensus above is correct &#8211; in a company of any size, you are more likely to have less security issues and expense with a SaaS solution than premise based software.<br />
(FYI &#8211; LogicMonitor has its servers in Equinix datacenters.)</p>
<p>What are your thoughts?</p>
<a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save?linkurl=http%3A%2F%2Fblog.logicmonitor.com%2F2010%2F02%2F26%2Fis-saas-security-stopping-you-from-adopting-the-best-solution-for-your-business%2F&amp;linkname=Is%20SaaS%20Security%20the%20best%20solution%20for%20your%20business%3F"><img src="http://blog.logicmonitor.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a>]]></content:encoded>
			<wfw:commentRss>http://blog.logicmonitor.com/2010/02/26/is-saas-security-stopping-you-from-adopting-the-best-solution-for-your-business/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
