{"id":3012,"date":"2011-04-12T08:15:51","date_gmt":"2011-04-11T22:15:51","guid":{"rendered":"http:\/\/nsrd.info\/blog\/?p=3012"},"modified":"2018-12-11T18:12:04","modified_gmt":"2018-12-11T08:12:04","slug":"technology-is-rarely-the-issue","status":"publish","type":"post","link":"https:\/\/nsrd.info\/blog\/2011\/04\/12\/technology-is-rarely-the-issue\/","title":{"rendered":"Technology is rarely the issue"},"content":{"rendered":"<p>One of the stories I sometimes hear from companies is that some technology X doesn&#8217;t work in their environment because X sucks, or X is broken, or X &#8230; well, you get the picture.<\/p>\n<p>Years ago, when I first got into backup, the the main reasons I had to do recovery were due to system or hardware failures. Hard drive reliability was IMHO much lower, operating systems were frequently less stable, etc. Reliability was about getting to 99% availability, let alone 99.9% or anything grandiose like that.<\/p>\n<p>These days, hardware\/OS\/app failure is, I&#8217;d suggest, one of the least likely reasons for a recovery being conducted in most organisations. Instead, it&#8217;s mainly related to soft issues \u2013 user error, audits, compliance checking, etc.<\/p>\n<p>There&#8217;s a point here, and I&#8217;m almost ready to make it.<\/p>\n<p>Back when I first started with backup, I&#8217;d have agreed that technology could be firmly blamed for a lot of errors. These days? Rarely \u2013 even when I blame it.<\/p>\n<p>I periodically go on a rant about just how painful Linux is sometimes, but at the core I also admit that it&#8217;s a lack of training and time on my part \u2013 I&#8217;ve not made learning the ins and outs of Linux firewalls a field of study in the past, so now that I&#8217;m having to construct them by hand for a personal project it&#8217;s about as fun as tasering myself in the genitals. Technology is <em>partly<\/em> the problem \u2013 as is always the case with Linux, it&#8217;s designed for programmers and developers to manipulate, not for end users, or people like me who have concentrated on other things and <em>just want the damn thing to work<\/em>.<\/p>\n<p>Ahem, where was I?<\/p>\n<p>The simple fact is that we often blame technology because it&#8217;s easy. It&#8217;s like kids picking on the &#8220;easy target&#8221; at school with bullying; we bully technology and blame it for all our woes and issues because well, it doesn&#8217;t really fight back. (Hopefully we&#8217;ll get out of this habit before the singularity&#8230;)<\/p>\n<p>As techos though, let&#8217;s be honest. The technology is rarely the issue. Or to be more accurate, if there&#8217;s an issue, technology is the tip of the iceberg \u2013 the visible tip. And using the iceberg analogy, you know I mean that technology is rarely going to be the majority of the issue.<\/p>\n<p>The &#8216;issue&#8217; iceberg in IT looks like this:<\/p>\n<p style=\"text-align: center;\"><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/What-goes-wrong.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-3015\" title=\"The issue iceberg\" src=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/What-goes-wrong.jpg\" alt=\"The issue iceberg\" width=\"390\" height=\"304\" srcset=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/What-goes-wrong.jpg 390w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/What-goes-wrong-300x233.jpg 300w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/What-goes-wrong-384x300.jpg 384w\" sizes=\"auto, (max-width: 390px) 100vw, 390px\" \/><\/a><\/p>\n<p>It&#8217;s probably best here that I stop and differentiate between issues and problems. A problem to me, is an isolated or an atomic failure \u2013 like, a faulty tape drive, or a failed hard drive. They&#8217;re clearly technology related, but they&#8217;re not really <em>issues<\/em>. An issue is a deeper, systemic and compound failure. E.g., something like &#8220;on any one day, 30% of my backups fail&#8221;, or &#8220;Performance across all systems is generally 50% worse at end of month&#8221;, etc.<\/p>\n<p>When technology gets blamed in those instances, I&#8217;m reminded of someone who say, never has their car serviced, then when it eventually breaks down complains that the car was a lemon. Was it that the car failed the person, or more accurately that the person failed the car?<\/p>\n<p>As I said, it&#8217;s easy to blame the thing that can&#8217;t defend itself.<\/p>\n<p>In environments with ongoing, long-term issues, there reaches a point where you have to sit back and ponder \u2013 is the technology causing the issue, or is the environment causing the technology to have an issue?<\/p>\n<p>The inevitable and hard truth is that in some cases, it&#8217;s the latter, not the former.<\/p>\n<p>Let&#8217;s consider a basic scenario &#8211; the &#8220;on any given day 30% of our backups fail&#8221; scenario. So, does that mean that on any given day 30% of servers crash and reboot during the backup? Or does the backup software agent crash on 30% of servers when a backup is attempted? Maybe, in the most exceptional of circumstances, this may be the case.<\/p>\n<p>In reality though? In reality we have to start looking at the rest of that iceberg:<\/p>\n<p><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/rest-of-iceberg.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-3018\" title=\"Rest of the iceberg\" src=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/rest-of-iceberg.jpg\" alt=\"Rest of the iceberg\" width=\"390\" height=\"305\" srcset=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/rest-of-iceberg.jpg 390w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/rest-of-iceberg-300x234.jpg 300w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2011\/04\/rest-of-iceberg-383x300.jpg 383w\" sizes=\"auto, (max-width: 390px) 100vw, 390px\" \/><\/a><\/p>\n<p>High systemic failure rates, if attributed to the deployed technology, should result in a law suit. How often do we see that happening?<\/p>\n<p>&gt;queue the cicadas&lt;<\/p>\n<p>That&#8217;s right.<\/p>\n<p>When there are systemic failure rates, a business must, eventually, turn to face the truth that they have to review their:<\/p>\n<ul>\n<li><strong>Policies<\/strong> \u2013 Are there any governing rules to the company which are contributing to the problem? For instance, does the company require the technology to be adapted in such a way that it wasn&#8217;t designed for? This can be hard and real policies, or they can be implicitly allowed policies \u2013 such as empire building.<\/li>\n<li><strong>Processes<\/strong> \u2013 Are there operating methods which are triggering the issue? Imagine a business for instance where change control has become such a consuming process that backup failures are repeatedly allowed to occur because a change window isn&#8217;t available. Is that the fault of the backup technology?<\/li>\n<li><strong>People and Education<\/strong> \u2013 I&#8217;m not suggesting that staff at sites are incompetent. Far from it. Incompetent is such a harsh, unpleasant word that in the 15+ years I&#8217;ve been consulting, it&#8217;s been a <em>very<\/em> rarely used word. Education though is a factor. No, I&#8217;m not picking on people without tertiary skills, but <em>training<\/em> is a factor. For example, managers who have no day to day technical experience may decide that some technology, based on a half hour vendor pitch, is easy enough that staff won&#8217;t need training in it. If said staff then go on to say, accidentally delete a LUN from a production server, because they weren&#8217;t trained , how is that the fault of the SAN?<\/li>\n<\/ul>\n<p>Navel gazing, introspection, call it what you will, it&#8217;s not always a pleasant task. It&#8217;s about objectively looking at how we&#8217;re doing things, and ask, &#8220;are <em>we<\/em> partly to blame?&#8221;<\/p>\n<p>Yet, if you <em>aren&#8217;t<\/em> prepared to do this, you&#8217;re doomed (yes, <em>doomed<\/em>) to keep making the same mistake again, and again, and again. The pile of failed technology builds up, the quest for the silver bullet becomes more frenetic, and the chances of a major failure happening increase. In the worst scenarios, it can become decidedly toxic.<\/p>\n<p>But it doesn&#8217;t need to be. Evaluating your processes, your policies and your people (particularly the training of your people) can be \u2013 well, cathartic. And the benefits to the business, in terms of literal cost savings and efficiencies, ensures that the introspection is well worth it.<\/p>\n<p>As a consultant, you might assume that it&#8217;s my job to ensure that customers buy the best and the most expensive technology out there that I can sell them. That&#8217;s a cynical attitude that comes from a few shoddy operators. As a consultant, my job is to partner with you&nbsp;and your company and help you achieve your best. (If you think I&#8217;m just blowing smoke up your proverbial, check my &#8220;<a title=\"13 traits of a great consultant\" href=\"http:\/\/unsane.info\/wordpress\/?p=756\" target=\"_blank\">13 traits of a great consultant<\/a>&#8221; article.)<\/p>\n<p>Sometimes that means highlighting that there are issues, not problems, and those issues require a deeper fix than plugging in a new piece of technology.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>One of the stories I sometimes hear from companies is that some technology X doesn&#8217;t work in their environment because&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":false,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[5,12,13],"tags":[353,502,749,758,762,989],"class_list":["post-3012","post","type-post","status-publish","format-standard","hentry","category-backup-theory","category-general-technology","category-general-thoughts","tag-education","tag-issue","tag-policy","tag-problem","tag-process","tag-technology"],"aioseo_notices":[],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_shortlink":"https:\/\/wp.me\/pKpIN-MA","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/3012","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/comments?post=3012"}],"version-history":[{"count":1,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/3012\/revisions"}],"predecessor-version":[{"id":7516,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/3012\/revisions\/7516"}],"wp:attachment":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/media?parent=3012"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/categories?post=3012"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/tags?post=3012"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}