{"id":9939,"date":"2021-01-27T14:52:50","date_gmt":"2021-01-27T04:52:50","guid":{"rendered":"https:\/\/nsrd.info\/blog\/?p=9939"},"modified":"2021-01-27T14:52:54","modified_gmt":"2021-01-27T04:52:54","slug":"why-data-classification-in-backup-is-pointless","status":"publish","type":"post","link":"https:\/\/nsrd.info\/blog\/2021\/01\/27\/why-data-classification-in-backup-is-pointless\/","title":{"rendered":"Why Data Classification in Backup is Pointless"},"content":{"rendered":"\n<p>You know that saying, &#8220;Data is the new oil&#8221;? If you work in IT and haven&#8217;t heard of it, I&#8217;d be surprised. In fact, I&#8217;ve cited it a few times myself, including in the second edition of <strong><a href=\"https:\/\/www.routledge.com\/Data-Protection-Ensuring-Data-Availability\/Guise\/p\/book\/9780367256777\" target=\"_blank\" rel=\"noreferrer noopener\">Data Protection: Ensuring Data Availability<\/a><\/strong>. The phrase caught on as much as anything because it strikes a chord \u2014&nbsp;for so long, oil was the driving force of the economy, and it seemed reasonable to assume data had supplanted it.<\/p>\n\n\n\n<p>But, here&#8217;s the rub: data isn&#8217;t really the new oil.<\/p>\n\n\n\n<p>You see, data is irrelevant.<\/p>\n\n\n\n<p>Yes, I&#8217;m going to say that again: <strong><em>data is irrelevant<\/em><\/strong>.<\/p>\n\n\n\n<p>That is, data <strong>without context<\/strong> is irrelevant.<\/p>\n\n\n\n<p>Talking about data as being critical to the business is akin to talking about a DNS server being a business outcome. Unless your business sells DNS services, a DNS server is <em>not<\/em> a business outcome. Even if your business <em>does<\/em> sell DNS services, a DNS server is only one aspect of a business outcome, because there&#8217;s always more wrapped up in a &#8216;product&#8217; than a single server, or even a clustered server.<\/p>\n\n\n\n<p>Going all the way back to <strong><a href=\"https:\/\/www.routledge.com\/Enterprise-Systems-Backup-and-Recovery-A-Corporate-Insurance-Policy\/Guise\/p\/book\/9781420076394\" target=\"_blank\" rel=\"noreferrer noopener\">Enterprise Systems Backup and Recovery<\/a><\/strong>, I&#8217;ve made it clear that system dependency mappings (essential for disaster recovery and business continuity planning) have to map the dependencies all the way through to the business function.<\/p>\n\n\n\n<p>It may be hard for many of us in IT to accept, but the business doesn&#8217;t give a damn about whether the Oracle database is clustered. The business doesn&#8217;t give a damn about whether the NAS platform uses RAID-5 or RAID-6. And it certainly doesn&#8217;t give a damn about whether systems are patched or not.<\/p>\n\n\n\n<p>If you work in IT, particularly infrastructure or systems administration, and tried to explain to someone (not in IT) at a party what you do, you know this in your soul. The blank look of despair on their face as they try to extricate themselves from the &#8220;What do you do for work?&#8221; question may well be seared into your memory.<\/p>\n\n\n\n<p>People who don&#8217;t work in IT, particularly leaders in business care about outcomes. Outcomes for the business. Can revenue <strong>for the business <\/strong>be increased? Can environmental footprint <strong>for the business <\/strong>be decreased? Likewise, they&#8217;re not really concerned with whether the DNS service or database cluster is recoverable. They want to know whether the <em>customer billing<\/em> business function is recoverable. They want to know whether the <em>inventory tracking<\/em> function is available whenever it&#8217;s required. They want to know whether the <em>customer ordering platform<\/em> is always operational.<\/p>\n\n\n\n<p>What powers those functions? Most of them don&#8217;t care. To think otherwise is to think that every car owner is intrinsically concerned with the thermal and acoustic characteristics of their muffler.<\/p>\n\n\n\n<p>And because of that, data classification at the time of backup that&#8217;s contextually unaware is pointless. The <em>context<\/em> of course being, &#8220;what&#8217;s the business function of this data?&#8221;<\/p>\n\n\n\n<p>If you think a backup environment can intelligently ascertain the <strong><em>business function<\/em><\/strong> of your data based on its file extension, metadata or even actual content in &#8216;real-time&#8217; as data is being read, I have a bridge I&#8217;d like to sell you.<\/p>\n\n\n\n<p>I&#8217;ve long maintained that data classification is a primary-storage process \u2014&nbsp;but it&#8217;s more than that. It&#8217;s a human-centric process still, and will remain so for some time to come. Sure, there&#8217;s software that&#8217;ll tell you whether you have some PCI data laying around. And sure, there&#8217;s software that can monitor your network and tell you which systems are talking to what servers and storage platforms, but all of this information requires human intervention, and human understanding.<\/p>\n\n\n\n<p>Until you have AI-level software that can monitor your environment and <em>tell you what your business functions are<\/em>, and then map all of those functions with full dependency analysis back to individual systems, any supposed in-backup classification beyond simple client tagging is a placebo. Anyone who tries to tell you otherwise is trying to sell you a bridge.<\/p>\n\n\n\n<p>There are some things you currently can&#8217;t automate your way out of \u2014 at least not with the current level of technology. This is one of these things. You might even say that this is a good example of the difference between <em>things that can be automated<\/em> and <em>autonomous systems<\/em>. Automatic data classification requires both the former and the latter, and we don&#8217;t have the latter yet.<\/p>\n\n\n\n<p>That&#8217;s why it&#8217;s essential data classification is performed where the data is stored, not when the data is copied, and with humans involved. To be truly useful in guiding next-step functionality, that classification needs to be by business function, not the file extension. By classifying data when it&#8217;s originally handled or stored, appropriate backup policies can be developed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>You know that saying, &#8220;Data is the new oil&#8221;? If you work in IT and haven&#8217;t heard of it, I&#8217;d&hellip;<\/p>\n","protected":false},"author":1,"featured_media":9940,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[5,1133],"tags":[132,1594,1151],"class_list":["post-9939","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-backup-theory","category-best-practice","tag-automation","tag-autonomous","tag-data-classification"],"aioseo_notices":[],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/01\/bigStock-Bridge-Sale.jpg","jetpack_shortlink":"https:\/\/wp.me\/pKpIN-2Aj","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/9939","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/comments?post=9939"}],"version-history":[{"count":3,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/9939\/revisions"}],"predecessor-version":[{"id":9943,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/9939\/revisions\/9943"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/media\/9940"}],"wp:attachment":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/media?parent=9939"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/categories?post=9939"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/tags?post=9939"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}