{"id":10717,"date":"2021-12-11T13:10:10","date_gmt":"2021-12-11T03:10:10","guid":{"rendered":"https:\/\/nsrd.info\/blog\/?p=10717"},"modified":"2021-12-11T13:10:12","modified_gmt":"2021-12-11T03:10:12","slug":"networker-dedupe-stats-redux","status":"publish","type":"post","link":"https:\/\/nsrd.info\/blog\/2021\/12\/11\/networker-dedupe-stats-redux\/","title":{"rendered":"NetWorker Dedupe Stats, Redux"},"content":{"rendered":"\n<h2 class=\"wp-block-heading\">Where we&#8217;re at<\/h2>\n\n\n\n<p>A few weeks ago I posted a (lengthy) Perl script (and explanation) for running deduplication analysis against NetWorker savesets written to Data Domain devices.<\/p>\n\n\n\n<p>Since then I&#8217;ve been tinkering with the script and I&#8217;ve made some modifications &#8212; and made it easier for you to use, even if you don&#8217;t have a Perl environment running on your machine.<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1723\" height=\"1090\" src=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12.png\" alt=\"\" class=\"wp-image-10718\" srcset=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12.png 1723w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12-300x190.png 300w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12-1024x648.png 1024w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12-768x486.png 768w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12-1536x972.png 1536w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_v12-1600x1012.png 1600w\" sizes=\"auto, (max-width: 1723px) 100vw, 1723px\" \/><\/a><figcaption><strong>Update NetWorker Deduplication Analysis Usage<\/strong><\/figcaption><\/figure>\n\n\n\n<p>My original script irked me because there was no mapping between volume ID and volume names, which I&#8217;ve addressed in this version. If you run it on the NetWorker server, it will automatically do the mapping for you. If you&#8217;re running it against gathered data, you can also grab output from &#8220;mminfo -mv -xc,&#8221; from the NetWorker server and point the utility at that. With this gathered, you&#8217;ll get volume names associated with all the statistics &#8211; such as:<\/p>\n\n\n\n<figure class=\"wp-block-image size-full\"><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames.png\"><img loading=\"lazy\" decoding=\"async\" width=\"1556\" height=\"979\" src=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames.png\" alt=\"\" class=\"wp-image-10719\" srcset=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames.png 1556w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames-300x189.png 300w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames-1024x644.png 1024w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames-768x483.png 768w, https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe_analysis_volnames-1536x966.png 1536w\" sizes=\"auto, (max-width: 1556px) 100vw, 1556px\" \/><\/a><figcaption><strong>Deduplication Statistics &#8211; Now with Volume Name Goodness<\/strong><\/figcaption><\/figure>\n\n\n\n<p>Finally, as you may have spotted from the command line usage in the first graphic and intro &#8212; the utility is now compiled &#8211; and for Windows and Linux. So you can just download it and run against your environment. And remember &#8212; there&#8217;s no need to run it <em>on<\/em> the NetWorker server; you can run whatever mminfo query you want so long as you generate the output with the -S option, transfer it to another machine, and run it there.<\/p>\n\n\n\n<p>If you don&#8217;t want to run it on the NetWorker server, here&#8217;s what you&#8217;d do:<\/p>\n\n\n\n<ol class=\"wp-block-list\"><li>On the NetWorker server, run:<ul><li>An mminfo query generating -S output &#8211; e.g., &#8220;<strong>mminfo -avot -S<\/strong>&#8221; &#8211; writing that to a file\/directory that&#8217;ll have plenty of free space.<\/li><li>An mminfo query generating the volume list on the server: &#8220;<strong>mminfo -mv -xc,<\/strong>&#8220;, writing that file <\/li><\/ul><\/li><li>Transfer the two files across to your workstation.<\/li><li>Run the appropriate invocation of dedupe-analysis. For example:<ul><li>Let&#8217;s say your mminfo -S output is saved as: mminfo_S.txt<\/li><li>You have mminfo -mv -xc, output saved as: mminfo_mv.txt<\/li><li>You want per-host details as well as the rollups.<\/li><li>You want the files\/directories generated to be prefixed with MyNSRServer<\/li><li>You&#8217;d run: dedupe-analysis -f mminfo_S.txt -M mminfo_mv.txt -i -o MyNSRServer<\/li><\/ul><\/li><\/ol>\n\n\n\n<h2 class=\"wp-block-heading\">Downloads<\/h2>\n\n\n\n<p>You can download an executable version of the deduplication analysis tool for your platform below. I&#8217;ve included MD5 checksums for both the zip files, and the included binaries.<\/p>\n\n\n\n<ul class=\"wp-block-list\"><li>Linux:<ul><li>Zip file MD5: 5ee89e693d5060b2bb4567dec1d05cd4<\/li><li>Binary file MD5: b7512b9032bcd073309c8682afc1f230<\/li><li><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe-analysis-Linux.zip\"><strong>Zip Download<\/strong><\/a><\/li><\/ul><\/li><li>Windows:<ul><li>Zip file MD5: 00d0d20b9516f38541ebcda0455ae1ff<\/li><li>Binary file MD5: 9ea914d643806be9a3e7cf8c49df2c34<\/li><li><strong><a href=\"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/dedupe-analysis-Windows.zip\">Zip Download<\/a><\/strong><\/li><\/ul><\/li><\/ul>\n\n\n\n<p>Happy deduping!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Where we&#8217;re at A few weeks ago I posted a (lengthy) Perl script (and explanation) for running deduplication analysis against&hellip;<\/p>\n","protected":false},"author":1,"featured_media":10722,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[1181,16,20],"tags":[275,301,1249],"class_list":["post-10717","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-data-domain-2","category-networker","category-scripting","tag-data-domain","tag-deduplication","tag-networker"],"aioseo_notices":[],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"https:\/\/nsrd.info\/blog\/wp-content\/uploads\/2021\/12\/bigStock-Data-Compression.jpg","jetpack_shortlink":"https:\/\/wp.me\/pKpIN-2MR","jetpack_sharing_enabled":true,"jetpack_likes_enabled":true,"_links":{"self":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/10717","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/comments?post=10717"}],"version-history":[{"count":3,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/10717\/revisions"}],"predecessor-version":[{"id":10723,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/posts\/10717\/revisions\/10723"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/media\/10722"}],"wp:attachment":[{"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/media?parent=10717"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/categories?post=10717"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nsrd.info\/blog\/wp-json\/wp\/v2\/tags?post=10717"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}