In 7 weeks I’ll be moving. Not the blog – that’s staying right here, but I, personally will be moving. I’ve mentioned this once or twice in the past, but as it approaches crunch time there’s some administrivia to mention.

We’ll be moving in the first week of June, and it’s a move of about 750km as the crow flies, or 950-1000km in terms of roads. When Darren and I first started planning the move, I had grandiose plans that I’d build up a bunch of draft blog articles and continue to release them regularly throughout the move.

Now, I’m not so sure. 50 days may sound like a lot, but when you work during the week you can start counting the hours before you move. Maybe 2-3 productive hours every night, if you’re up to it, and the weekends. Including the weekend we have the removalists arrive, there’s now 7 weekends left to us – assuming spending 12 hours each day of each weekend, that’s 168 hours to go.

So maybe add in those week night hours, and you’ve got say, 250 hours to go.

250 hours left to finish compressing and deduplicating our lives and transfer them as compactly as possible to another location.

Things are going to get tight, and my blog posts may get more sporadic between now and mid-June, when the move should be wrapped up and we’re settled. So, please, bear with me – the channel isn’t changing, but the broadcaster is moving, and that can cause some delays.

And if you want to know more about the move, check my personal blog entry here.

 

The folks over at 37 Signals published a little piece of what I would have to describe as crazy fiction, about how the combination of cloud and more technically savvy users means that we’re now seeing the end of the IT department.

I thought long and hard about writing a rebuttal here, but quite frankly, their lack of logic made me too mad to publish the article on my main blog, where I try to be a little more polite.

So, if you don’t mind a few strong words and want to read a rebuttal to 37 Signals, check out my response here.

 

A while ago I wrote about the “Perils of an Icarus Support Contract” … funnily enough, when I did, it actually inspired me to finally go out and get some tattoo work done. Greek legends and mythology had always appealed to me, and so it seemed a natural fit when I realised there’s some strong messages in the flight of Icarus and Daedalus.

Over at my personal blog I’ve been chronicling the process. It started with this post – which also explains why I chose what some people would consider to be a morbid tale (I don’t). Over several sessions it’s continued through to this current state.

 

A while ago I got rather frustrated with the performance of compression utilities on my Mac Pro. It’s a bit of a beast; it was the last model before the Nehalem based systems, but has 8 x 3.2GHz cores and a respectable 20GB of RAM, but compression isn’t always as fast as I’d have liked.

Coming from a long-term Unix background, I have tended mostly to use bzip2 – brilliant compression ratios, but slow, slow, slow.

Eventually it occurred to me though was that the problem was simple: bzip2 is single-threaded. I can compress a file and using Activity Monitor, see a single core sit at a high utilisation rate – but that’s all.

Apple’s Grand Central though made me think: how much better would compression utilities work if they could run against multiple cores? Not running multiple compression activities at the same time – but splitting it up and hitting as many cores as possible at once.

As you’d expect, the answer is: much, much better. With a little bit of searching, I found pbzip2 – a parallel processor version of bzip2. Check it out, and be sure to donate to the programmer – he deserves the support.

Without a doubt, it’s a much faster way of compressing files when you have a bunch of cores available to throw at the problem. Here’s a test scenario:

  • Generate an 8GB, highly random file.
  • Time a regular bzip2 on the file.
  • Time a pbzip2 on the file.

The results? See for yourself:

# du -hs test.dat
8.0GB   test.dat
# date ; bzip2 < test.dat > test-bzip.dat.bz2; date
Sun 13 Feb 2011 12:33:11 EST
Sun 13 Feb 2011 13:02:16 EST
# date; pbzip2 < test.dat > test-pbzip.dat.bz2; date
Sun 13 Feb 2011 13:06:58 EST 
Sun 13 Feb 2011 13:12:05 EST

In the above case, the compressed files are both still effectively 8GB, since the source file was designed to not be susceptible to compression. Looking at a “real world” example then, I’ll pick a virtual machine. In actual fact, being able to quickly compress copies of virtual machines was the reason I first looked around for a better compression utility, so it makes sense to do so. Picking a small virtual machine, I can see:

[Sun Feb 13 13:20:42]
preston@aralathan /Volumes/Data/VMs
$ du -hs test02.pvm/
6.1G	test02.pvm/

Now, compressing with regular bzip2 via tar, we get:

$ date; tar cf - test02.pvm | bzip2 -c > ~/Desktop/test-bzip2.pvm.bz2; date
Sun 13 Feb 2011 13:25:37 EST
Sun 13 Feb 2011 13:40:45 EST
$ du -ms ~/Desktop/test-bzip2.pvm.bz2
2087	/Users/preston/Desktop/test-bzip2.pvm.bz2 

That was a total of 15 minutes, 10 seconds to compress 6.1GB down to 2087 MB using conventional bzip2.

Moving on to tar/pbzip2:

$ date; tar cf - test02.pvm | pbzip2 -c > ~/Desktop/test-pbzip2.pvm.bz2; date
Sun 13 Feb 2011 13:47:23 EST
Sun 13 Feb 2011 13:49:53 EST
$ du -ms ~/Desktop/test-pbzip2.pvm.bz2 
2092	/Users/preston/Desktop/test-pbzip2.pvm.bz2

So while it cost an extra 5MB in storage, pbzip2 compressed the same data in 150 seconds – or two and a half minutes, if you will. (I should note, that was reading from a 2 x 7200 RPM stripe, and writing to SSD. At this point the compression seems to be IO bound on the read, if anything – I had previously been compressing the same data in 104 seconds to a firewire-800 drive when reading from a 3 x 7200 RPM stripe.)

If you’re needing high speed compression, be sure to check out pbzip2.

NOTE: You may be wondering why I didn’t use the Unix ‘time’ command. Ever since encountering a bug in Tru64 Unix ‘time’, I’ve steered clear of it. (That bug resulted in ‘time’ blowing out the runtime of what it was monitoring by several orders of multitude.) I know, it would be safe to go back to ‘time’ by now, but for the purposes of what I needed to demonstrate, “date; command; date” was more than sufficient.

 

I’m pleased to say that IDATA NZ is also looking for new staff. Here’s the advert, for anyone interested in working out of New Zealand’s capital city:

IDATA is a leading data protection, storage, virtualisation and high availability solutions provider.  Our customer base includes many high profile companies throughout Australasia.

Due to a high demand for our services and solutions IDATA are seeking an additional senior consultant to work from our Wellington office.  You will have extensive experience in data storage and protection solutions in a predominantly enterprise environment.

To be successful in this role you will also have experience in Virtualisation, Storage, Backup, and Archiving products

Candidates with experience in the above technologies with one or more of the following technology providers: EMC, VMware, IBM, Symantec, NetApp can apply here: careers@idataresolutions.com

 

My employer, IDATA Australia, is expanding, and we’re looking for subject matter experts around EMC products, with a strong focus on backup and archiving skills. Regardless of whether you’re located in Australia or the other side of the world, we may have a position for you. The roles will cover both support and services, so you need to have great communication skills and a strong customer focus.

Specifically, we’re looking for people who are comfortable working in a combined support and services team – i.e., the role would involve both deployments and training on customer sites, as well as customer remote support. If you’ve got experience in one or the other rather than both, that’s not a problem so long as you’re eager to learn.

The Australasian market is a region of high growth, so working for IDATA will keep you challenged to stay in top technical form.

If you’re interested in working for IDATA Australia, contact me.

(Please note, IDATA does not accept solicitations from recruitment agencies.)

 

Consulting isn’t the glamour job that a lot of people think it is. Sure, you may get to travel and learn new things, but as anyone who travels a lot will tell you, it can be a bit wearing after a while, and you find yourself suddenly hankering for the most basic of simple meals after weeks at a time of hotel or restaurant food.

Nor do you do consulting for the fame. Sure, good consultants will get reputations in their fields, but we’re hardly household names (except maybe in particularly niche/geeky households). Undoubtedly I think the best consultants start with a passion for their field and a desire to learn as much as possible, but this will only get you so far. There needs to be something else to prevent burn out and to allow for personal growth.

I’ve been extraordinarily lucky to develop and keep long term customer relationships, carrying customers between my last company (which collapsed) and my current company, and equally having customers carry those relationships with me from one company to the next. At one level, they remain customers – but at another, as you’ve been working with certain customers year in, year out, you realise that the true value of consulting is the people you’re doing the consulting for. It’s not about being able to solve problems and challenges for companies that you deal with, it’s about being able to solve problems and challenges for the individual people you deal with. The first just pays the bills; the second one is what gives you the job satisfaction.

In the past couple of weeks I’ve had a few people I’ve been working with for years advise me that they’re moving on to other endeavours. It’s always at this time that I pause to think about the core reason I do this job: to help people.

 

Just to let you folks know that I’m currently just “mostly missing in action” rather than “missing in action”.

It’s been a hectic few weeks for me, both personally and work-wise, and as my role adjusts to some changes I’ll be doing more travel again which will mean getting used to doing posts on the run.

I’m currently in (not so) sunny Western Australia – specifically, Bunbury, doing some work for a long-term customer, and flying back on Friday. That’ll give me Saturday and Sunday to recharge my batteries and get some Greek coffee into my system. (Maybe, if my friendly tattoo artist will help out, maybe Sunday will also see me getting some new ink. Though maybe that won’t happen if I don’t remember to email her – hmmm…)

Next article: Last week I attended the SNIA AU/NZ first storage blogfest – where myself and a group of other bloggers/tweeters were taken to EMC, IBM, HDS and NetApp for some discussions and technology direction overviews. I had hoped to blog about it by now – but instead I’m aiming to have blogged about it by the time I leave WA – that’ll mean either tonight, or tomorrow night.

Stay tuned – I’m not MIA, just MMIA.

 

For the release of both Mac OS X Tiger (10.4 – 2005) and Max OS X Leopard (10.5 – 2007), Apple had various mocking campaigns and posters for the preceding conferences with slogans along the lines of:

Redmond, start your photocopiers

This was a very public and very open jibe from Apple regarding Microsoft’s reputation for simply copying features from Mac OS X. Now, I don’t want to really get into the “you’re a fanboy – no, you’re a fanboy!” style argument, but I do want to suggest that given the recent debacle that’s started to surface over the abysmal performance of the Windows 7 backup process, Microsoft appears to be cutting their noses off to spite their faces.

Back on 6 March 2009, I covered just how amazing Time Machine was as an OS-integrated backup product. I never said it was something that would replace enterprise products like NetWorker, but I did say:

This, quite honestly, is the epitome of simplicity. Going beyond standard backup and recovery operations, Time Machine is also an excellent disaster recovery tool – if you have serious enough issues that you need to rebuild your machine, the Mac OS X installer actually has the option of doing a rebuild and recovery from Time Machine backups.

To be blunt – as a backup utility for end users, Time Machine is an ace in the hole, and one of the most underrated features of Mac OS X.

Sure, Time Machine doesn’t do everything that every user wants it to do – but then again, no product ever will. Yet I’ve backed up a significant number of TB (as far as desktops go) using Time Machine, and recently I was highly pleased to be able to recover 18 months of my fathers’ hard work with no effort at all. This was from a machine where I’d setup Time Machine and had not had a chance to visit since – nor check remotely, since my parents don’t use the internet.

So frankly, on behalf of Windows users, I’m somewhat horrified at the experiences being felt with Microsoft’s Windows 7 backup utility – and their use case scenarios!

As documented over at The Register, “Windows 7 Backup Gets Users’ Backs Up”, there’s a litany of issues being reported:

Jon Hell posted on April 23 that he is backing up 900GB of data on a quad core PC with 7GB of RAM; “After twenty four hours Windows Backup had managed to complete 18 per cent of the backup, but after forty eight hours, it had got even slower, and had only reached 23 per cent of the full backup.”

And:

John Dougrez-Lewis was the first poster, and wrote that he could use file copy to move 250GB of file data to an external eSATA drive in an hour at a speed of 72MB/sec. When he did the same job using Windows 7 RTM Backup it took 14 hours, roughly 5MB/sec – more than 14 times slower.

If these were isolated experiences it could be understood – after all, no product will work perfectly for every single person.

The actual Microsoft forum regarding the issues is directly available via this link. We also see an article from Microsoft, Backing up large data set on Windows 7:

Windows Backup is optimized to help home users protect their important data on their PCs and this is typically expected to be 200GB of data on average. On a PC that contains significantly larger data size, Windows Backup’s performance may degrade. If you need to back up more than 400GB of data, we recommend that you backup your PC using a system image.

Sorry to say, but this “meh” attitude towards backup turns my stomach. If this were an article published a decade ago about an OS-included backup utility it might be understandable – after all, a decade ago, 400GB of data was a big amount!

The article goes on to provide instructions for setting up a scheduled system image. Sure, the average techo will look at the instructions provided and punch through them in a couple of minutes at most, but with instructions like the following, you’re guaranteed to (a) turn most average users off and (b) definitely provide a terrible user experience:

If you have a separate data drive, you will need to create a task in Task Scheduler to create the system image:

a.      Open an elevated command prompt

b.      Type the following command:

SCHTASKS /Create /SC <Frequency> /TN <TaskName> /RL HIGHEST /ST <StartTime> /TR “WBADMIN START Backup –backupTarget:<target> -include:<source> -quiet”

This goes to the heart of why Time Machine is so successful – Apple recognised that the only way to get users to backup is to make it painless and easy. Microsoft’s approach to end-user backup seems to be diametrically opposed to that of Apple – and as a result of it, I know which backup mechanism will save more consumer data, even given the hugely different market shares of the platforms.

When it comes to backup, Microsoft would do well to “start their photocopiers”.

 

I will be taking a hiatus from this blog, which will last at least two weeks, or perhaps longer.

© 2012 The NetWorker Blog Suffusion theme by Sayontan Sinha