Archive for the ‘Books’ category

Back to the Future

September 26th, 2009

Anyone with children will know all about regression; when behaviour goes backwards and you wonder what on earth just happened.  As adults we are also prone to regression and at work and under pressure many of us revert to behaviours that aren’t ones we would probably choose in some of our less pressured moments.

The actual term regression is attributed to Francis Galton, a prolific 19th century statistician (amongst many other things) and a half-cousin of Charles Darwin.  In a statistical context, the term refers to the relationship between the selected values of x and the observed values of y which when used correctly can be incredibly powerful and enable the prediction of future events.

I’m particularly interested in regression at the moment because I want to further refine my thinking on scatterplots that appear to show a reliable correlation between course searches and enrolments.  I’ve had very little formal training in statistics so when I spotted Stephen Few’s positive review of an O’Reilly publication called Head First Data Analysis, I thought I’d give it a go.

headfirstdataanalysis

What a great book.  OK, some of it is a little basic, but the 80-odd pages on statistical regression are just fantastic and walked me through the basic concepts and then extensions of them effortlessly.  The Head First series isn’t something I’ve seen before but I really like the approach they use - the idea of keeping your brain busy and focussed on the content while learning some fairly detailed concepts - something I always struggle with when reading drier, more academic publications.  I managed to consume all 400-odd pages in the course of a return flight to Sydney so I’d consider it light reading and highly effective.

If like me you now have a hunger for more detail, there is a reference to our old friend Edward Tufte right at the end of the book.  In the 70’s Tufte published a book on regression called Data Analysis for Public Policy which is jammed full of theory and relevant examples.  The best bit of all is it can be downloaded for free here.

Metric Distribution Analysis

July 6th, 2009

nysi_coverBeing a bit of a Stephen Few fan, I’ve now absorbed his latest book, Now You See It: Simple Visualisation Techniques for Quantitative Analysis.

Unlike everyone else (yes everyone) who has reviewed it on Amazon.com so far, I don’t actually think it deserves a full 5-Star rating on all fronts (for instance, the reproduction of Tableau charts and some other outputs is low in terms of image quality, which for a visualisation publication surely is a problem).  That aside however, it does have some excellent content and one particular device which I’ve latched onto is contained in the chapter on Distribution Analysis.

Representing a metric and providing information on the value relative to the entire distribution is having your cake and eating it.  Not only do you get the measure, but you see where it sits relative to its peers so I see this as a very complimentary graphic alongside the more traditional metric traffic light and trend presentation.

So what could a metric distribution analysis look like?

Few uses an example where you have the Low, Median and High values of a particular measure displayed on a single axis, he then improves the example by adding a marker for the 25th and 75th percentiles.

This then starts to sound very similar to the tertile distribution (where we chop the data up into thirds) we have been working on for presenting unit and course metrics at UNE.  We have calculated a lower, mid and upper tertile with boundaries at 33.34% and 66.67%.  Each measure for each unit and course is then calculated and given the appropriate traffic light depending on which tertile it falls into.

What we hadn’t thought of was being able to supplement the traffic light, trend and value with a distribution chart.  So here’s how it currently looks in development for a sample unit for our Attrition measure based on Few’s ideas:

picture-9

  • Attrition is less than 10% for this unit (green star)
  • The lowest attrition of any unit is just under 5% (left red triangle)
  • The 33rd percentile is at around 14% attrition (blue vertical bar)
  • The 66th percentile is at around 22% attrition (light brown vertical bar)
  • The highest attrition of any unit is around 31% (right red triangle)
  • This unit is performing very well and attrition is significantly low

In combination with the traffic light, trend and time-series line chart, there is a huge amount of information being conveyed by a very simple instrument.  Should I add values to this?  Well maybe.  I tried and it got really cluttered and of course the traffic light scorecard itself will have the values so maybe this graphic is fine just as it is when used in conjunction with the other devices.

Collaborating on Agile BI/DW

June 4th, 2009

This should appeal to us as a university bunch, surely…

After dreaming about it, talking about it, and researching for it, I am officially launching the “Agile Business Intelligence – Collaborative Book” project.

Great idea Martin, I hope you get lots of collaboration on this one.

http://analytical-mind.com/2009/06/01/agile-business-intelligence-collaborative-book/

There’s a registration form you can complete if you’d like to be involved.

Death by PowerPoint

December 5th, 2008

A colleague recently commented that I spend more on textbooks than most other areas of IT put together and I guess it is probably true. This is something of a paradox you might think; a topic that is ‘leading edge’ and exists on computer screens but yet for which the development team relies heavily on the printed medium.

I’ve been truly amazed at the volume and quality of current publication books that are wholly relevant to BI. Not books on tuning to optimise warehouse ETL performance, or even books that you are already reading like those from Kimball University, but books on really creative things, like design and layout.

Edward Tufte - VDQI

The whole art of physical ‘cutting and pasting’ existed way before Ctrl/Command-X/Vcame along and we really have corrupted the art in typical modern ‘bigger, better, faster’ fashion. Fortunately the purists live on and we have much to learn from them. So if you are responsible for delivering reports or BI content to your community I urge you to invest a few hours just researching the design and layout aspects of your information presentation.  If you’re not already a convert I think you will be surprised at just how much there is to it and probably unnerved about how little you know and some of the basic mistakes we have all been making.

Tufte just has to be on the shelf of any BI presentation designer and everyone I’ve shown this particular volume to has either bought a copy themselves or coveted mine past the point of politeness. The physical presentation in this book is incredible and you find yourself respectfully turning each page marveling at the typesetting and the effort put into its creation. I particularly wanted to mention Tufte because he referenced a PowerPoint version of the Gettysburg Address from Peter Norvig, Director of Research at Google, which includes a wonderful example of a meaningless slide, the type of which we unfortunately see and endure all too many of.

His point, and that of many others, is that PowerPoint enables awful presentations, ones that send the audience to sleep, after insulting them.

My point, is that I’ve finally decided to do something about it and stop doing PowerPoint presentations (well OK, seriously cut down on them). Being a BI evangelist in a university is a role that involves making a lot of presentations. So I bought another book (no surprise there) which describes the Extreme Presentation Method. I dismissed it initially as a pretty insignificant volume but dragged myself through it this week as I have a presentation to make to CAUDIT on Monday.

So far I have a lot of pieces of paper with scribbles on them, some cards which vaguely resemble a Best Man speech I gave at a wedding about 10 years ago, and not a PowerPoint or bullet in sight!

I’ll let you know how I go but so far I think it might just be crazy enough to work…