Think Like A Data Analyst — Act Like An Archaeologist
November 8, 2010 Leave a Comment
Anyone who has read this blog will probably find clues suggesting that I am a big Dan and Chip Heath fan (and by fan I mean “groupie”). I am also a big fan of
Nathan Yau at Flowing Data. He has a consistently good blog with great content and useful ideas about how to visualize data. I have meant to build on one of his blog postings for a while, and here it is. (I hope he also feels that ‘Imitation is the sincerest of flattery.”)
Yau called his posting “Think like a statistician–without the math.” I am going to use his outline to make a slightly different point: think-like-an-archaeologist-even-though-you’re-an-analyst. At least as important as understanding database structure or business intelligence software or sophisticated statistical methods are several other skills not often taught in college statistics class (or in Analytics departments at large corporations). We could start with the maxim to Dig. Dig. Dig.–but there is more.
- “See the Big Picture”: Archaeologists sift through massive amounts of material not to find some pieces of broken pottery but to understand civilizations. This curiosity about the import of fragments of information is arguably the most prized attribute for an effective health care data analyst. Identifying a large increase in musculoskeletal costs is interesting. That’s reporting. Analytics goes much further … like this: 1) Go back to the data and discover this increase was driven by a large number of back surgeries, 2) then link that finding to the fact that there is a large percentage of overweight/obese members, 3) then notice that a very low percentage of members were participating in the available weight management program, 4) then identify that only a few members had engaged with the health coaches around “elective” surgeries. This “path” through the data is analytics — and analytics is often a journey of discovery.
- “Look Outside the Data”: We are talking about applying context and perspective. For an archaeologist, this is exactly the point–to place what they find alongside what they already know about the other historical events. In healthcare, sometimes the benchmark data is even more valuable the client’s own data. Knowing that an Employer’s average hospital days per 1,000 is 350 is not nearly as informative as knowing that the Employer’s hospital days per 1,000 than is 32% above their industry peers.
- “No Agendas”: It is important to have a hunch when you start looking for meaning across multiple health care data sets. It’s OK to start with a strong belief that diabetes or heart disease are key drivers of trend (because they are). However, it is important to let the data speak for itself. Unexpected and valuable findings will emerge as you set aside the assumptions and look for patterns, correlations, and outliers. (For an interesting discussion of archaeologists as detectives uncovering new patterns and rewriting history, listen to this podcast at RadioLab about Oxford teams unpacking ancient garbage.)
- “Ask Why”: Letting the data speak for itself is important; however, that does not mean that having a sufficient amount of cynicism and mistrust for the data is bad. Listening to your “gut” doesn’t sound like a valuable skill for a data analyst; however, there have been many, many times when I’ve questioned the accuracy of a particular health care measure and, as a result, discovered a major data error or wrong calculation. It’s easy to be captured by a particular find, and misunderstand its significance (or lack thereof). It’s just as easy to forget that the data isn’t necessarily “innocent” and must always be contextualized, tested, and cross-checked.
Health care data analysts may never be mistaken for Indiana Jones (although if we could carry to work those really cool whips it would help). The information is like treasure, though, that leads us to rich veins or patterns in a population and helps us fend off rising healthcare costs and poor health.



Company
I’ve written many times in this blog about how data, presented effectively, can change the way we see problems and think about solutions. I found an exceptional example of this in a lecture given by
Health Plans have been agonizingly slow to take advantage of the vast amounts of data available to them. Admittedly, they have some valid excuses. The barriers put in place to protect patient privacy and a lack of consistent data standards makes health data mining a formidable task. However, when I read recently that
yes and a different perspective almost always result in improved writing. In fact, good editing, or finish work, often requires different tools or skills. A very good editor inevitably breathes new energy into the writing process by solidifying tenuous connections and building out nascent ideas.