Here are the notes from my Warm Gun SF 2013 keynote, based on one of the stories from The Year Without Pants (An Amazon.com best book of 2013). Thanks to folks that were there for being a great crowd.
“Faith in a source of data grows in direct relation to your distance from the collection of it”
- The Data Paradox. No matter how much data you have, you will still depend on intuition to decide how to interpret, explain and use the data (See: Amygdala). Intuition is also used to pick samples, design queries, chose statistical models and define what outliers are. In A/B testing, you use intuition to decide what B is. Underneath all of our rational intellect is intuition, which influences our “rational” behavior far more than we admit. Often data yields unavoidable tradeoffs where two or more options are equally viable and someone must make a judgement call beyond the data. (In strict paradox form: the more data you have the less you know).
- No team or organization is Data-driven. Data is non conscious: it is merely a list of stupid, dead numbers. Data doesn’t not have a brain and therefore can’t drive or lead anything. At best you want to be data influenced, where (living) decision makers have good data available that they can use to help answer good questions about what they’re doing, how well it’s being done and what perhaps they should be doing in the future. All data has bias and blind-spots and a truly data-driven organization will drive itself into the ground chasing the illusion of purely objective truth.
- Data is a flashlight. Data gives you specific information about a singular vector of information. Data, like a flashlight, is only as useful as the person wielding it and the person interpreting what it shows. It has no magical powers. To get good information you want multiple sources so you can triangulate information and compensate for the inherent biases each kind of data has. For example A/B testing can tell you things customer interviews can’t and vice versa. One analytical model suggests one hypothesis but a different method can suggest another with the same data.
- Ban the phrase “The data says.” Data can’t say anything for the same reason it can’t drive anything: data is inert. People, including data experts or growth hackers, can never speak singularly for the data. At best they are interpreters, offering one interpretation of what the useful narrative story derived from the data is (if there is one at all). Better experts yield better interpretations but never is their interpretation the only one available. If every anyone utters “the data says” they are pretending data can have a singular interpretation which it never does, and this false faith prevents the asking of good questions, such as: is there an equally valid hypothesis based on this data that suggests a different conclusion than yours? (The answer is often yes).
- Cognitive Bias pollutes our view of data. We know our brains are kludges, vulnerable to optical illusions. We also have blind spots in our cognition called cognitive biases. The most common one regarding data is confirmation bias, where we seek only to validate our preconceptions and stop doing analysis as soon as we have a singular hypothesis that supports our assumptions. Another dangerous bias is narrative bias, which is our attraction to stories. We love stories that are easy to understand, easy to say and that make us feel good, and will project these stories into data compulsively.
- Cui Bono -”who benefits?” Who paid for this data? What was their reason for paying for it? What ambitions do they have? Certain outcomes of data benefit the people asking for the data and the people who capture the data, biasing the results. In political elections it’s common to see competing campaigns find very different data for who is in the lead, each finding their own candidate in front. Another example is how company founders will select data that makes them sound the best when pitching for funding (And VC firms will listen for the kinds of data they want to hear). Generally in life when you’re confused about why a strange decision was made, or there is grand incompetence, or nothing is happening at all, ask cui bono?
Also see: Data Death Spiral
You can watch the actual keynote presentation below: