Among the misconceptions regarding Big Data, two important ones stand out: that correlations alone suffice and that Big Data means sampling bias is no longer an issue.
Fooled by Association
First, Big Data mining advocates claim that correlations suffice and the quest for causal interpretation should be abandoned. The real danger is that you will be "fooled by association," as explained in Freakonomics.
I consulted car company managers who were upset because, though profits were up, a "key performance indicator" was down. After causal analysis, what became clear was that this indicator did not cause or lead profits; the correlation was merely a coincidence and turned around in recent periods. As a result of the causal analysis, managers could refocus their energies on moving those indicators that do lead to sales.
The Sampling Bias
Second, Big Data sometimes gives the illusion that sampling bias is no longer an issue (as it is for small data) because the data capture the entire population. However, "N = all is often an assumption rather than a fact about the data" (Kaiser Fung, Numbersense).
For example, your social media data may accurately capture online sentiment but only for those consumers who are online and care enough about your brand and product category to comment through the online channel.
In recent research across 15 product categories, we compared the power of representative offline survey metrics (awareness, consideration, and liking) and online behavior metrics (paid ad clicks, site visits, and social media conversations) to explain and predict sales. We found that online behavior metrics excelled in short-term predictions, but that offline survey metrics excelled in medium-term predictions.
What Have We Learned?
Blowing up data size does dissolve us from the challenges of meaningful inference from the data. The recent review of the Google Flu Trends "success" story illustrates both the importance of causal inference and the sampling bias, excellently described by Tim Harford.
"Big Data has arrived, but big insights have not," states Harford. "The challenge now is to solve new problems and gain new answers—without making the same old statistical mistakes on a grander scale than ever."
Continue reading "Big Data Has Arrived, but Big Insights Have Not" ... Read the full article
MarketingProfs provides thousands of marketing resources, entirely free!
Simply subscribe to our newsletter and get instant access to how-to articles, guides, webinars and more for nada, nothing, zip, zilch, on the house...delivered right to your inbox! MarketingProfs is the largest marketing community in the world, and we are here to help you be a better marketer.
You may like these other MarketingProfs articles related to Metrics & ROI:
- How to Implement Artificial Intelligence in Marketing: Rajkumar Venkatesan on Marketing Smarts [Podcast]
- How to Use Email Metrics to Optimize Your Campaigns [Infographic]
- Analyzing the Analyst: A Guide to Holistic Analytics for Tracking the Right Metrics
- Measuring Customer LTV: Marketers' Top Approaches and Challenges
- Seven B2B Sales Metrics That Can Help You Plan Your Marketing Strategy