top of page

Notes from FiveThirtyEight Talk on Telling Stories

  • Writer: CANA
    CANA
  • Feb 17, 2017
  • 3 min read

Updated: Sep 13, 2022


Data Telling Stories

“This is the best talk I’ve attended in over a year.”- Harrison Schramm

You may know Harrison Schramm from his “5 Minute Analyst” articles and blog posts, and when he isn’t thinking of the cost of the Death Star or solving the logistics problems of Harry Potter, he also is one of CANA Advisors’ Principal Operations Research Analysts. Recently he had the opportunity to go to a FiveThirtyEight Talk on Telling Stories (at the RStudio::conf ). In his words, Harrison said, “[t]his is the best talk I’ve attended in over a year.” In a change of pace from writing a blog post or article on the talk, we asked Harrison if he would share his notes on the event, and he was kind enough to pass them along. We hope these notes spark your interest in not just the ‘how’ but the ‘why’ of statistical analysis.

****From the Event Notebook of Harrison Schramm****

Data Journalism Principles:

Story leads data follows use rigorous but interminable methods: Be accurate, Be fast, and Be transparent.

Useful tools for R.

tidyverse is the tool of choice for data. (The tidyverse is a set of packages that work in harmony because they share common data representations and API design. https://blog.rstudio.org/2016/09/15/tidyverse-1-0-0/)

In the interest of transparency, FiveThityEight has created an R package. (Nate Silver’s FiveThirtyEight uses statistical analysis — hard numbers — to tell compelling stories about politics, sports, science, economics and culture. https://github.com/fivethirtyeight). For example, if you would like to see a breakdown of Avengers Characters by longevity and gender, you can do the following:

Install.packages(“fivethirtyeight”)

Library(ggplot2); library(magrittr); library(“fivethirtyeight”)

avengers %>% ggplot(aes(factor(death1), years_since_joining)) + geom_violin() + facet_wrap(~gender) + xlab("Currently Living?") + ylab("Years Since Joining") + ggtitle("Avengers Characters Violin Plot - Status vs. Years")

The Six Types of Data Stories

  1. Novelty

  2. Outlier

  3. Archetype

  4. Trend

  5. Debunking

  6. Forecast

Novelty Data Story: Basic questions are first.
  • New Data Story danger: Triviality

  • Remedy: Simple Summaries

  • Ask yourself: Is this data meaningful to others?

Outlier Stories
  • Danger: Spurious Result

  • Tactic: Characters - talk about who the outlier is: who is it, what company is it, etc.

  • Profile one of the characters from the outlier group, then introduce the statistics

  • Ask yourself: Is this really so different?

Archetype Stories
  • Danger: Oversimplification

  • Tactic: Modeling

  • Ask Yourself: What Variables am I leaving out?

Trend
  • Trends: Terrorism overall declining in the EU, but religiously inspired attacks rising.

  • Done using dplyr, data %>% group_by %>% summarize %>% ggplot

  • Danger: Variance - regression to the mean

  • Tactic: Be Conservative

  • Ask yourself: Is this signal or noise?

  • Fun Quote: If you can always tell a valid trend, you should be trading on wall street, not telling data stories

Debunking
  • Bechdel test: Examines how women are portrayed in movies. 1. Are there 2 or more women, 2. Do they talk to each other, 3. Do they talk to each other about something other than men?

  • Danger: Confirmation Bias - your own belief in the debunking action.

  • Tactic: Showcase Failures

  • Ask Yourself: How much do I want to debunk this?

  • Quote about p-hacking: Warning: This is evil (statistical) work. Do not go to the dark side. Do not try this at home. Note: You can read Harrison’s piece on P-hacking appearing in OR/MS Today here: https://www.informs.org/ORMS-Today/Public-Articles/June-Volume-43-Number-3/P-value-Primer-P-OR-P-values-in-operations-research-M-N-O-P-Q-R-S-T

  • Example of p-hacking: Eating potato chips leads to higher SAT Math scores.

Forecast (You work a narrow path here)
  • Danger: Overfitting

  • Tactic: Simulations and scenarios

  • Ask Yourself: Am I properly conveying the uncertainty in my model?

We hope these notes from Harrison Schramm on R and how to use it to tell a story with your statistical and analytical data is useful.

Follow Harrison (@5MinuteAnalyst on twitter) and the rest of the CANA Advisors’ Team (@CANAADVISORS on Facebook and twitter) for more insight, blog posts and articles devolving into data, logistics and analytics in creative and helpful ways.

what is your data story?

Other interesting CANA Articles on R:

Blog Article: Notes on The Seven Pillars of Statistical Wisdom http://www.canallc.com/single-post/2016/09/16/Notes-on-The-Seven-Pillars-of-Statistical-Wisdom

 
 
 

6 Comments


Çikolata, binlerce yıllık geçmişi olan lezzetli bir yiyecektir. İlk olarak Orta Amerika'da yaşayan Olmekler, Mayalar ve Aztekler tarafından keşfedilmiştir. Bu uygarlıklar, kakao çekirdeklerini kutsal kabul etmiş ve içecek olarak tüketmişlerdir. Aztekler, kakaoyu baharatlarla karıştırarak "xocoatl" adını verdikleri acı bir içecek yapmışlardır. Avrupa'ya ise 16. yüzyılda İspanyol kaşifler tarafından getirilmiştir. Başlangıçta sadece aristokratlar tarafından tüketilen çikolata, zamanla şeker eklenerek tatlandırılmış ve herkesin erişebileceği bir lezzet haline gelmiştir.Çikolata, kakao çekirdeklerinin toplanması, fermente edilmesi, kurutulması ve öğütülmesiyle üretilir. Kakao ağaçları, tropikal bölgelerde yetişir ve meyveleri içinde kakao çekirdeklerini barındırır. Hasattan sonra çekirdekler fermente edilerek aroma kazanır. Daha sonra kurutulup kavrulan çekirdekler öğütülerek kakao likörü elde edilir. Bu likör, çeşitli işlemlerden geçirilerek sütlü, bitter veya beyaz çikolata gibi farklı türlerde ürünlere dönüştürülür. Çikolatanın lezzeti,…

Like


MZKO QPFQ
MZKO QPFQ
Nov 22, 2024

谷歌seo推广 游戏出海seo,引流,快排,蜘蛛池租售;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Slots Fortune Tiger Slots;

Like

TOQN TYQU
TOQN TYQU
Nov 18, 2024

谷歌seo推广 游戏出海seo,引流,快排,蜘蛛池租售;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Slots Fortune Tiger Slots;

Like

TOQN TYQU
TOQN TYQU
Nov 18, 2024

google seo google seo技术+飞机TG+cheng716051;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Fortune Tiger;

Fortune Tiger Slots Fortune Tiger Slots;

Fortune Tiger Slots Fortune Tiger Slots;

Fortune Tiger Slots Fortune Tiger Slots;

Like
bottom of page