Skip to main content

Posts

Interactive NYC commuting data illustrates distribution of the sampling mean, median

Josh Katz and Kevin Quealy p ut together a cool interactive website to help users better understand their NYC commute . With the creation of this website, they also are helping statistics instructors illustrate a number of basic statistics lessons. To use the website, select two stations... The website returns a bee swarm plot, where each dot represents one day's commuting time over a 16-month sample.   So, handy for NYC commuters, but also statistics instructors. How to use in class: 1. Conceptual demonstration of the sampling distribution of the sample mean . To be clear, each dot doesn't represent the mean of a sample. However, I think this still does a good job of showing how much variability exists for commute time on a given day. The commute can vary wildly depending on the day when the sample was collected, but every data point is accurate.  2. Variability . Here, students can see the variability in commuting time. I think this example is e...

Do Americans spend $18K/year on non-essentials?

This is a fine example of using misleading statistics to try and make an argument. USA Today tweeted out this graphic , related to some data that was collected by some firm. There appear to be a number of method issues with this data, so a number of ways to use this in your class: 1) False Dichotomy:  Survey response options should be mutually exclusive. I think there are two types of muddled dichotomies with this data: a) What is "essential"? When my kids were younger, I had an online subscription for diapers. Those were absolutely essential and I received a discount on my order since it was a subscription. However, according to this survey dichotomy, are they an indulgence since they were a subscription that originated online. b) Many purchases fall into multiple categories. Did the survey creators "double-dip" as to pad each mean and push the data towards it's $18K conclusion? Were participants clear that "drinks out with frien...

Pew Research's "Gender and Jobs in Online Image Searches"

You know how every few months, someone Tweets about stock photos that are generated when you Google "professor"? And those photos mainly depict white dudes? See below. Say "hi" to Former President and former law school professor Obama, coming it at #10, several slots after "novelty kid professor in lab coat". Well, Pew Research decided to quantify this perennial Tweet, and expand it far beyond academia. They used Machine Learning to search through over 10K images depicting 105 occupations and test whether or not the images showed gender bias.  How you can use this research in your RM class: 1. There are multiple ways to quantify and operationalize your variables . There are different ways to measure phenomena. If you read through the report, you will learn that Pew both a) compared actual gender ratios to the gender ratios they found in the pictures and b) counted how long it took until a search result returned the picture of a woman for a given j...