Skip to main content

Using Pew Research Center Race and Ethnicity data across your statistics curriculum

In our stats classes, we need MANY examples to convey both theories behind and the computation of statistics. These examples should be memorable. Sometimes, they can make our students laugh, and sometimes they can be couched in research. They should always make our students think.

In this spirit, I've collected three small examples from the Pew Research Center's Race and Ethnicity archive (I hope to update with more examples as time permits). I don't know if any data collection firm is above reproach, but Pew Research is pretty close. They are non-partisan, they share their research methodology, and they ask hard questions about ethnicity and race. If you use these examples in class, I think that it is crucial to present them within context: They illustrate statistical concepts, and they also demonstrate outcomes of racism. 

Lessons: Racism, ANOVA theory: between-group differences, post-hoc tests

While this is descriptive data, I think that it illustrates several ANOVA concepts. Participants reported whether or not they had experienced several different situations. It demonstrates how life experiences in America are influenced by race. It can also be used to get your students thinking about ANOVA.

You can think about ethnicity as the factor with four levels (Asian, Black, Hispanic, White). 
If you are teaching ANOVA, this data visualization illustrates between-group differences, both in terms of statistics and life in America. You can ask your students to speculate on which item has the most between-group variance and the least between-group variance. You could also think about post-hoc results for these findings. For the question "Been unfairly stopped by the police", Asian and Hispanic respondents may not differ significantly from one another. Still, Black and White respondents vary considerably from each other and from Asians and Hispanics. 

Data from Pew: "Most blacks say someone has acted suspicious of them or as if they aren't smart."

Lessons: Racism, categorical data, nominal data, free-response options, research methods, psychometrics

This article describes the options provided by the US Census Bureau for reporting race (their term), and how far those options have evolved since the first US census (1790). It also illustrates how racist attitudes can be seen in the way the government elected to describe the race of American citizens. It gives a little hope that things are changing (at least at the Census Bureau).

In 1790, the only options for race were: a) Free white males and female, 2) All other free persons, and 3) slaves. 

Until 1960, the person recording your census data would pick your race. 

Americans couldn't describe their backgrounds by selecting multiple races until 2000

Regardless, the Census Bureau is trying harder in 2020, and the new way of recording race provides examples of categorical variables, qualitative data, free response. 

https://www.pewresearch.org/fact-tank/2020/02/25/the-changing-categories-the-u-s-has-used-to-measure-race/

Furthermore, if you want to discuss politicized data collection, you could also dig into Hansi Lo Wang's reporting for NPR on this topic. It is excellent.

Lessons: Racism, Chi-Square theory: observed data, expected data

Pew Research created a graph to illustrate one facet of systematic racism in health care: Black people are dying of COVID-19 in numbers that are disproportionate and alarming. (Aside: I also teach my students about this, framing it in data-driven ML, using this article from Nature)

I think this graph does an excellent job of conveying a concept at the core of chi-square: The distance between expected data (The black portion of the population) versus the observed data (Black share of COVID-19 deaths) is illustrated by state.

https://www.pewresearch.org/fact-tank/2020/06/04/black-americans-face-higher-covid-19-risks-are-more-hesitant-to-trust-medical-scientists-get-vaccinated/ft_20-06-01_covidracehealth_3/
https://www.pewresearch.org/fact-tank/2020/06/04/black-americans-face-higher-covid-19-risks-are-more-hesitant-to-trust-medical-scientists-get-vaccinated/ft_20-06-01_covidracehealth_3/


Comments

Popular posts from this blog

Ways to use funny meme scales in your stats classes

Have you ever heard of the theory that there are multiple people worldwide thinking about the same novel thing at the same time? It is the multiple discovery hypothesis of invention . Like, multiple great minds around the world were working on calculus at the same time. Well, I think a bunch of super-duper psychology professors were all thinking about scale memes and pedagogy at the same time. Clearly, this is just as impressive as calculus. Who were some of these great minds? 1) Dr.  Molly Metz maintains a curated list of hilarious "How you doing?" scales.  2) Dr. Esther Lindenström posted about using these scales as student check-ins. 3) I was working on a blog post about using such scales to teach the basics of variables.  So, I decided to create a post about three ways to use these scales in your stats classes:  1) Teaching the basics of variables. 2) Nominal vs. ordinal scales.  3) Daily check-in with your students.  1. Teach your students the basics...

Using pulse rates to determine the scariest of scary movies

  The Science of Scare project, conducted by MoneySuperMarket.com, recorded heart rates in participants watching fifty horror movies to determine the scariest of scary movies. Below is a screenshot of the original variables and data for 12 of the 50 movies provided by MoneySuperMarket.com: https://www.moneysupermarket.com/broadband/features/science-of-scare/ https://www.moneysupermarket.com/broadband/features/science-of-scare/ Here is my version of the data in Excel format . It includes the original data plus four additional columns (so you can run more analyses on the data): -Year of Release -Rotten Tomato rating -Does this movie have a sequel (yes or no)? -Is this movie a sequel (yes or no)? Here are some ways you could use this in class: 1. Correlation : Rotten Tomato rating does not correlate with the overall scare score ( r = 0.13, p = 0.36).   2. Within-subject research design : Baseline, average, and maximum heart rates are reported for each film.   3. ...

Andy Field's Statistics Hell

Andy Field is a psychologist, statistician, and author. He created a funny, Dante's Inferno-themed  web site that contains everything you ever wanted to know about statistics. I know, I know, you're thinking, "Not another Dante's Inferno themed statistics web site!". But give this one a try. Property of Andy Field. I certainly can't take credit for this. Some highlights: 1) The aesthetic is priceless. For example, his intermediate statistics page begins with the introduction, "You will experience the bowel-evacuating effect of multiple regression, the bone-splintering power of ANOVA and the nose-hair pulling torment of factor analysis. Can you cope: I think not, mortal filth. Be warned, your brain will be placed in a jar of cerebral fluid and I will toy with it at my leisure." 2) It is all free. Including worksheets, data, etc. How amazing and generous. And, if you are feeling generous and feel the need to compensate him for the website, ...