Skip to main content

Posts

Showing posts with the label test of independence

YouGov America's Thanksgiving-themed chi-square examples

YouGov gifts us with seasonal chi-square examples  with data on Thanksgiving food controversies. For example: How do people feel about marshmallows on sweet potato dishes? This doesn't look randomly distributed to me. Which is more beloved: Light or dark turkey meat? If you want examples for the chi-square test of independence, dig into the PDF containing ALL of this survey's data. The distribution of people who like cranberry sauce by age group does not appear random.

Chi-square Test of Independence using CNN exit polling data

If you are trying to explain the Chi-Square Test of Independence to your students, here are some timely examples that are political and not polarizing. Well, I don't think it is polarizing. I'm sure there are people out there that disagree. Maybe some of the questions are polarizing? Regardless, it is nice to have an example that uses a current event with easy to understand data.  The example comes from  CNN. The network conducted exit polling during the 2020 presidential election . I'm sure they didn't intend to provide us with a bunch of chi-square examples, but here we are. Essentially, CNN divided Biden and Trump voters into many categories with not a parameter to be had. I have included a few of the tables here, but there are many others on the website .  They illustrate different designs (2x2, 2x3, 2x4, etc.) and different magnitudes of difference between expected and observed values. 

Explaining chi-square is easier when your observed data equals 100 (here, the US Senate)

UPDATE: 2020 Data: https://www.catalyst.org/knowledge/women-government When I explain chi-square at a conceptual, no-software, no-formula level, I use the example of gender distribution within the US Senate. There are 100 Senators, so the raw observed data count is the same as the observed data expressed via proportions. I think it makes it easier for junior statisticians to wrap their brains around chi-square.  I  usually start with an Goodness-of-Fit (or, as I like to call them, "One-sies chi-squares").For this example, I divide senators into two groups: men and women. And what do you get?  For the 115th Congress, there are 23 women and 77 men . There is your observed data, both as a raw count or as a proportion. What is your expected data? A 50/50 breakdown...which would also be 50 men and 50 women. Without doing the actual analysis, it is pretty safe to assume that, due to the great difference between expected and observed values, your chi-square Goodness o...