Skip to main content

Posts

chartr's "Speed or Accuracy? It's hard to do both in fast food drive-thrus"

Sometimes, you just need a new, simple example for a homework question or a class warm-up.   I eyeballed and entered the   data here  ( r   = -.55). Enjoy. I use this little example to explain to use the regression formula to make a prediction. Here are my slides .

Between and within group variance, explained with religion, politics, and climate change.

Ages ago, I shared how I teach ANOVA at a conceptual level. I describe within and between group variance using beliefs about the human role in climate between and within different religious groups. This data is now old. And it described global warming, not climate change, which is a crucial language distinction. So you  can imagine my delight when Pew recently released  updated and improved data investigating this issue.  In my attempt to keep the mood light when discussing an example featuring 1) religion, 2) climate change, and 3) politics, I ask students to think about how many different opinions are probably represented around their family's Thanksgiving table. Despite having much in common as a family, like, perhaps, geography, shared stories, and religion, there are still a lot of within-group differences of opinion. This leads to a discussion about people of different religions having between and within group differences of opinion regarding beliefs about global cl...

YouGov America's Thanksgiving-themed chi-square examples

YouGov gifts us with seasonal chi-square examples  with data on Thanksgiving food controversies. For example: How do people feel about marshmallows on sweet potato dishes? This doesn't look randomly distributed to me. Which is more beloved: Light or dark turkey meat? If you want examples for the chi-square test of independence, dig into the PDF containing ALL of this survey's data. The distribution of people who like cranberry sauce by age group does not appear random.

Organizations sharing data in a way that is very accessible

A few weeks ago, I posted about how you can share data in such a terrible way that one is not breaking the law, but the data is completely unusable. This makes me think of all the times I am irked when someone states a problem but doesn't offer a solution to the problem. Instead, they just talk about what is wrong and not how it could be. So, as a counter piece, let's cheer on organizations that ARE sharing data in a way that is readily accessible. You could use this in class as a palate cleanser if you teach your students about data obfuscation. You could also use it as a way of helping your students understand how data really is everywhere. Or even challenge them to brainstorm an app that uses readily accessible data in a new way to help folks.  Pro-Publica This website lets you check how often salmonella is found at different chicken processing plants. All you need to do is enter the p-number, company, or location listed on your package of chicken: https://projects.propubli...

History of Data Science's Regression Game

 There are already some pretty cool games for guessing linear relationships/regression lines. Dr. Hill's Eyeball Regression Game . The old, reliable Guess the Correlation game. However, I found a new one that has a particularly gorgeous interface, and a few extra features to help your learners. History of Data Science created the Regression game . It provides the player with a scatter plot, then the player needs to guess the y-intercept and slope. See that regression line? It is generated and changes as the entered a and b values change, which is a good learning tool. If played at the "easy-peasy" level, the player can even change those numbers multiple times over the course of 30 seconds, and watch as the corresponding line changes.  I think this game is a nice way to break up the ol' regression lecture and allows students to see the relationship between the scatter plot and the regression line.

Stats nerd gift list

This isn't a post full of teaching resources. Instead, it is a post of gifts and treats for stats nerds. Who might also teach stats, this still falls under the purview of this blog. Bonus points because many of these suggestions put money into creators' pockets. Statsy Etsy Shops NausicaaDistribution Etsy shop NausicaaDistribution is a great shop on Etsy . I own multiple products, including the ABC's of Statistics Poster shown below. It is beautiful and framed in my office.  The Chemist Tree Etsy shop Another Etsy maker I like is  TheChemistTree.  I have a set of the coasters, and they've held up well.  https://www.etsy.com/shop/TheChemistTree?ref=simple-shop-header-name&listing_id=501955501&search_query=statistics Chelsea Parlett Design Etsy Shop Stats expert Chelsea shares her stats knowledge on Twitter and on Etsy , via her stickers.  https://www.etsy.com/shop/ChelseaParlettDesign DataSwagCo is a newer shop with some funny, punny stats goods. https:/...

Dirty Data: Share the data in a way that is functionally inaccessible

In my intro stats class, we discuss shady data practices that aren't lying because they report actual numbers. But they are still shady because good data is presented in such a way as to be misleading or confusing. These topics include: Truncating the y-axis   Collecting measures of central tendency under ideal circumstances Manipulate online ratings (I didn't write the blog post about this yet, but it is coming). Relative vs. Absolute Risk AND HERE IS ANOTHER ONE: Insurance companies were asked to provide price data  RE: the Transparency in Coverage Rule in the Consolidatedated Appropriations Act of 2021. Google that if you want to know more about that, I'm not going into that. Not my lane. That said, it is an appealing idea. Let's have some transparency in our jacked-up healthcare system. And the insurance companies provided the data, but in a way inaccessible to most people. Like, all people, maybe? Because they just splurted out 100 TB of data. So, they totally com...