Skip to main content

Posts

r/DataIsUgly

I have found plenty of class inspiration on Reddit. Various subs have provided a  new way to explain mode   and median  and great, intuitive data to teach  correlation . However, much as a reverse-coded item on a scale can be used to get to the opposite of what you are asking about, r/DataIsUgly is rife with examples of how NOT to do data as to teach how to create good data visualizations. Very recently, I shared this example from r/DataIsUgly to illustrate why NOT to truncate the Y axis .  And...this sub is filled with people like us. People who love to proofread and notice data crimes. For example: How to use it in class? Can your students figure out why these data visualizations are...less than optimal? Can they fix them? They could be a fun prompt for extra credit points or a discussion board.

Annual snow fall moderates the relationship between daily snow fall and the likelihood of canceling school

Moderation isn't one of those things that we typically teach in Intro Stats. But it is a statistical tool your advanced undergraduates will likely encounter in an upper-level course. I'm not going to teach you how to teach your students how to do one. I am, however, going to share a  example of what mediation is doing, inspired by living in the city in the US that has received the most snow this season (Erie, PA, with 93.9 inches for the season as of 1.30.25).  About a year ago, CNN shared data on how much snow it takes to cancel school in various parts of the country. I assure you, Erie and the rest of Northwest PA (see red outline) gets hella snow but no snow days. https://www.cnn.com/2024/02/12/us/how-much-snow-kids-school-snow-day-across-us-dg/index.html However, our lack of snow days isn't due to lack of snow. The annual amount of snow moderates the likelihood to cancel school, such that if you are used to a lot of snow (and have the infrastructure to handle it) you d...

Absolute vs. relative risk reporting: Lake effect snow edition

I maintain that relative versus absolute risk is a concept that we absolutely must teach in intro stats. I have given some examples of this before ( murder ! COVID !) but here is another one that hits home for my fellow Great Lakers. In particular, this one is for my friends in Detroit, Toledo, Cleveland, and Buffalo  Up here in Erie, a common point of discussion is how frozen Lake Erie is. Because once it freezes, the lake's moisture no longer feeds Dread Lake Effect Snow.   I like this example because you can easily perform the math in front of your class, demonstrating that 26.14% is 103.45% of 12.85%. At the same time, you have the visual to demonstrate that the vast majority of the lake was still unfrozen even with a 103.45% increase.