Skip to main content

Posts

Showing posts with the label Hartnett Chapter 2

Mr. Beast gave us a real-life bee swarm plot.

Hey, I have kids, so I knew that Mr. Beast made a video where 100 competitors, one person from every age from 1-100, competed in feats for $250K.  In the very first competition, competitors ran a footrace, and the top five in each age category advanced to the next round.  Image from: https://www.reddit.com/r/data_irl/comments/1r15ecq/data_irl/ Anyway, in doing so, Mr. Beast inadvertently created a jitter plot using humans. Age group/starting line is at the top of the image, with the checkered finish line at the bottom. The dark blue/light blue columns are a nice touch, too. How to use in class: 1) Pander to your students by using a Mr. Beast example. 2) Ask your students to interpret the data. What can be learned from this image? The basics of bee plots. As expected, the 11-20, 21-30, and 31-40 groups ran the fastest. However, I think 31-40 was the slowest of the three groups, with a bit more variability.  3) I guess this would also be a good example of a non-linear ...

Andi Putt's infographics on autism prevalence demonstrate y-axis truncation and the surveillance effect.

This example illustrates how better assessment has likely increased in autism diagnoses (as opposed to the increase being due to vaccines or hysterical parents). It does a good job of illustrating truncated y-axes and the surveillance effect. It also reminds our psychology majors that we have many professional allies and colleagues outside of psychology. Like speech language pathologists.  I found these examples (see below) on Facebook from speech-language pathologist/excellent science communicator  Mrs. Speechie P.   AKA Andi Putt. How to use in class: 1. Truncated y-axis I like how she mentions that truncated y axes can be a scare tactic. I also like that she shows there are still relatively few in the total population. https://www.facebook.com/photo/?fbid=1327073175898079&set=a.463959318876140 On this theme, she shared a second image that does a really good job of showing how proper diagnosis isn't the same thing as fake/inflated diagnoses (a common argument in ant...

Percentiles, bee swarm plots, Bureau of Labor Statistics data...so many lessons in one interactive chart.

 There are so many ways to use this tool: Nathan Yau's Flowing Data is one of those websites I check every few days for statistical inspiration. He shares  the work of others and his own, including this  interactive bee swarm plot that illustrates salaries  for various  jobs. The bee plot, with the cursor of Psychology Teachers. https://flowingdata.com/2025/09/09/salary-and-occupation-2024/ There are many ways to use this in stats class: 1. At some point, you should talk about career exploration with your students.   2. Statistics students should be learning about modern data visualizations like this jitter plot, aka bee swarm plot.  3. If you cursor over any dot, you can see the 25th and 75th percentile scores and n size for that occupation's salary. 4. The size of each circle corresponds to the n size. Which I love because jitter plots do a great job of illustrating variability in a data set. However, each data point here represents an average...

UFO sightings peak on the Fourth of July. That's all.

 I'm surprised I haven't shared this in this space already. It is one of my favorite data points ever. Clearly, I have favorite data points. https://www.economist.com/graphic-detail/2019/07/04/are-extraterrestrials-extra-patriotic How to use in class? 1) There is data for EVERYTHING if you look hard enough, 2) WHY might this relationship exist (heat stroke, staring at the sky, drinking, freedom, fireworks)?  If you like this example, check out my W.W. Norton & Co. textbook,  Psychological Statistics for Everyone . 

PWA data visualizations on YouTube

A clever YouTuber, PWA , built a channel with nearly a million followers based on animated videos that compare nations based on data. Every nation is a sassy sphere. Each grows and shrinks in size, in comparison with other nations, as the data is presented. Like this image, illustrating national debt as a portion of GDP... I swear, it is funny and engaging without trying too hard. Also, for better or worse, framing data sharing and visualization as a thing that can make you a successful influencer WILL grab your students' attention. I think these videos would make good  bell-ringer s ( TM Janet Peters) for the start of your class. This influencer makes a ton of videos, and they aren't all related to data, FYI. Here are a few good examples for Stats class:  National Debt: In this clip, the countries compare their national debt. This video discusses some of the choices statisticians make with their data. For example, they compare their national debt in USD and then compare...

Dima Yarovinsky's "I Agree": Data visualization meets installation art piece.

Look at how Dima Yarovinsky turned the Terms and Conditions documents for several social media platforms into foreboding and beautiful art/bar graphs illustrating how much we sign away without reading. Note: He even uses the X axis to describe the length of and reading time for each T&C statement!  I think data is beautiful. This example does a good job of showing the beauty and impact of good data visualizations to my students. This isn't a huge example to use in class, but I will use it next time I discuss bar graphs. For more from the artist, in his own words, visit his webpage .  For a thought review of this art, see this article by Emma Taggart .

r/DataIsUgly

I have found plenty of class inspiration on Reddit. Various subs have provided a  new way to explain mode   and median  and great, intuitive data to teach  correlation . However, much as a reverse-coded item on a scale can be used to get to the opposite of what you are asking about, r/DataIsUgly is rife with examples of how NOT to do data as to teach how to create good data visualizations. Very recently, I shared this example from r/DataIsUgly to illustrate why NOT to truncate the Y axis .  And...this sub is filled with people like us. People who love to proofread and notice data crimes. For example: How to use it in class? Can your students figure out why these data visualizations are...less than optimal? Can they fix them? They could be a fun prompt for extra credit points or a discussion board.

Truncated Y-axis, but with female celebrities.

Why did I find this after my textbook was published? Damn it. I have a whole section about how Y-axis manipulation can make small differences look huge and then...I find this. Damn it. Source:   https://www.reddit.com/r/dataisugly/comments/1hjr01o/height_of_female_popstars/

An interactive that gets your students thinking about medians, percentiles, and their own sleeping habits.

My students struggle with sleeping and are distracted by electronics. This interactive activity allows them to think about their sleep relative to norms regarding age and sex. It also dives deeply into how sleep changes over a person's lifespan, which is a topic suitable for non-static classes like Health or Developmental.   https://www.washingtonpost.com/wellness/interactive/2024/sleep-data-survey-americans/ *You need a WaPo subscription or paywall buster to get to this interactive. Like this one! https://www.removepaywall.com/search?url=https://www.washingtonpost.com/wellness/interactive/2024/sleep-data-survey-americans/ Here is a quick interactive that a) lets your students see how well they sleep, in comparison to their demographic and b) think about median data and percentile data.  1. Repursped, gently used data is really everywhere. This interactive uses data from the Census Bureau. Which is a way to measure sleep, but not the only way. 2. Median and percentil...

Uncrustables consumption rates by NFL teams 1) do not vary by league, 2) do not correlate with 2023 wins

Many thanks to Dr. Sara Appleby for sharing this data with me!! I really enjoy silly data, like this  one from Jayson Jenks, writing for  The Athletic,  which shows how many Uncrustables each team eats per  week. Well, data from the teams that elected to participate and/or didn't make their own PB and Js. The whole article is fun, so give it a read. It makes sense that hungry athletes would go for a quick, calorie-dense, nostalgic snack containing protein.  Here is the data visualization:  Damn, Denver.  I entered this data into a spreadsheet for all of us. Spoiler alert: The number of Uncrustables eaten per week does not vary by league (independent t -test example), and the number of wins in 2023 does not correlate with the number of Uncrustables eaten per week in 2023 (correlation/regression example). Also, for my own curiosity, I re-ran the data after deleting Denver, and it wasn't enough of a difference to achieve significance.  

How can we use data to determine the scariest horror movie?

  The Science of Scare project, conducted by MoneySuperMarket.com, recorded heart rates in participants watching fifty horror movies to determine the scariest of scary movies. Below is a screenshot of the original variables and data for 12 of the 50 movies provided by MoneySuperMarket.com : https://www.moneysupermarket.com/broadband/features/science-of-scare/ Here is my version of the data in Excel format . It includes the original data plus four additional columns (so you can run more analyses on the data): -Year of Release -Rotten Tomato rating -Does this movie have a sequel (yes or no)? -Is this movie a sequel (yes or no)? Here are some ways you could use this in class: 1. Correlation : Rotten Tomato rating does not correlate with the overall scare score ( r = 0.13, p = 0.36).   2. Within-subject research design : Baseline, average, and maximum heart rates are reported for each film.   3. T-tests : The is a sequel/has a sequel data can be used to perform a ...