Not awful and boring ideas for teaching statistics

Posts

Showing posts with the label music

Modern musician vocabularies: See how I extracted this data using GenAI, and how you can use it in class.

I intended for this to be a post about the singer vocabulary. It is still that, but it is also a post about using GenAI to grab data from an image. I mean, you can use Excel to do the same thing, but GenAI is a lot easier. Here we go. It starts with the Word Tips website, which helps you solve your crossword puzzles and Wordle. This website also has a blog dedicated to words. One such blog post explored which singers have the largest vocabularies, as measured by the number of unique words in their lyrics. Their blog post compared music legends to newer talent. There are a ton of fun data visualizations on the website; go check it out. Since I teach college students, I decided to concentrate on the musicians my students listen to: In and of itself, this image serves as an example of bar graphs, good data visualization, and proper use of "buckets". However, I figured we could find a way to use the raw data in class. Create your own data visualization, create your own buckets....

Full Discussion Board Idea #2: Trends in love songs, as illustrated by The Pudding

You aren't a proper stats nerd if you have not scrolled for an hour through all of The Pudding's content . Thank goodness for The Pudding, which helped me spice up the discussion boards in my online stats class. For a long time, I emphasized rigor over wonder. In my stats class, I had topics that were functionally reasonable but not terribly engaging for class discussion. That changed last semester. I spiced up my discussion board with some of my favorite data visualizations, like this one about using a fast-food app to track power outages after a natural disaster, and this one that illustrates data on the efficacy of nutritional supplements in a beautiful, functional way. Here is another that lets students look at trends in art and wonder about how this may reflect on cultural shifts in courting and romantic relationships . TL;DR The Pudding recently shared a post about trends in love songs from 1958 through 2023. The whole inter...

Generate highly personalized music data using Exportify

Spotify generates gobs of data about music. Most people have seen the end-of-the-year data Spotify generates for each user about their listening patterns . Most people don't know that Spotify also generates a lot of data about individual songs. Some of it is straightforward: tempo, genre, length. However, Spotify also has its own niche way of quantifying songs: Danceability. Accousticness. Here is a whole list of their variables and descriptions from researchers at CMU: https://www.stat.cmu.edu/capstoneresearch/315files_s23/team23.html What does this mean for a stats teacher? You have access to highly personalizable data sets, rooted in music, with gobs and gobs of variables for each song...or artist...or album...or year of release...or genre (like, so many ways to divide up your data). For instance, I created a data set with Spotify data for 1989 and 1989 (Taylor's Version) to teach paired t -tests . How do Taylor's re-recordings compare to the originals?...

Daniel's "Most timeless songs of all time"

This article, written by Matt Daniels for The Pudding , allows you to play around with a whole bunch of Spotify user data in order to generate visualizations of song popularity over time. You can generate custom visualizations using the very interactive sections on this website. For instance, there is a special visualization that allows you to finally quantify the Biggie/Tupac Rivalry. So, data and pop culture are my two favorite things. I could play with these different interactive pieces all day long. But there are also some specific ways you could use this in class. 1) Generate unique descriptive data for different musicians and then ask you students to create visualizations using the software of your choosing. Below, I've queried Dixie Chicks play data. Students could enter their own favorite artist. Note: They data only runs through 2005. 2) Sampling errors: Here is a description of the methodology used for this data: Is this representative of all data...

ANOVA example using Patty Neighmond's "To ease pain, reach for your play list."

I often share news stories that illustrate easy-to-follow, engaging research that appeals to undergraduates. For the first time, I'm also providing a mini data set that 1) mimics the original findings and 2) provides an example of ANOVA. This story by Patty Neighmond , reporting for NPR, describes a study investigating the role of music in pain reduction. The study used three groups of kids, all recovering from surgery. The kids either 1) listened to music, 2) listened to an audio books, or 3) sat with noise-cancelling ear phones for 30 minutes. The researchers found that kids in both the music and audio book experienced pain reduction levels comparable to over-the-counter pain medication while the control group enjoyed no such benefits. And the research used the 10-point FACES scale, allowing for a side discussion about how we collect data from humans who don't have the best vocabularies or limited communication skills. This study can also be used as a way t...

Priceonomic's Hipster Music Index

This tongue-in-cheek regression analysis found a way to predict the "Hipster Music Index" of a given artist by plotting # of Facebook shares of said artist's Pitchfork magazine review on they y-axis and Pitchfork magazine review score on the x-axis. If an artist falls above the linear regression line, they aren't "hipster". If they fall below the line, they are. For example, Kanye West is a Pitchfork darling but also widely shared on FB, and, thus demonstrating too much popular appeal to be a hipster darling (as opposed to Sun Kill Moon (?), who is beloved by both Pitchfork but not overly shared on FB). As instructors, we typically talk about the regression line as an equation for prediction, but Priconomics uses the line in a slightly different way in order to make predictions. Also, if you go to the source article, there are tables displaying the difference between the predicted Y-value (FB Likes) for a given artist versus the actual Y-value, which coul...

Matt Daniel's "The Largest Vocabulary in Hip Hop"

a) The addition of this post means that I now have TWO Snoop Dogg blogg labels for this blog. b) Daniels' graph allows students to see archival data (and research decisions used when deciding how to analyze the archival data as well as content analysis) in order to determine which rapper has the largest vocabulary. Here is Matthew Daniels interactive chart detailing the vocabularies of numerous, prominent rappers. Daniels sampled each musician's first 35,000 lyrics for the number of unique words present. He went with 35,000 in order to compare more established artists to more recent artists who have published fewer songs. (The appropriateness of this decision could be a source of debate in a research methods class.) Additionally, derivatives of the same word are counted uniquely (pimps, pimp, pimping, and pimpin count as four words). This decision was guided, from what I can gather, by the time of content analysis performed. Property of Matthew Daniels...note: The ori...

A.V. Club's "Shirley Manson takes BuzzFeed's "Which Alt-Rock Grrrl Are You?" quiz, discovers she's not herself"

Lately, there have been a lot of quizzes popping up on my Facebook feed ("What breed of dog are you?", "What character from Harry Potter are you?"). As a psychologist who tinkers in statistics, I have pondered the psychometric properties of such quizzes and concluded that these quizzes where probably not properly vetted in peer-reviewed journals. Now I have a tiny bit of evidence to support that conclusion. What better way to ensure that a scale is valid than by using the standard of concurrent validity (popular in I/O psychology)? This actually happened when renowned Shirley Manson Subject Matter Expert, Shirley Manson, lead singer of the band Garbage, took the "Which Alt-rock Grrrl are you?" quiz and she didn't score as herself (as she posted on Facebook and reported by A.V. Club ). From Facebook, via A.V. Club An excellent example of an invalid test (or concurrent validity for you I/O types).