Skip to main content

Posts

Latest Post:

Andi Putt's infographics on autism prevalence demonstrate y-axis truncation and the surveillance effect.

Recent posts

This example starts with a chi-square but ends with a lesson on how even well-written prompts can result in hallucinations.

A research study counted how often ChatGPT made up citations for three different categories of mental disorders (binge eating, body dysmorphic, and major depressive). They used a chi-square to determine if rates of made up citations differed by disorder (they do).  If ever there was an article that belonged on this blog, this is it. You can use it in your stats class as an example of chi-square and/or as a warning to students if you ask them to perform literature reviews for your class. The original paper, Influence of topic familiarity and prompt specificity on citation fabrication in mental health research using large language models: Experimental Study was published in December 2025, and summarized by PsyPost  shortly after publishing.  What the researchers did: What the researchers found: How to use in class: 1. This is a good chi-square results section. They shared the test value and the p value, of course, but I like how they shared the varying rates of inaccuracy...

Use spicy, spicy peppers to explain scales of measurement and/or the difference between categorical and continuous data.

This spicy example explains scales of measurement, continuous vs. categorical variables, and how you can measure and quantify anything.  Uncommon Goods sells quirky gifts. While I was looking for Christmas gifts last year, I came across this kit https://www.uncommongoods.com/product/scoville-scale-chili-pepper-tasting-kit#618780000000 I have a teenage son, and teen boys love this sort of stuff. Actually, I think spicy peppers are enjoying increased popularity due to the Hot Ones show (The show where celebrities eat increasingly hot chicken wings while being interviewed, like Jennifer Lawrence and her famous GIF from the show).  Maybe you could link this example back to that show? Welcome to how my brain works. Anyway, among the information Uncommon Goods shared about this kit was an image of the packaging for the kit, detailing the Scoville scale rating for each pepper: And my stats teacher brain translated this packaging into this, since the same data (hottness) is presented ...

A good JAMA article that demonstrates how to appropriately share relative and absolute risks.

TL:DR: Sugary drinks might up your risk for oral cavity cancer (so says relative risk) but probably won't be the thing that kills you (so says absolute risk).  In depth: I love teaching applied statistics, including showing my students how to identify and properly attention-grabbing examples of relative risk ( 1 , 2 ). HOWEVER, relative and absolute risk aren't lying. But they can scare people, so I think it is important to share both calmly.  This example from JAMA Otolaryngology is a good example of how to responsibly share relative and absolute risk. It has a very calm, non-click bait article title:  https://jamanetwork.com/journals/jamaotolaryngology/fullarticle/2831121 Cool. Also, thanks for using female research participants. Next, the results are described in a non-salacious manner, with the absolute risk in red and relative risk in blue. How could you use this in class? Sharing relative risk isn't in and of itself unethical. Using it to scare people is questi...

Z scores suggest that British parlimentarians are using ChatGPT to write speeches.

I came across this article on social media: https://www.pimlicojournal.co.uk/p/mps-are-almost-certainly-using-chatgpt This got my attention, because I'm sick of people ragging on college students using AI. EVERYONE is using AI. That doesn't mean it is always OK or evil, but let's stop ragging on the kids. Anyway, the author used data to make their claims via z scores: https://www.pimlicojournal.co.uk/p/mps-are-almost-certainly-using-chatgpt Ways to use in class: 1. I like to talk to students about data as evidence. In science, it can be evidence to reject or not reject a hypothesis. In real life, it can track trends, both innocuous and suspicious. 2. This is another way of talking about z scores, a crucial but less exciting aspect of basics statistics. As best as I can tell, this was the z score formula used:  frequency z score = (number of times phrase was used in a year - mean times the phrase was used in all years)/standard deviation of number of times the phrase was...

Do Taylor Swift's album variants violate the assumption of independence?

No, that is not a dig at her romantic relationships. It is instead a question about the impact of her numerous variants on descriptive data in the music industry. And if you found your way to this blog, you know that I love a good, relevant, pop-culture-driven example for explaining statistical concepts. Especially somewhat dry concepts, like the assumption of independence when collecting data. Anyway. Per Microsoft Copilot: Across all formats, there are 34 different versions of the album: 8 vinyl 18 CD 1 cassette 7 digital Microsoft Copilot. (2025, November 8). Response to query about Taylor Swift’s album variants [AI-generated response]. Microsoft Copilot. https://www.reddit.com/r/TrueSwifties/comments/1n04kdu/the_life_of_a_showgirl_vinyl_variants_announced/ When overall album sales are counted by major industry players (Billboard, Luminate), every variant sale counts as one sale.  So, she is  selling many albums, but it raises the question of whether her sal...

A memorable example of Goodhart's Law for all of my psychometric/assessment instructors.

Goodhart's Law is a truism in assessment circles, which are always statistics-adjacent. And that is why I'm sharing this fine embodiment of Goodhart's Law on my blog. Always pair the important stuff with something ridiculous, I swear, it makes it easier to remember the important stuff.