Not awful and boring ideas for teaching statistics

Posts

Showing posts from August, 2016

Everything is fucked: The syllabus, by Sanjay Srivastava (with links to articles)

This syllabus for PSY 607: Everything is Fucked , made the rounds last week. The syllabus is for a course that purports that science is fucked. The course readings are a list of articles and books that hit on the limitations of statistics and research psychology ( p -values, shortcomings of meta-analysis, misuse of mediation, replication crisis, etc.). PSY 607 isn't an actual class ( as author/psychologist/blogger Srivastava explains in this piece from The Chronicle ) but it does provide a fine reading list for understanding some of the current debates and changes in statistics and psychology. Most of articles are probably too advanced for undergraduates but perfectly appropriate for teaching graduat e students about our field and staying up to date as instructors of statistics. Here is a link to the original blog post/syllabus.

Harris' "How Big A Risk Is Acetaminophen During Pregnancy?"

This study, which found a link between maternal Tylenol usage during pregnancy and ADHD, has been making the rounds, particularly in the Academic Mama circles I move in. Being pregnant is hard. For just about every malady, the only solution is to stay hydrated. With a compromised bladder. But at least pregnant women have Tylenol for sore hips and bad backs. For a long time, this has been the only safe OTC pain reliever available to pregnant women. But a recent research article has cast doubt on this advice. A quick read of this article makes it sound like you are cursing your child with a lifetime of ADHD if you take Tylenol. A nd this article has become click-bait fodder. But these findings have some pretty big caveats. Harris published this reaction piece at NPR . It is a good teaching example of media hype vs. incremental scientific progress and the third (or fourth or fifth) variable problem. It also touches on absolute vs. relative risk. NOTE: There are well-documente...

Ahn Le's "Gotta plot ‘em all!"

This example is a little out of my wheel house, but I'm putting it up here for those of you who teach more advanced UG stats or grad stats. I have never taught Principle Component Analysis. But Anh Le, PhD candidate at Duke, provides a detailed description of PCA in R AND does so using data that your advanced undergraduate/graduate students will enjoy: Pokemon. So, Le downloaded data for each of the 151 Pokemon (individual stats for the strengths and weakness of each Pokemon, and provided a link so that you can download the data as well). He even included the code he used to create his PCA via R AND he does a nice job talking the reader through his process and what the findings mean. At 37, I didn't realize how much my traditionally-aged college students love Pokemon. Pokemon came up in my undergraduate I/O class three years ago, and I was shocked by how much nostalgia my then-20 year old students felt for the franchise. I think that it is certainly experiencing a rev...

u/dat data's "Why medians > averages [OC] "

Unsettling. But I bet your students won't forget this example of why mean isn't always the best measure of central tendency. While the reddit user labeled this as example median's superiority, you could also use this as an example when mode is useful. As statisticians, we often fall back on to mode when we have categories and median when we have outliers, but sometimes either median or mode can be useful when decimal points don't make a lot of sense. Here is the image and commentary from reddit: And this an IG posting about the data from the same user, Mona Chalabi from fivethirtyeight. I included the Instagram because Chalabi expands a bit more upon the original data she used. https://www.instagram.com/p/BIVKJrcgW51/

Anscombe's Quartet

No, Wikipedia isn't a proper resource for our students to cite. But it is not without merit. For example, I think the information it provides on Anscombe's quartet is very useful. This example provides four data distributions. For each, the means and variances for both the X and Y variables are identical. The correlations between X and Y, and the regression lines, are also identical. This is the descriptive/inferential data that applies to each of the four graphs I have seen variations upon this in textbooks over the years, but typically they just show how different distributions can have the same mean and standard deviation. I think this example goes the extra mile by including r and the regression line. How to use in class: -Graphs aren't for babies. They can be an essential part of understanding your data. -Outliers are bad! -The original data is also included at the Wikipedia entry if you would like your students to create these graphs in class.