Skip to main content

Posts

Yule Log(arithm) Alternative (Hypothesis) Presentations

My friends. For you, I have compiled all of my funny statistic-Christmas images into one Google Slide presentation. You are welcome:  https://docs.google.com/presentation/d/12nmMw-69Ez71VmzaZ_QbUx4NOXiP17LHnvDjWUrx094/edit?usp=sharing

NBC News' "This algorithm helps catch serial killers"

I don't find many examples of cluster analysis to share, but this example is REALLY engaging (using data to find serial killers), and is simple enough for a baby statistician BUT you can also make it a more advanced lesson as the data's owners freely share their data and code. Short Version: Journalist Thomas Hargrove (and his team) used cluster analysis to find clusters of similar killings within geographic areas. These might be a sign that a serial killer is active in that geographic region. It correctly identified a killer in Indiana. I found this interview from datainnovation.org which most succinctly describes the data analysis: https://www.datainnovation.org/2017/07/5-qs-for-thomas-hargrove-founder-of-the-murder-accountability-project/ Also statsy because the cluster analysis was validated using data from known serial killers. Hargrove's data and code can be accessed  here  and more information on his overall project to solve murders can be found...

Explaining chi-square is easier when your observed data equals 100 (here, the US Senate)

UPDATE: 2020 Data: https://www.catalyst.org/knowledge/women-government When I explain chi-square at a conceptual, no-software, no-formula level, I use the example of gender distribution within the US Senate. There are 100 Senators, so the raw observed data count is the same as the observed data expressed via proportions. I think it makes it easier for junior statisticians to wrap their brains around chi-square.  I  usually start with an Goodness-of-Fit (or, as I like to call them, "One-sies chi-squares").For this example, I divide senators into two groups: men and women. And what do you get?  For the 115th Congress, there are 23 women and 77 men . There is your observed data, both as a raw count or as a proportion. What is your expected data? A 50/50 breakdown...which would also be 50 men and 50 women. Without doing the actual analysis, it is pretty safe to assume that, due to the great difference between expected and observed values, your chi-square Goodness o...