Skip to main content

"The Quest To Create A Better Spy-Catching Algorithm"

"(Algorithms) are used so heavily, they don't just predict the future, they are the future." -Cathy O'Neil

^This quote from this NPR story made me punch the air in my little Subaru after dropping my kid off to school. What a great sentence. There are many great one-liners in this little five-minute review of algorithms.

This NPR story by Dina Temple-Raston is a great primer for All The Ethical Issues Related To Algorithms, accessible to non- or novice-statisticians. It clocks in at just under five minutes, perfect as a discussion prompt or quick introduction to the topic.

How to use in class:

They talk about regression without ever saying "regression":

"Algorithms use past patterns of success to predict the future."

So, regression, right? Fancy regression, but that one line can take this fancy talk of algorithms and make it more applicable to your students. Sometimes, I feel like I'm just waving my hands when I try to explain this very, very important piece of regression but this report describes the prediction side of regression succinctly.

Bias In Algorithms:

"The feedback loop that reinforces lucky people's luck." Historically, who is likely to get promoted at large, successful organizations? White dudes. If that is part of your algorithm, you'll keep promoting only white dudes. Which isn't to say that there aren't qualified white dudes, but you will miss out on great women and POC.

"By learning from the past, algorithms are doomed to repeat the past" (Another great one-liner!)

False Positives:

Who are spies? White men who work for the US government and speak Russian. There are a lot of those! If the federal government flagged everyone who met that description, they would flag many, many non-spies.

Similarly, people who are engaging in corporate espionage tend to show up before everyone else or stay after everyone else. So they can be sneaky and unobserved. But a new parent may also work very early hours so they can leave early to accommodate their kid's school schedule, or an employee might stay late routinely because they have a regular, early PT appointment. An algorithm can't know this.

Algorithms still can't beat human insights (n = 2):

The bigger narrative in this piece has to do with how various government agencies attempt to use algorithms to uncover likely spies within their ranks. In two very high profile spying cases (Aldrich Ames and Jerry Chun Shing Lee) the spies were uncovered not by an algorithm but by human analysts who noticed odd behaviors and acted on those observations.

Comments

Popular posts from this blog

Ways to use funny meme scales in your stats classes

Have you ever heard of the theory that there are multiple people worldwide thinking about the same novel thing at the same time? It is the multiple discovery hypothesis of invention . Like, multiple great minds around the world were working on calculus at the same time. Well, I think a bunch of super-duper psychology professors were all thinking about scale memes and pedagogy at the same time. Clearly, this is just as impressive as calculus. Who were some of these great minds? 1) Dr.  Molly Metz maintains a curated list of hilarious "How you doing?" scales.  2) Dr. Esther Lindenström posted about using these scales as student check-ins. 3) I was working on a blog post about using such scales to teach the basics of variables.  So, I decided to create a post about three ways to use these scales in your stats classes:  1) Teaching the basics of variables. 2) Nominal vs. ordinal scales.  3) Daily check-in with your students.  1. Teach your students the basics...

If your students get the joke, they get statistics.

Gleaned from multiple sources (FB, Pinterest, Twitter, none of these belong to me, etc.). Remember, if your students can explain why a stats funny is funny, they are demonstrating statistical knowledge. I like to ask students to explain the humor in such examples for extra credit points (see below for an example from my FA14 final exam). Using xkcd.com for bonus points/assessing if students understand that correlation =/= causation What are the numerical thresholds for probability?  How does this refer to alpha? What type of error is being described, Type I or Type II? What measure of central tendency is being described? Dilbert: http://search.dilbert.com/comic/Kill%20Anyone Sampling, CLT http://foulmouthedbaker.com/2013/10/03/graphs-belong-on-cakes/ Because control vs. sample, standard deviations, normal curves. Also,"skewed" pun. If you go to the original website , the story behind this cakes has to do w...

Leo DiCaprio Romantic Age Gap Data: UPDATE

Does anyone else teach correlation and regression together at the end of the semester? Here is a treat for you: Updated data on Leonardo DiCaprio, his age, and his romantic partner's age when they started dating. A few years ago, there was a dust-up when a clever Redditor r/TrustLittleBrother realized that DiCaprio had never dated anyone over 25. I blogged about this when it happened. But the old data was from 2022. Inspired by this sleuthing,  I created a wee data set, including up-to-date information on his current relationship with Vittoria Ceretti, so your students can suss out the patterns that exist in this data.