Where Knowledge Junkies Get Their Fix
McAfee Secure sites help keep you safe from identity theft, credit card fraud, spyware, spam, viruses and online scams
Chris Higgins
How Nate Silver Predicted Obama’s Win
by Chris Higgins - February 5, 2009 - 2:54 PM

Nate SilverThroughout last year’s election, Nate Silver ran a fascinating website called FiveThirtyEight, named after the number of electors in the United States electoral college. Silver is a statistician, and he spent every day applying his considerable math chops to the various problems of the election — namely, who would win, where, and by how much. By aggregating and analyzing publicly available poll data, Silver developed a series of statistical models that predicted, with stunning accuracy, what would happen on election day.

Silver got his start as a baseball statistician before he started FiveThirtyEight. A New York Magazine profile from last fall explains:

…At his day job, Silver works for Baseball Prospectus, a loosely organized think tank that, in the last ten years, has revolutionized the interpretation of baseball stats. Furthermore, Silver himself invented a system called PECOTA, an algorithm for predicting future performance by baseball players and teams. (It stands for “player empirical comparison and optimization test algorithm,” but is named, with a wink, after the mediocre Kansas City Royals infielder Bill Pecota.) Baseball Prospectus has a reputation in sports-media circles for being unfailingly rigorous, occasionally arrogant, and almost always correct.

When Silver turned his attention to politics, his baseball stats experience came in handy. Silver’s models are predictive, meaning that his intent was to figure out what would happen on a certain date (November 4) based on the trend and accuracy of all the available data. This is a little different from what most political pollsters do — their typical task is to take a snapshot of the electorate on a given day, then analyze that. From the New York article:

As pollster John Zogby put it to me, “We take snapshots. And when you take many snapshots in a row, you get motion pictures.”

Similar to how a sports prediction seeks to predict a larger outcome (like which team will win the pennant), based on achieving a series of sub-goals (teams winning various games and playoffs), Silver saw the presidential and congressional elections as a series of smaller games in each state (primaries) that would select the players (candidates), who would eventually compete in the World Series (the election). He had tons of data coming out each day from existing pollsters. He just needed a methodology to use that data, and pull good predictions from it. The New York profile continues:

So he came up with a system that predicts a pollster’s future performance based on how good it’s been in the past. In finding his average, Silver weights each poll differently—ranking them according to his own statistic, PIE (pollster-introduced error)—based on a number of factors, including its track record and its methodology. One advantage of this system is that, during the primaries, the system actually got smarter. Because each time a poll performed well in a primary, its ranking improved.

For the general election, this gets trickier, since you have polls coming out every single day and you can’t know which ones are getting it right until Election Day. You can, however, weigh these new polls based on the pollster’s history, the poll’s sample size, and how recently the poll was conducted. You can also track trends over time and use these trend lines to forecast where things will end up on November 4. You can also, as Silver has done, analyze all the presidential polling data back to 1952, looking for information as to what is likely to happen next. (For example, how much the polls are likely to tighten in the last month of the race, which they traditionally do.) You can also run 10,000 computer simulations of the election every day based on your poll projections. (Think of this as sort of like that scene at the end of WarGames, where the computer blurs through every possible nuclear-war scenario.) As of October 8, the day after the town-hall debate, Silver’s simulations had Obama winning the election 90 percent of the time.

In the end, Silver got it right. His models predicted the popular vote within 1 percentage point as well as predicting 49 of 50 state races correctly. After the election, Silver has continued to analyze special topics, like the Coleman/Franken recount, and his work there has been entertaining as well as enlightening (see his post on real Minnesota ballots entitled Brett Favre Beats Lizard People).

If you’re interested in politics, statistics, or even baseball, I think you’ll enjoy the New York profile of Silver, which came out before the election was decided. For a bit of after-the-fact analysis, check out this New York Times article from November 9. In any case, Nate Silver is a kind of nerd hero for me: a man whose statistical superpowers brought him to national prominence. Let’s hear it for math!

Comments (11)
  1. I can’t help but think: He predicted Obama would win…? He needed statistical analysis for that? I haven’t seen such a slam dunk since Reagan’s second run/win!

  2. Jeff – it’s the details of the win that mattered. He predicted state by state totals, electoral college votes, and popular vote — all quite accurately.

  3. Not to mention he was picking Obama way before anyone else was picking him to win (when everyone thought Hillary was going to run away with it).

  4. I’d be curious to know if Silver’s political statistical models take into account the influence (if any) of his published results on public opinion and, subsequently, on the polls themselves.

  5. I watched his site all through the election. It was spectacular. For any nay-sayers out there, he took statistical data from all the major polls, so there was no way he was slanted (toward his own political leanings).

    Plus, unless I’m mistaken, didn’t he predict the exact number of electoral votes that Obama actually won? (He may have been off by one or two.) That is very impressive.

    Check out the site. It’s really cool.

  6. @Kikadee – yeah, that’s a good question. I think he was on TV a fair bit towards the end of the campaign, so it’s likely that his own quasi-journalism/analyst discussion influenced the electorate. I’m also really curious what his readership was — were they all lefties enjoying the good news? Or did he have bipartisan appeal?

    It’s also possible that his own discussion of pollster methodology changed their methodology, though I don’t recall a specific incident of that.

  7. Mike he was off by 1 electoral vote, the nebraska split vote kept him from being perfect.

  8. I watched his site all through the election. It was spectacular. It’s also possible that his own discussion of pollster methodology changed their methodology, though I don’t recall a specific incident of that.

  9. I can’t believe this. I KNOW Nate Silver– we grew up together– I was friends with his little sister, and my brother was friends with him. I guess I knew he was smart, but I had no idea. I just phoned my mother to tell her about this article and she said, “I always told his mom he’d be president some day.”

  10. Yeah, it wasn’t too hard to guess that Obama would be elected. It seemed like everyone was in love with him from early on, and that certainly helped.

    In my government class, we did a questionnaire about the incumbent party that, if it got a majority of “no’s,” would mean they would lose. Sure enough, they lost–but only by a slight margin.

    All that is fine and well, but to prove it with statistics, and to come close to the actual results? That’s impressive.

    reCaptcha: usually here

  11. More and more I keep thinking maybe I should take a statistics class. Maybe Heinlein was right: It all comes down to numbers.

Comment

commenting policy