title: "Hacking Jeopardy" slug: "hacking-jeopardy" description: "The story of AI game shows — from Watson on Jeopardy to modern language models." datePublished: "2012-10-16" dateModified: "2026-03-15" category: "Data Strategy" tags: ["IBM Watson", "Jeopardy", "AI", "game shows"] tier: 3 originalUrl: "http://www.applieddatalabs.com/content/hacking-jeopardy" waybackUrl: "https://web.archive.org/web/20121016212636/http://www.applieddatalabs.com:80/content/hacking-jeopardy"

Hacking Jeopardy

"Have you ever crammed for a test? Several tests? How about for all human knowledge?" That's how we started our 2012 article about Roger Craig, the man who used data science to hack Jeopardy. But there's a delicious irony here: we wrote about a human using data science to win Jeopardy the same year IBM's Watson had already beaten the best human champions at the game. Both stories were about the same thing -- using systematic analysis to win at trivia -- but they pointed in completely different directions.

Roger Craig's approach was beautiful. It was human data science at its best.

The Human Hacker

Craig was a data scientist and a lifelong Jeopardy fan who had failed three auditions for the show. So he reverse-engineered the game. He found a database of over 200,000 old questions, clustered and analyzed them, and discovered patterns. Jeopardy doesn't ask about everything equally. You need to know your twentieth-century presidents and your Founding Fathers, but "you don't need to know much about Millard Fillmore or Chester Arthur."
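To make the idea concrete, here is a minimal sketch of that kind of frequency analysis, assuming a hypothetical CSV export of archived clues. The file name and column names are illustrative, not Craig's actual tooling.

```python
# Count which categories show up most often in an archive of old clues.
# Assumes a hypothetical CSV with a "category" column; adjust to your export.
import csv
from collections import Counter

def category_frequencies(path: str) -> Counter:
    """Tally how often each category appears in the clue archive."""
    counts = Counter()
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            counts[row["category"].strip().lower()] += 1
    return counts

if __name__ == "__main__":
    freq = category_frequencies("jeopardy_clues.csv")  # hypothetical file
    for category, n in freq.most_common(20):
        print(f"{category:40s} {n}")
```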

Then Craig did something clever: he quantified his own knowledge. He tested himself against the question categories, measured his gaps against the average Jeopardy player's performance, and built a study plan that targeted exactly the areas where improvement would yield the highest point return. He even built a digital version of himself that could simulate his performance on historical games. When his digital twin started winning consistently, he knew he was ready.
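A rough sketch of that prioritization logic might look like the following. The scoring formula (category frequency times point value times knowledge gap) and all of the numbers are illustrative assumptions, not Craig's actual model.

```python
# Rank study targets by expected points recoverable from closing each gap.
# All stats below are made-up placeholders for illustration.
from dataclasses import dataclass

@dataclass
class CategoryStats:
    name: str
    frequency: float   # share of games in which the category appears
    avg_points: float  # average dollar value of its clues
    accuracy: float    # your measured accuracy from self-testing, 0 to 1

def study_priority(c: CategoryStats) -> float:
    """Expected point return from studying this category."""
    return c.frequency * c.avg_points * (1.0 - c.accuracy)

stats = [
    CategoryStats("20th-century presidents", 0.60, 800, 0.85),
    CategoryStats("opera", 0.25, 1200, 0.40),
    CategoryStats("state capitals", 0.50, 600, 0.95),
]

for c in sorted(stats, key=study_priority, reverse=True):
    print(f"{c.name:28s} priority={study_priority(c):7.1f}")
```

Sorting by that score surfaces the categories where study time buys the most expected points, which is essentially the gap analysis described above.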

It worked. Craig set the record for most money earned in a single Jeopardy game and the record for winnings over a player's first five games. Data science made him one of the greatest players in the show's history.

Watson's Different Path

IBM's Watson beat Ken Jennings and Brad Rutter on Jeopardy in February 2011, a year before we wrote our Craig article. Watson was a massive achievement: a room full of servers running natural language processing algorithms that could parse Jeopardy's notoriously tricky clues -- where the clue is the answer and the response must be phrased as a question -- search a knowledge base, and buzz in faster than humans.

But Watson was brittle. It was a rule-based system that worked on Jeopardy because the domain was constrained. The same system couldn't hold a conversation, write an essay, or reason about novel situations. IBM spent the next decade trying to turn Watson into an enterprise product, rebranding it repeatedly -- Watson Health, Watson Studio, Watson Assistant -- and never quite finding the killer application.

Watson beat the best humans at Jeopardy in 2011 and IBM spent a decade trying to find a second act. ChatGPT found a billion use cases in its first year. The difference wasn't intelligence. It was flexibility.

In 2022, IBM began quietly retiring the Watson brand from most of its products. By the end of that year, ChatGPT had launched and made Watson look ancient. In 2023, IBM pivoted to watsonx, an enterprise AI platform built on foundation models -- essentially admitting that the original Watson approach had been superseded.

The Real Lesson: Brittle vs. Flexible AI

This is the most important AI lesson from the past decade, and the Jeopardy story illustrates it perfectly. Watson and ChatGPT both answer questions. But Watson needed custom engineering for every new domain. ChatGPT works across thousands of domains with no customization because it learned language itself, not just trivia facts.

Roger Craig's approach was actually more like modern AI than Watson was. Craig didn't memorize answers. He found patterns, identified his weaknesses, and optimized his learning strategy. He was flexible. Watson was impressive but specialized. The irony is that a human data scientist's approach in 2012 was philosophically closer to how GPT-4 works than IBM's billion-dollar AI system was.

Craig himself saw broader implications for personalized learning. Adaptive learning platforms like Khan Academy's Khanmigo and Duolingo's AI tutor now do exactly what Craig described: assess what a student knows, identify gaps, and personalize the path.

What This Means for Enterprise AI

For enterprises, the Watson-to-ChatGPT arc is a warning and an opportunity. Don't build brittle AI systems that work for one use case and can't adapt. Foundation models are flexible enough to serve dozens of use cases within a single organization, if you set up the infrastructure right.

The companies that treated Watson as a magic box got burned when the magic stopped working. The companies treating GPT-4 or Claude as a platform for building operational AI capabilities are in a much stronger position. Craig hacked Jeopardy by understanding the game deeply, then applying data science where it mattered most. The best AI strategies in 2026 do the same thing.