

---
title: "Data Mining Techniques"
slug: "data-mining-techniques"
description: "Core data mining techniques and their evolution into modern machine learning and AI methods."
datePublished: "2013-02-25"
dateModified: "2026-03-15"
category: "Data Strategy"
tags: ["data mining", "techniques", "machine learning", "AI"]
tier: 3
originalUrl: "http://www.applieddatalabs.com/content/data-mining-techniques"
waybackUrl: "https://web.archive.org/web/20130225012511/http://www.applieddatalabs.com:80/content/data-mining-techniques"
---

Data Mining Techniques

In 2013, we wrote a deep dive on data mining techniques using Target's pregnancy prediction algorithm as the hook. Target had figured out that when a woman suddenly buys cocoa-butter lotion, a purse large enough to double as a diaper bag, and zinc supplements, she's probably pregnant. They could even estimate her due date. We explained the math behind it: sequence clustering algorithms, Markov chain analysis, distance calculations. It was genuinely interesting stuff.

Here's what's wild about rereading that article thirteen years later: every technique we described still works. The fundamentals haven't changed. The implementations have changed beyond recognition.

The Techniques That Endure

Our 2013 article covered the core data mining toolkit: classification (sorting things into categories), clustering (finding natural groupings), regression (predicting numbers), and association (finding patterns in sequences). We spent a lot of time on Markov chains, those mathematical models that predict the probability of moving from one state to another. We even generated some entertaining nonsense text by running a Markov chain on My Man Jeeves by Wodehouse: "Can you put a few years of your twin-brother Alfred, who was apt to rally round a bit. I should strongly advocate the blue with milk."
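The Markov-chain trick behind that Wodehouse nonsense fits in a few lines of Python: record which words follow which in a training text, then random-walk the table. This is a minimal sketch of the general technique, not the original article's implementation, and the toy corpus below stands in for the actual novel.

```python
import random
from collections import defaultdict

def build_chain(text, order=1):
    """Map each state (a tuple of `order` words) to the list of
    words observed to follow it in the training text."""
    words = text.split()
    chain = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        chain[state].append(words[i + order])
    return chain

def generate(chain, length=12, seed=0):
    """Walk the chain: each next word is sampled from the words
    that followed the current state in the training text."""
    rng = random.Random(seed)
    state = rng.choice(list(chain))
    out = list(state)
    for _ in range(length):
        followers = chain.get(state)
        if not followers:
            break  # dead end: this state ended the corpus
        out.append(rng.choice(followers))
        state = tuple(out[-len(state):])
    return " ".join(out)

corpus = ("the cat sat on the mat and the dog sat on the rug "
          "and the cat saw the dog")
print(generate(build_chain(corpus)))
```

Raise `order` to 2 or 3 and the output gets more coherent and more plagiaristic at the same time, which is the whole tension of the technique in miniature.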

"Somewhat coherent nonsense" is what we called that computer-generated text. Read it again and think about what GPT-4 does. Same concept. Wildly different execution.

Every technique we described in 2013 still works. Classification, clustering, regression, association rules. What changed is that neural networks made these concepts 1,000x more powerful and 1,000x harder to explain.

From Data Mining to Deep Learning

The journey from 2013 to 2026 followed a clear path, though it didn't feel clear at the time. Data mining was the established discipline. Machine learning was the hot new thing -- basically data mining with more automation and better algorithms. Deep learning emerged around 2012 when AlexNet won the ImageNet image recognition competition by a huge margin using neural networks. Then in 2017, Google published "Attention Is All You Need," the transformer paper that made large language models possible. In late 2022, ChatGPT launched, hit 100 million users within two months, and the entire field changed overnight.

But here's what people miss: LLMs do classification, clustering, regression, and association. They just do it inside neural networks with hundreds of billions of parameters instead of hand-crafted statistical models. When ChatGPT decides what word comes next, it's running a very sophisticated version of the Markov chain we described in 2013. It doesn't predict the next word from the previous word alone -- it considers the entire context window -- but the core idea of "what's most likely to follow" is the same.
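That "what's most likely to follow" idea is small enough to compute by hand. Here's a sketch of the estimation step: count bigrams, normalize into conditional probabilities, and take the argmax. The corpus is a toy stand-in for real training data, and an LLM does this over a whole context window rather than a single previous word.

```python
from collections import Counter, defaultdict

def next_word_probs(text):
    """Estimate P(next | current) from bigram counts."""
    words = text.split()
    counts = defaultdict(Counter)
    for cur, nxt in zip(words, words[1:]):
        counts[cur][nxt] += 1
    return {cur: {w: c / sum(followers.values())
                  for w, c in followers.items()}
            for cur, followers in counts.items()}

probs = next_word_probs(
    "i like tea i like coffee i like tea i drink tea")
# Most likely word after "i", by estimated probability:
best = max(probs["i"], key=probs["i"].get)
print(best)  # "like" -- it followed "i" three times out of four
```

An LLM replaces the count table with a neural network and the single-word state with the full context, but the output is the same shape: a probability distribution over what comes next.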

Target's pregnancy prediction algorithm from 2013 could be replaced today by a foundation model fine-tuned on purchase data. The model would probably find patterns Target's data scientists never imagined, because it wouldn't be limited to 25 hand-selected product signals.

What Practitioners Need to Know

If you're working with data in 2026, here's my honest take. Learn the fundamentals first. Understand what classification, clustering, and regression actually do, because those concepts are the vocabulary of every AI system you'll encounter. You don't need to implement them from scratch -- nobody does that anymore -- but you need to know what your tools are doing under the hood.

Then learn how to work with foundation models. Fine-tuning, prompt engineering, RAG (retrieval-augmented generation), and evaluation metrics are the practical skills that matter now. The gap between "I understand data mining concepts" and "I can deploy an AI system that works" is where operational AI capability lives.
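Of those skills, RAG is the most mechanical: score documents against the question, take the best matches, and splice them into the prompt. The sketch below uses bag-of-words cosine similarity as a stand-in for a real embedding model, and the final model call is left as a placeholder -- this shows the retrieval loop's shape, not a production system.

```python
import math
from collections import Counter

def score(query, doc):
    """Cosine similarity over bag-of-words counts -- a stand-in
    for the embedding similarity a real RAG system would use."""
    q, d = Counter(query.lower().split()), Counter(doc.lower().split())
    dot = sum(q[w] * d[w] for w in q)
    norm = (math.sqrt(sum(c * c for c in q.values())) *
            math.sqrt(sum(c * c for c in d.values())))
    return dot / norm if norm else 0.0

def build_prompt(question, documents, top_k=2):
    """Retrieve the top_k highest-scoring documents and splice
    them into the prompt as context for the model."""
    ranked = sorted(documents, key=lambda d: score(question, d),
                    reverse=True)
    context = "\n".join(ranked[:top_k])
    return (f"Answer using only this context:\n{context}\n\n"
            f"Question: {question}")

docs = [
    "Classification sorts records into known categories.",
    "Clustering finds natural groupings without labels.",
    "Regression predicts a numeric value from inputs.",
]
prompt = build_prompt("What does clustering do", docs)
# Send `prompt` to whatever model you use; the retrieval step
# above is the part that is yours to get right.
```

Swap the scoring function for real embeddings and the document list for a vector store, and the loop is the same: retrieval quality, not the model call, is usually where these systems succeed or fail.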

The Markov chain that generated nonsense text in 2013 and the LLM that writes coherent business documents in 2026 are separated by thirteen years of engineering, billions of dollars in compute, and trillions of training tokens. But the idea at the center -- find patterns in sequences, predict what comes next -- hasn't changed at all. That should be reassuring if you're trying to learn this field. The fundamentals hold.