Brawn Beats Brain: The Bitter Lesson of AI Research
How centuries of human ingenuity are being swept aside by raw computational power
Picture a young chess player in 1921, hunched over José Capablanca's newly published Chess Fundamentals. The book opens with methodical precision: "The first thing a student should do, is to familiarise himself with the power of the pieces. This can best be done by learning how to accomplish quickly some of the simple mates."
Page by page, the student absorbs centuries of chess wisdom. Control the center. Develop pieces early. King safety matters. Castle early. Each principle builds on the last, creating an elegant architecture of understanding.
This is humanity's greatest strength: we don't start from scratch. We inherit wisdom, add our insights, pass it forward. Every textbook, every mentor, every "best practice" is a link in an unbroken chain stretching back through generations. In physics, Newton's laws became the foundation for Maxwell's equations, which enabled Einstein's relativity. Each generation stands on the shoulders of giants, extending the chain of human knowledge.
Chess exemplified this pattern perfectly. From medieval manuscripts to Capablanca to Kasparov, the royal game became a repository of human cleverness, refined across generations.
For decades, this was also how we taught computers to play chess. Early chess programs were essentially digital versions of Capablanca's book. Programmers fed them libraries of proven opening moves, rules for judging good positions from bad ones, and databases of how to win with just a few pieces left on the board. They built systems that knew which piece arrangements were strong or weak, could evaluate how well pieces were positioned, and could spot winning move sequences. The best chess computers were digital libraries of human expertise, refined and accelerated.
The approach seemed right. Cleverness plus computational speed equaled progress. The more chess knowledge we could encode, the better the programs became.
The Knowledge-Based Pinnacle
By 1997, this philosophy reached its apex with IBM's Deep Blue. Here was the ultimate expression of human chess understanding, amplified by silicon and steel. Deep Blue had 8,000 different ways to evaluate chess positions, each one crafted by human experts for specific situations. It had memorized the best opening moves from 700,000 grandmaster games and knew the perfect way to win every possible endgame with just a few pieces left.
When Deep Blue defeated world champion Garry Kasparov, it seemed to confirm the power of accumulated human wisdom. The machine had simply taken centuries of chess knowledge and applied it faster than any human could manage: 200 million positions evaluated per second, against the few dozen continuations a grandmaster might calculate in a critical position.
The chess world celebrated. This was how progress worked: accumulate wisdom, encode it, accelerate it. Human cleverness had triumphed, even if it needed silicon assistance.
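To see what this recipe looks like in code, here is a minimal sketch of a knowledge-based engine in Python. It is illustrative only: it assumes the third-party python-chess package, its hand-crafted evaluation counts nothing but material (where Deep Blue's weighed roughly 8,000 features), and it omits the alpha-beta pruning, opening book, and endgame databases a real engine depends on.

```python
import chess  # third-party python-chess package (pip install chess)

# A toy knowledge-based engine: brute-force search wrapped around a
# hand-crafted evaluation function. Deep Blue followed this recipe with
# ~8,000 evaluation features and custom hardware; here the "knowledge"
# is a bare material count, to show the shape of the approach.

PIECE_VALUES = {chess.PAWN: 1, chess.KNIGHT: 3, chess.BISHOP: 3,
                chess.ROOK: 5, chess.QUEEN: 9, chess.KING: 0}

def evaluate(board):
    """Human chess knowledge, encoded by hand: material count from the
    perspective of the side to move."""
    score = 0
    for piece in board.piece_map().values():
        value = PIECE_VALUES[piece.piece_type]
        score += value if piece.color == chess.WHITE else -value
    return score if board.turn == chess.WHITE else -score

def negamax(board, depth):
    """Brute-force search: try every legal move, assume the opponent
    answers with their best reply, keep the maximum."""
    if depth == 0 or board.is_game_over():
        return evaluate(board)
    best = -float("inf")
    for move in list(board.legal_moves):
        board.push(move)
        best = max(best, -negamax(board, depth - 1))
        board.pop()
    return best

def best_move(board, depth=3):
    """Pick the move leading to the best-searching position."""
    def score(move):
        board.push(move)
        value = -negamax(board, depth - 1)
        board.pop()
        return value
    return max(list(board.legal_moves), key=score)

print(best_move(chess.Board()))  # prints a legal first move (all moves
                                 # score equal at this shallow depth)
```

All of the chess understanding lives in evaluate(); the search itself knows nothing about the game. Deep Blue was, in essence, this recipe scaled up by expert knowledge and specialized silicon.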
The Radical Question
But twenty years later, Google's DeepMind asked a heretical question: What if all of this is wrong?
What if centuries of accumulated wisdom aren't the foundation of intelligence—but its limitation?
What if the careful chain of knowledge, passed from teacher to student, is actually holding us back?
What if we just trusted the machine to figure it out for itself?
The Experiment
The experiment was as simple as it was radical. AlphaZero would learn chess from scratch. No memorized opening moves. No databases of winning endgames. No centuries of human wisdom. Just the basic rules—how pieces move, what constitutes checkmate—and the ability to play against itself.
Picture the moment: a neural network sits down at the virtual board, knowing nothing.
Game 1: Random moves. Loses immediately.
Game 100: Still terrible. Loses in a few moves.
Game 1,000: Slightly less terrible. Occasionally doesn't blunder its queen.
But here's the beauty of reinforcement learning: every loss teaches. Every mistake adjusts the neural network's weights slightly. The machine doesn't get frustrated, doesn't need encouragement, never gives up. It just plays. And plays. And plays. (A toy version of this learning loop is sketched after the progression below.)
Approximately 19.6 million games in the first four hours alone.
Game 10,000: Starting to understand basic tactics.
Game 100,000: Developing its own style.
Game 1,000,000: Playing moves no human has ever seen.
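To make that loop concrete, here is a toy self-play learner in Python. It is a sketch under stand-in assumptions, not AlphaZero: in place of a deep neural network and Monte Carlo tree search it uses a simple tabular Q-function, and in place of chess it uses one-heap Nim (take 1, 2, or 3 stones; whoever takes the last stone wins), a game small enough to master in seconds. All the constants are invented for illustration.

```python
import random
from collections import defaultdict

# Self-play reinforcement learning on one-heap Nim. The same table plays
# both sides; every game, win or lose, nudges the learned values slightly.
# This stands in for AlphaZero's setup, which pairs a deep network with
# Monte Carlo tree search instead of a lookup table.

ACTIONS = (1, 2, 3)
Q = defaultdict(float)       # Q[(stones, action)] = value for the player to move
ALPHA, EPSILON = 0.1, 0.2    # learning rate and exploration rate

def best_value(stones):
    """Value of the position for whoever is about to move (stones > 0)."""
    return max(Q[(stones, a)] for a in ACTIONS if a <= stones)

def choose(stones):
    """Epsilon-greedy move selection: mostly greedy, sometimes random."""
    legal = [a for a in ACTIONS if a <= stones]
    if random.random() < EPSILON:
        return random.choice(legal)
    return max(legal, key=lambda a: Q[(stones, a)])

for game in range(50_000):
    stones = 15
    while stones > 0:
        action = choose(stones)
        remaining = stones - action
        # Taking the last stone wins (+1); otherwise our value is minus
        # whatever the opponent can get from the position we leave them.
        target = 1.0 if remaining == 0 else -best_value(remaining)
        Q[(stones, action)] += ALPHA * (target - Q[(stones, action)])
        stones = remaining

# The learned policy rediscovers Nim's known strategy:
# always leave your opponent a multiple of 4 stones.
for s in (15, 14, 13, 10, 7, 6, 5):
    print(s, "-> take", max((a for a in ACTIONS if a <= s),
                            key=lambda a: Q[(s, a)]))
```

Nobody tells the learner to leave multiples of four; the rule emerges from nothing but losses, the same dynamic that let AlphaZero outgrow its random beginnings.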
After just 4 hours of training, AlphaZero was ready to face Stockfish—the heir to Deep Blue's knowledge-based throne, refined over decades with human expertise.
The Shocking Victory
AlphaZero didn't just win. It dominated: out of 1,000 games it won 155 and lost just 6, drawing the rest. But more shocking than the victory was the style. AlphaZero played alien chess—moves that violated fundamental principles, sacrifices that made no sense, strategies that seemed suicidal.
Human experts watched in bewilderment as AlphaZero made moves they would never consider, then twenty moves later revealed the hidden brilliance. Chess Grandmaster Matthew Sadler, who analyzed thousands of AlphaZero's games, said its style was "unlike any traditional chess engine." The machine had developed its own intuitions, its own understanding of the game that transcended human knowledge.
All those centuries of accumulated wisdom? All of Capablanca's carefully articulated principles? All the opening theory and positional understanding that had taken humans generations to develop?
Irrelevant.
This wasn't just a story about chess. It was the first clear demonstration of what AI researcher Richard Sutton would later call the Bitter Lesson—the uncomfortable truth that our human understanding of problems, built from lifetimes of experience, often matters less than raw computational power applied to the right learning algorithms. In other words, brawn beats brain.
The 70-Year Pattern
This wasn't just chess. This wasn't even just games. As Sutton observed, the same pattern has played out across 70 years of AI research. Every time, human knowledge-based approaches initially dominate. Every time, researchers resist the brute force alternative. And every time, raw computational power eventually wins.
Speech Recognition, 1970s: In the early DARPA-sponsored competitions, teams arrived with sophisticated systems built on human understanding—knowledge of speech sounds, how vocal cords work, and linguistic rules. On the other side were newer statistical methods based on Hidden Markov Models that did much more computation and learned patterns from data. The statistical methods won decisively, launching a decades-long transformation of natural language processing from rule-based systems to data-driven ones. Today's deep learning speech recognition systems rely even less on human knowledge and use even more computation.
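For a flavor of what "learned patterns from data" meant, here is a minimal sketch of the forward algorithm for a hidden Markov model, the core probability computation inside those statistical recognizers. Every number below is invented for illustration; real systems had thousands of states and estimated their probabilities from recorded speech via the Baum-Welch algorithm.

```python
import numpy as np

# Forward algorithm for a tiny hidden Markov model. A speech recognizer
# would score an acoustic sequence under each word's model and pick the
# most likely word, with no hand-written phonetic rules in the loop.
# Two hidden phoneme-like states, three observable acoustic symbols;
# all probabilities here are made up for the demo.

initial = np.array([0.6, 0.4])            # P(first hidden state)
transition = np.array([[0.7, 0.3],        # P(next state | current state)
                       [0.4, 0.6]])
emission = np.array([[0.5, 0.4, 0.1],     # P(observed symbol | state)
                     [0.1, 0.3, 0.6]])

def likelihood(observations):
    """P(observation sequence | model), by dynamic programming."""
    alpha = initial * emission[:, observations[0]]
    for obs in observations[1:]:
        # Sum over every path into each state, then weight by the
        # probability of emitting the observed symbol there.
        alpha = (alpha @ transition) * emission[:, obs]
    return alpha.sum()

print(likelihood([0, 1, 2, 2]))  # higher means a better-matching model
```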
Computer Vision, 1980s-2000s: For decades, computer vision researchers built systems around human insights about how vision works. They programmed computers to look for edges, basic shapes, and distinctive visual features. These hand-crafted approaches seemed logical—after all, shouldn't machines see the way humans think we see? But modern deep learning neural networks discarded all of this, using only basic pattern-matching operations and massive computation to achieve dramatically better results.
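The contrast is easy to show in code. Below is the classic Sobel filter, a 3x3 grid of numbers chosen by humans to respond to vertical edges, applied with the same sliding-window operation a convolutional network uses; the difference is that a network learns its filter values from millions of images rather than taking them from a textbook. The tiny image is made up for the demo.

```python
import numpy as np

# The Sobel operator: a hand-designed filter for finding vertical edges,
# typical of pre-deep-learning computer vision. Convolutional networks
# keep this sliding-window operation but learn the filter weights from
# data instead of fixing them by hand.

sobel_x = np.array([[-1, 0, 1],
                    [-2, 0, 2],
                    [-1, 0, 1]])

image = np.zeros((6, 6))
image[:, 3:] = 1.0        # dark region on the left, bright on the right

def slide(img, kernel):
    """Apply the kernel at every position of the image (valid region)."""
    kh, kw = kernel.shape
    out = np.zeros((img.shape[0] - kh + 1, img.shape[1] - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(img[i:i + kh, j:j + kw] * kernel)
    return out

print(slide(image, sobel_x))  # strong responses exactly where dark meets bright
```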
Go, 2016: The ancient Chinese game fell to the same pattern, nearly 20 years after chess. Enormous initial efforts went into avoiding search by leveraging human knowledge of Go's special features. All those efforts proved irrelevant once search and learning were applied effectively at scale. AlphaGo's famous move 37 against Lee Sedol—a move that violated centuries of Go wisdom—became the symbol of machine intelligence transcending human understanding.
The pattern is so consistent it's almost mechanical. Researchers start with human knowledge because it's personally satisfying and works in the short term. But as computational power grows exponentially—following Moore's Law—the brute force approaches inevitably win.
As Sutton puts it: "Time spent on one is time not spent on the other." The psychological commitment to human-knowledge approaches actually inhibits progress toward the computational methods that ultimately succeed.
The Bet on Brute Force
Sutton's advice has not gone unheeded. The world's biggest tech companies have absorbed the bitter lesson and are betting big on computational brute force. Microsoft alone plans to spend $80 billion in 2025 on AI-enabled data centers. Combined with Meta, Amazon, and Google, Big Tech will invest over $320 billion this year on AI infrastructure. Forget the app layer—just go for scale.
The consequences are staggering. Data centers now consume 1.5% of global electricity—equivalent to Saudi Arabia's entire electricity consumption. Elon Musk's xAI is building a data center with 1 million GPUs requiring 2 gigawatts of power—enough to power 1.9 million homes. The operation is so power-hungry that Musk is importing an entire power plant from overseas and shipping it to Memphis. By 2030, data centers may consume as much electricity as Japan does today.
But This Isn't Actually Alien
Yet this pattern of ignoring conventional wisdom isn't as alien as it seems. Throughout history, the most revolutionary breakthroughs have come from outsiders who threw out the rulebook. Jimi Hendrix never learned to read music and held his guitar "wrong"—left-handed and upside down—yet revolutionized rock music. Van Gogh was rejected by art academies for his unconventional technique.
The pattern repeats across every domain: formal training teaches you the "right" way, but breakthrough innovation often comes from ignoring those rules entirely. Raw talent, intuition, and relentless practice can surpass decades of accumulated conventional wisdom. In this light, AlphaZero's approach—learning from scratch through pure trial and error—isn't alien at all.
The Question That Matters
But imagine such a savant-like mind without human limitations. No need for sleep, no complaints, no fatigue. A system that can absorb patterns and iterate millions of times faster than any human ever could.
If it can master chess in 4 hours well enough to beat engines refined for decades, could it learn your job?
CheckMate: AI Will Take the Jobs of Those Using It
In 2023, as debates raged over AI's impending impact, one line sliced through the noise with sobering clarity: "AI won't take your job. It's someone using AI that will take your job." Uttered by Professor Richard Baldwin at the World Economic Forum, this sentiment quickly became a rallying cry for those seeking t…