# Milestone 02: The XOR Crisis (1969)
## Historical Context
In 1969, Marvin Minsky and Seymour Papert published *Perceptrons*, a book that mathematically proved single-layer perceptrons CANNOT solve the XOR problem. The result helped dry up neural network research funding for over a decade, contributing to the first "AI Winter."
The proof was devastating: no matter how long you train, a single layer cannot learn XOR. This milestone recreates that crisis... and then shows how multi-layer networks solved it.
## What You're Building
A demonstration of perceptron limitations and the multi-layer solution:
- **The Crisis**: watch a single-layer perceptron fail to learn XOR despite training
- **The Solution**: see how adding a hidden layer solves the "impossible" problem
## Required Modules
Run after completing Module 07 (training capability).
| Module | Component | What It Provides |
|---|---|---|
| Module 01 | Tensor | YOUR data structure |
| Module 02 | Activations | YOUR sigmoid/ReLU activations |
| Module 03 | Layers | YOUR Linear layers |
| Module 04 | Losses | YOUR loss functions |
| Module 05 | Autograd | YOUR automatic differentiation |
| Module 06 | Optimizers | YOUR SGD optimizer |
| Module 07 | Training | YOUR end-to-end training loop |
## Milestone Structure
This milestone uses a crisis → solution narrative with two scripts:
### 01_xor_crisis.py
**Purpose:** demonstrate the fundamental limitation
- Train a single-layer perceptron on XOR
- Watch the loss plateau near 0.69 (ln 2, the cross-entropy of pure guessing) while accuracy stays stuck at 50%
- No matter how long you train, it CANNOT learn
- Key Learning: "Minsky was right - single layers can't solve XOR"
The XOR Problem:

| x1 | x2 | XOR | Pattern |
|---|---|---|---|
| 0 | 0 | 0 | same |
| 0 | 1 | 1 | different |
| 1 | 0 | 1 | different |
| 1 | 1 | 0 | same |
These 4 points CANNOT be separated by a single line! To see why, suppose a line w1·x1 + w2·x2 + b = 0 did separate them: (0,0)→0 forces b < 0, while (0,1)→1 and (1,0)→1 force w2 + b > 0 and w1 + b > 0. Adding those two gives w1 + w2 + 2b > 0, hence w1 + w2 + b > -b > 0, contradicting the w1 + w2 + b < 0 that (1,1)→0 requires.
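You can watch this failure directly. Below is a minimal, self-contained NumPy sketch of a single-layer perceptron (logistic regression) on XOR. It is illustrative only: it does not use YOUR modules, and the learning rate, seed, and step count are arbitrary choices.

```python
import numpy as np

# The four XOR points and their labels.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W = rng.normal(scale=0.5, size=(2, 1))   # one weight per input
b = np.zeros(1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for step in range(10_001):
    p = sigmoid(X @ W + b)                                    # forward pass
    loss = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))  # binary cross-entropy
    dz = (p - y) / len(X)        # gradient shortcut for sigmoid + BCE
    W -= lr * (X.T @ dz)         # plain gradient descent
    b -= lr * dz.sum(axis=0)
    if step % 2500 == 0:
        acc = float(np.mean((p > 0.5) == y))
        print(f"step {step:5d}  loss {loss:.4f}  acc {acc:.0%}")

# The loss settles at ln(2) ≈ 0.693 (the cross-entropy of pure guessing)
# and never improves: no single line separates the two XOR classes.
```

The same failure appears with any learning rate or initialization: this loss is convex, and its global optimum is W = 0, b = 0, i.e. predicting 0.5 for every point.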
### 02_xor_solved.py
**Purpose:** show how multi-layer networks solve it
- Add ONE hidden layer (making it a 2-layer network)
- Same XOR problem, now solvable
- Watch accuracy reach 100%
- Key Learning: "Hidden layers unlock non-linear problems!"
The Solution:

```
Input → Hidden Layer → Output
        (learns useful features)
```
The hidden layer learns to transform the input space so that XOR becomes linearly separable!
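For contrast, here is the same sketch with one hidden layer and hand-written backpropagation (the bookkeeping YOUR autograd module automates). Again this is a hedged illustration, not the milestone script itself; the hidden width of 4 and the learning rate are arbitrary choices.

```python
import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 4)), np.zeros(4)   # hidden layer (2 -> 4)
W2, b2 = rng.normal(size=(4, 1)), np.zeros(1)   # output layer (4 -> 1)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 1.0
for step in range(5_001):
    h = sigmoid(X @ W1 + b1)    # hidden features: the learned transform
    p = sigmoid(h @ W2 + b2)    # output probability
    loss = -np.mean(y * np.log(p) + (1 - y) * np.log(1 - p))

    # Backpropagation by hand: chain rule, layer by layer.
    dz2 = (p - y) / len(X)                  # sigmoid + BCE shortcut
    dW2, db2 = h.T @ dz2, dz2.sum(axis=0)
    dz1 = (dz2 @ W2.T) * h * (1 - h)        # through the hidden sigmoid
    dW1, db1 = X.T @ dz1, dz1.sum(axis=0)

    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2
    if step % 1000 == 0:
        acc = float(np.mean((p > 0.5) == y))
        print(f"step {step:4d}  loss {loss:.4f}  acc {acc:.0%}")

# With a hidden layer the loss heads toward 0 and accuracy typically hits
# 100% within a few thousand steps (an unlucky seed may need more).
```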
## Expected Results
| Script | Layers | Loss | Accuracy | What It Shows |
|---|---|---|---|---|
| 01 (Single Layer) | 1 | ~0.69 (stuck at ln 2) | ~50% | Cannot learn XOR (Minsky was right) |
| 02 (Multi-Layer) | 2 | → 0.0 | 100% | Hidden layers solve the problem! |
## Key Learning: The Power of Depth
This milestone teaches the fundamental reason why deep learning works:
- Single layers = Only linear decision boundaries (limited expressiveness)
- Multiple layers = Can approximate ANY decision boundary, given enough hidden units (universal approximation)
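One caveat worth making concrete: depth alone is not enough. Two Linear layers with nothing between them collapse into a single linear map, which is why the hidden layer in script 02 only helps because of its non-linear activation. A quick NumPy check (all shapes here are arbitrary choices for illustration):

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(5, 2))                 # a batch of inputs
W1, b1 = rng.normal(size=(2, 4)), rng.normal(size=(4,))
W2, b2 = rng.normal(size=(4, 1)), rng.normal(size=(1,))

two_linear = (x @ W1 + b1) @ W2 + b2        # Linear -> Linear, no activation
one_linear = x @ (W1 @ W2) + (b1 @ W2 + b2) # a single equivalent Linear layer
print(np.allclose(two_linear, one_linear))  # True: still just one line/plane
```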
The XOR crisis wasn't about perceptrons being broken; it was about needing depth to solve complex problems. Once backpropagation (popularized in 1986) made training multi-layer networks practical, the field revived and the AI Winter ended.
## Running the Milestone

```bash
cd milestones/02_1969_xor

# Step 1: Experience the crisis (run after Module 07)
python 01_xor_crisis.py

# Step 2: See the solution
python 02_xor_solved.py
```
## Further Reading

- The Crisis: Minsky, M., & Papert, S. (1969). *Perceptrons: An Introduction to Computational Geometry*. MIT Press.
- The Solution: Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). "Learning representations by back-propagating errors." *Nature*, 323, 533-536.
- Historical Context: the "AI winter" article on Wikipedia
## Achievement Unlocked
After completing this milestone, you'll understand:
- Why single-layer networks have fundamental limitations
- What "linear separability" means (and why it matters)
- How hidden layers enable non-linear decision boundaries
- The historical importance of this problem (it helped cause the AI Winter!)
You've experienced the crisis that shaped neural network history!