
Integration Tests

Philosophy

Integration tests catch bugs that unit tests miss: failures at module boundaries, where one module's output becomes another module's input.

The Gradient Flow Pattern

The gold standard is test_gradient_flow.py. It verifies:

  1. Gradients exist (not None)
  2. Gradients are non-zero (actually computed)
  3. Gradients flow through each layer (chain not broken)
  4. Training actually works (loss decreases)

This pattern catches the most common and frustrating bugs students encounter.
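Points 1–3 amount to checking that gradients exist, are non-zero, and are correct. The same idea can be exercised without any framework by comparing an analytic gradient against finite differences. A minimal NumPy sketch for a single linear layer with MSE loss (the TinyTorch Tensor API is deliberately not used here):

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.normal(size=(4, 3))
x = rng.normal(size=(1, 4))
t = rng.normal(size=(1, 3))

def loss(W):
    # Mean squared error of a single linear layer
    return float(((x @ W - t) ** 2).mean())

# Analytic gradient of the MSE w.r.t. W
grad = 2 * x.T @ (x @ W - t) / t.size

# Finite-difference check: every entry must match the analytic value
eps = 1e-6
for i in range(W.shape[0]):
    for j in range(W.shape[1]):
        Wp = W.copy(); Wp[i, j] += eps
        Wm = W.copy(); Wm[i, j] -= eps
        num = (loss(Wp) - loss(Wm)) / (2 * eps)
        assert abs(num - grad[i, j]) < 1e-4, f"gradient mismatch at ({i},{j})"

# Check 2 from the list above: the gradient is actually non-zero
assert np.any(grad != 0)
```

The integration tests apply the same three checks through the framework's own `.backward()` instead of finite differences.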

Test Categories

🔥 Critical (Must Pass)

| Test File                 | What It Catches        | Modules |
|---------------------------|------------------------|---------|
| test_gradient_flow.py     | Broken backpropagation | 01-07   |
| test_training_flow.py     | Training loop failures | 05-07   |
| test_nlp_pipeline_flow.py | NLP stack issues       | 10-13   |
| test_cnn_integration.py   | CNN gradient issues    | 09      |

📋 Standard (Should Pass)

| Test File                              | What It Catches      | Modules |
|----------------------------------------|----------------------|---------|
| test_dataloader_integration.py         | Data pipeline issues | 08      |
| test_api_simplification_integration.py | API compatibility    | All     |
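A data-pipeline boundary test checks that what the loader yields is what the model's first layer expects. An illustrative sketch in plain NumPy (the `batches` helper is hypothetical, standing in for the Module 08 DataLoader):

```python
import numpy as np

def batches(X, y, batch_size):
    # Hypothetical stand-in for a DataLoader: yields (features, labels) pairs
    for i in range(0, len(X), batch_size):
        yield X[i:i + batch_size], y[i:i + batch_size]

X = np.random.randn(10, 4)
y = np.random.randint(0, 2, size=10)

for xb, yb in batches(X, y, batch_size=4):
    assert xb.ndim == 2 and xb.shape[1] == 4   # matches a Linear(4, ...) input
    assert len(xb) == len(yb)                  # features and labels stay aligned
```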

🔬 Scenario Tests

These test complete use cases:

  • integration_xor_test.py - XOR learning (classic test)
  • integration_mnist_test.py - MNIST classification
  • integration_cnn_test.py - CNN on images
  • integration_tinygpt_test.py - Language model training
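The XOR scenario is the classic end-to-end check because a linear model cannot solve it: if a small MLP's loss drops on XOR, forward, backward, and the update step are all wired together correctly. A framework-free sketch of the idea in plain NumPy (the real test uses the TinyTorch API; here backprop is written out by hand):

```python
import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

# 2 -> 8 -> 1 MLP with tanh hidden units
W1 = rng.normal(size=(2, 8)); b1 = np.zeros(8)
W2 = rng.normal(size=(8, 1)); b2 = np.zeros(1)

losses, lr = [], 0.1
for _ in range(500):
    h = np.tanh(X @ W1 + b1)
    out = h @ W2 + b2
    err = out - y
    losses.append(float((err ** 2).mean()))

    # Manual backprop through both layers
    d_out = 2 * err / len(X)
    dW2 = h.T @ d_out; db2 = d_out.sum(0)
    d_h = (d_out @ W2.T) * (1 - h ** 2)   # tanh derivative
    dW1 = X.T @ d_h; db1 = d_h.sum(0)

    W2 -= lr * dW2; b2 -= lr * db2
    W1 -= lr * dW1; b1 -= lr * db1

# The scenario test's core assertion: training actually reduces the loss
assert losses[-1] < losses[0] * 0.5, "XOR training failed to reduce loss"
```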

What Makes a Good Integration Test

Good Integration Test

def test_gradients_flow_through_mlp():
    """Gradients must reach every layer in the stack."""
    # Tensor, Linear, relu, mse_loss come from the TinyTorch modules
    layers = [Linear(4, 4) for _ in range(5)]

    x = Tensor(np.random.randn(1, 4), requires_grad=True)
    target = Tensor(np.random.randn(1, 4))
    h = x
    for layer in layers:
        h = relu(layer(h))
    loss = mse_loss(h, target)
    loss.backward()

    # ALL layers must have gradients
    for i, layer in enumerate(layers):
        assert layer.weight.grad is not None, f"Layer {i} has no gradient!"

Why it's good:

  • Tests the boundary between layers
  • Catches gradient chain breaks
  • Clear error message tells you WHERE it broke

Bad Integration Test

def test_linear_layer():
    """Test linear layer works"""
    layer = Linear(2, 3)
    x = Tensor([[1, 2]])
    y = layer(x)
    assert y.shape == (1, 3)

Why it's bad:

  • This is a unit test, not integration
  • Doesn't test interaction with other modules
  • Belongs in tests/03_layers/

Running Tests

# Run all integration tests
pytest tests/integration/ -v

# Run only gradient flow tests
pytest tests/integration/test_gradient_flow.py -v

# Run only training flow tests  
pytest tests/integration/test_training_flow.py -v

# Run quick smoke tests (for CI)
pytest tests/integration/ -v -k quick

# Run with detailed output on failure
pytest tests/integration/ -v --tb=long

Adding New Integration Tests

When adding a new module (e.g., Module 14: Profiling), ask:

  1. What other modules does it interact with?

    • Profiling interacts with training loops (07) and models (03)
  2. What could break at the boundary?

    • Profiling hooks might interfere with autograd
    • Timing might change tensor operations
  3. Write a test that exercises the boundary:

def test_profiling_does_not_break_training():
    """Profiling should not interfere with gradient flow"""
    with profiler.profile():
        loss = model(x)
        loss.backward()  # Should still work!
    
    assert model.weight.grad is not None

Coverage Gaps

Currently Missing

| Module          | Integration Test Needed         |
|-----------------|---------------------------------|
| 14 Profiling    | Profiler + training loop        |
| 15 Quantization | Quantized model accuracy        |
| 16 Compression  | Compressed model still trains   |
| 17 Memoization  | Cached ops maintain correctness |
| 18 Acceleration | Accelerated ops match baseline  |

How to Fill Gaps

For each gap, create a test that:

  1. Uses the module in a realistic scenario
  2. Verifies correctness (not just "doesn't crash")
  3. Checks boundaries with connected modules
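As an example of all three criteria, a gap-filler for Module 15 might verify that int8 weight quantization keeps a layer's output close to the float baseline. A self-contained NumPy sketch (the `quantize_int8` helper is hypothetical, not the module's actual API):

```python
import numpy as np

def quantize_int8(w):
    # Hypothetical symmetric per-tensor quantization to int8
    scale = np.abs(w).max() / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

rng = np.random.default_rng(0)
W = rng.normal(size=(16, 8))
x = rng.normal(size=(4, 16))

q, scale = quantize_int8(W)
baseline = x @ W
quantized = x @ (q.astype(float) * scale)   # dequantize, then run the layer

# Criterion 2: verify correctness, not just "doesn't crash"
err = np.abs(baseline - quantized).max()
assert err < 0.25, f"quantization error too large: {err}"
```

The same shape works for the other gaps: run the realistic scenario with and without the new module, and assert the outputs (or the training behavior) stay consistent.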