Applied API simplification and consistency improvements across multiple modules:

Module 02 (Activations):
- Added an __all__ export list to control the public API
- Removed a redundant import statement
- Prevents internal constants from polluting the namespace

Module 09 (Spatial):
- Fixed test naming to use PyTorch conventions (Conv2d, not Conv2D)
- Fixed AvgPool2d gradient tracking (added the requires_grad parameter)
- Updated all test imports to use the lowercase 'd' naming

Module 12 (Attention):
- Fixed progressive integration tests to use the correct Trainer API
- Added the missing loss_fn parameter to Trainer calls

Module 17 (Memoization):
- Removed the redundant create_kv_cache() function (use KVCache() directly)
- Made internal constants private (_BYTES_PER_FLOAT32, _MB_TO_BYTES)
- Simplified the API from 6 exports to 3 core components, a 50% reduction in public API surface

Module 18 (Acceleration):
- Fixed the test suite to match the function-based API
- Added tests for vectorized_matmul, fused_gelu, and tiled_matmul
- All 6 tests now passing

Rationale:
- API simplicity: one clear way to do things
- Progressive disclosure: hide implementation details
- Consistent naming: follow established conventions
- Test coverage: validate all exported functionality

All module tests pass after these changes.
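For example, the Module 17 changes follow a standard Python export-control pattern. The sketch below is illustrative only; the class body and constant values are assumptions, not the actual TinyTorch source:

import numpy as np

_BYTES_PER_FLOAT32 = 4        # private: leading underscore, excluded from __all__
_MB_TO_BYTES = 1024 * 1024    # private helper constant

__all__ = ["KVCache"]         # explicit public API: callers use KVCache() directly

class KVCache:
    """Stores key/value arrays per step; sizes assume float32 entries."""
    def __init__(self):
        self._store = {}      # step -> (keys, values) ndarray pairs

    def put(self, step, keys, values):
        self._store[step] = (np.asarray(keys), np.asarray(values))

    def size_mb(self):
        # Uses the private constants without exposing them to importers.
        n = sum(k.size + v.size for k, v in self._store.values())
        return n * _BYTES_PER_FLOAT32 / _MB_TO_BYTES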
TinyTorch Test Suite
Comprehensive testing organized by purpose and scope.
Test Organization
📦 Module Tests (XX_modulename/)
Purpose: Test individual module functionality
Scope: Single module, isolated behavior
Example: 01_tensor/test_progressive_integration.py
These tests validate that each module works correctly in isolation.
🔗 Integration Tests (integration/)
Purpose: Test cross-module interactions
Scope: Multiple modules working together
Files:
- test_gradient_flow.py - CRITICAL: Validates gradients flow through the entire training stack
- test_end_to_end_training.py - Full training loops (TODO)
- test_module_compatibility.py - Module interfaces (TODO)
Why this matters:
- Catches bugs that unit tests miss
- Validates the "seams" between modules
- Ensures training actually works end-to-end (see the sketch below)
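To make the "seams" concrete, here is a minimal sketch of a cross-module gradient-flow check; the tinytorch import names (Tensor, Linear, MSELoss) are assumed for illustration and may not match the real test file:

import numpy as np
from tinytorch import Tensor, Linear, MSELoss  # assumed API names

def test_gradients_flow_through_stack():
    # The forward pass crosses three modules: tensors, layers, losses.
    x = Tensor(np.random.randn(4, 3), requires_grad=True)
    layer = Linear(3, 2)
    target = Tensor(np.zeros((4, 2)))

    loss = MSELoss()(layer(x), target)
    loss.backward()

    # The integration "seam": everything upstream of the loss must
    # receive a gradient, or training silently stalls.
    assert x.grad is not None
    assert all(p.grad is not None for p in layer.parameters())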
🐛 Debugging Tests (debugging/)
Purpose: Catch common student pitfalls
Scope: Pedagogical - teaches debugging
Files:
- test_gradient_vanishing.py - Detect/diagnose vanishing gradients (TODO)
- test_gradient_explosion.py - Detect/diagnose exploding gradients (TODO)
- test_common_mistakes.py - "Did you forget backward()?" style tests (TODO)
Philosophy: When these tests fail, the error message should teach the student what went wrong and how to fix it.
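For instance, a pitfall check might read like this sketch, where the assertion message itself does the teaching (the helper name and model API are hypothetical):

def check_backward_was_called(model):
    # Hypothetical helper: fail with a message that teaches the fix.
    grads = [p.grad for p in model.parameters()]
    assert any(g is not None for g in grads), (
        "All parameter gradients are None.\n"
        "  Likely cause: loss.backward() was never called.\n"
        "  Fix: call loss.backward() before optimizer.step()."
    )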
⚡ Autograd Edge Cases (05_autograd/)
Purpose: Stress-test autograd system
Scope: Autograd internals and edge cases
Files:
- test_broadcasting.py - Broadcasting gradient bugs (TODO)
- test_computation_graph.py - Graph construction edge cases (TODO)
- test_backward_edge_cases.py - Numerical stability, etc. (TODO)
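Broadcasting is the classic trap: when a (4, 3) activation is added to a (3,) bias, the bias gradient must be summed back down to shape (3,). A pure-NumPy sketch of that check, independent of TinyTorch's actual internals:

import numpy as np

def test_broadcast_add_grad_shape():
    x = np.random.randn(4, 3)
    b = np.random.randn(3)          # broadcast up to (4, 3) in x + b
    upstream = np.ones((4, 3))      # incoming gradient dL/d(x + b)

    # Correct backward for the broadcast operand: sum over the axis
    # added by broadcasting, restoring b's original shape.
    grad_b = upstream.sum(axis=0)

    assert grad_b.shape == b.shape  # a buggy autograd would return (4, 3)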
Running Tests
Standard Mode
pytest tests/ -v # All tests
pytest tests/integration/ -v # Integration tests only
pytest tests/01_tensor/ -v # Specific module
🎓 Educational Mode (Recommended for Students)
pytest tests/ --tinytorch # Rich output with WHAT/WHY context
pytest tests/01_tensor/ --tinytorch # Single module with education
Educational mode shows:
- Module groupings before running
- What each test does (WHAT)
- Why it matters (WHY)
- Learning tips on failure (STUDENT LEARNING)
- Clear pass/fail indicators with Rich formatting
Run without pytest
python tests/integration/test_gradient_flow.py
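This works because each runnable test file ends with a standard main guard; a minimal sketch of the pattern (the function name is illustrative):

# at the bottom of the test file
if __name__ == "__main__":
    test_gradients_flow_through_stack()   # run checks without pytest
    print("All gradient-flow checks passed.")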
Test Philosophy
- Integration tests catch real bugs: The gradient flow test caught the exact bugs that prevented training
- Descriptive names: Test names should explain what they test
- Good error messages: When tests fail, students should understand why
- Pedagogical value: Tests teach correct usage patterns
Educational Test Docstrings
All *_core.py test files use a structured docstring format:
def test_tensor_addition(self):
    """
    WHAT: Element-wise tensor addition.

    WHY: Addition is used everywhere in neural networks:
    - Adding bias to layer output: y = Wx + b
    - Residual connections: output = layer(x) + x

    STUDENT LEARNING: Operations return new Tensors (functional style).
    """
This format enables the --tinytorch flag to show educational context when tests run.
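For reference, here is a minimal sketch of how such a flag could be wired up in a conftest.py; the hook bodies are illustrative assumptions, not the actual TinyTorch plugin:

# conftest.py (illustrative sketch, not the real plugin)
import re

def pytest_addoption(parser):
    # Register the --tinytorch flag with pytest's option parser.
    parser.addoption("--tinytorch", action="store_true",
                     help="Show WHAT/WHY educational context for each test")

def pytest_itemcollected(item):
    # When the flag is set, print the structured docstring fields
    # (WHAT / WHY / STUDENT LEARNING) as each test is collected.
    if not item.config.getoption("--tinytorch"):
        return
    doc = getattr(item.function, "__doc__", "") or ""
    for field in ("WHAT", "WHY", "STUDENT LEARNING"):
        match = re.search(rf"{field}:\s*(.+)", doc)
        if match:
            print(f"[{field}] {match.group(1).strip()}")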
Adding New Tests
When adding a test, ask:
- Is it testing one module? → Put it in XX_modulename/
- Is it testing modules working together? → Put it in integration/
- Is it teaching debugging? → Put it in debugging/
- Is it an autograd edge case? → Put it in 05_autograd/
Most Important Tests
🔥 Must pass before merging:
- integration/test_gradient_flow.py - If this fails, training is broken
📚 Module validation:
- Each module's inline tests (in modules/)
- Module-specific tests in tests/XX_modulename/
Test Coverage Goals
- ✅ All tensor operations have gradient tests
- ✅ All layers compute gradients correctly
- ✅ All activations integrate with autograd
- ✅ All loss functions compute gradients
- ✅ All optimizers update parameters
- ⏳ End-to-end training converges (TODO)
- ⏳ Common pitfalls are detected (TODO)