mirror of https://github.com/harvard-edge/cs249r_book.git synced 2026-05-07 10:08:50 -05:00

Files

Rocky c19109f9a2 fix(tinytorch): LayerNorm gamma/beta missing requires_grad=True

gamma and beta were created as Tensor(np.ones/zeros(n)) with no
requires_grad flag, defaulting to False after enable_autograd() patches
Tensor.__init__. The _LayerNormBackward.apply() guards on
beta.requires_grad and gamma.requires_grad, so gradients were silently
never computed for either parameter -- LayerNorm could not learn its
scale and shift during training.

Fix: pass requires_grad=True at construction so the backward pass
computes grad_gamma and grad_beta correctly.

Also remove the manual param.requires_grad = True workaround from
test_layernorm_gradient_flow() which was masking the bug.

2026-04-23 23:19:22 +05:30

01_tensor

test(tinytorch): add PyTorch-compat coverage for ndim, numel, view, contiguous, masked_fill

2026-04-18 04:48:22 +05:30

02_activations

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

03_layers

refactor(tinytorch): bump Python minimum to 3.10 and update Milestone 05 docs

2026-04-05 12:51:48 -04:00

04_losses

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

05_dataloader

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

06_autograd

fix(tinytorch): fix GELU gradient mismatch and float32 test precision

2026-04-18 15:26:44 -04:00

07_optimizers

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

08_training

fix(tests/08_training): correct scheduler lr assertion to use epoch 0 not epoch 1

2026-04-16 18:27:15 +05:30

09_convolutions

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

10_tokenization

fix(tests/10_tokenization): replace raw numpy array params with Tensor in DummyModel

2026-04-16 18:19:20 +05:30

11_embeddings

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

12_attention

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

13_transformers

fix(tinytorch): LayerNorm gamma/beta missing requires_grad=True

2026-04-23 23:19:22 +05:30

14_profiling

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

15_quantization

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

16_compression

refactor(tests): clean up test folder and fix gradient flow issues

2026-01-24 12:22:37 -05:00

17_acceleration

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

18_memoization

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

19_benchmarking

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

20_capstone

docs(readmes): stretch HTML tables to full width

2026-04-22 16:01:54 -04:00

cli

fix(tests): update CLI tests for current command structure

2026-01-22 15:40:15 -05:00

e2e

fix(test): update assertion to match actual error message

2026-01-28 17:36:20 -05:00

environment

refactor(tinytorch): bump Python minimum to 3.10 and update Milestone 05 docs

2026-04-05 12:51:48 -04:00

integration

docs(readmes): stretch HTML tables to full width

2026-04-22 16:01:54 -04:00

milestones

docs(readmes): stretch HTML tables to full width

2026-04-22 16:01:54 -04:00

regression

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

.gitkeep

style: apply consistent whitespace and formatting across codebase

2025-12-13 14:05:34 -05:00

conftest.py

fix(tito): register --tinytorch pytest flag in conftest

2026-03-24 08:49:45 -04:00

README.md

fix: complete module renumbering across entire codebase

2025-12-19 17:43:41 -05:00

test_utils.py

refactor(tinytorch): migrate from legacy np.random to default_rng(7)

2026-04-03 17:57:51 -04:00

validate_nbgrader_config.py

…

README.md

TinyTorch Test Suite

Comprehensive testing organized by purpose and scope.

Test Organization

📦 Module Tests (`XX_modulename/`)

Purpose: Test individual module functionality Scope: Single module, isolated behavior Example: 01_tensor/test_progressive_integration.py

These tests validate that each module works correctly in isolation.

🔗 Integration Tests (`integration/`)

Purpose: Test cross-module interactions Scope: Multiple modules working together Files:

test_gradient_flow.py - CRITICAL: Validates gradients flow through entire training stack
test_end_to_end_training.py - Full training loops (TODO)
test_module_compatibility.py - Module interfaces (TODO)

Why this matters:

Catches bugs that unit tests miss
Validates the "seams" between modules
Ensures training actually works end-to-end

🐛 Debugging Tests (`debugging/`)

Purpose: Catch common student pitfalls Scope: Pedagogical - teaches debugging Files:

test_gradient_vanishing.py - Detect/diagnose vanishing gradients (TODO)
test_gradient_explosion.py - Detect/diagnose exploding gradients (TODO)
test_common_mistakes.py - "Did you forget backward()?" style tests (TODO)

Philosophy: When these tests fail, the error message should teach the student what went wrong and how to fix it.

⚡ Autograd Edge Cases (`06_autograd/`)

Purpose: Stress-test autograd system Scope: Autograd internals and edge cases Files:

test_broadcasting.py - Broadcasting gradient bugs (TODO)
test_computation_graph.py - Graph construction edge cases (TODO)
test_backward_edge_cases.py - Numerical stability, etc. (TODO)

Running Tests

Standard Mode

pytest tests/ -v                    # All tests
pytest tests/integration/ -v        # Integration tests only
pytest tests/01_tensor/ -v          # Specific module

🎓 Educational Mode (Recommended for Students)

pytest tests/ --tinytorch           # Rich output with WHAT/WHY context
pytest tests/01_tensor/ --tinytorch # Single module with education

Educational mode shows:

Module groupings before running
What each test does (WHAT)
Why it matters (WHY)
Learning tips on failure (STUDENT LEARNING)
Clear pass/fail indicators with Rich formatting

Run without pytest

python tests/integration/test_gradient_flow.py

Test Philosophy

Integration tests catch real bugs: The gradient flow test caught the exact bugs that prevented training
Descriptive names: Test names should explain what they test
Good error messages: When tests fail, students should understand why
Pedagogical value: Tests teach correct usage patterns

Educational Test Docstrings

All *_core.py test files use a structured docstring format:

def test_tensor_addition(self):
    """
    WHAT: Element-wise tensor addition.

    WHY: Addition is used everywhere in neural networks:
    - Adding bias to layer output: y = Wx + b
    - Residual connections: output = layer(x) + x

    STUDENT LEARNING: Operations return new Tensors (functional style).
    """

This format enables the --tinytorch flag to show educational context when tests run.

Adding New Tests

When adding a test, ask:

Is it testing one module? → Put in XX_modulename/
Is it testing modules working together? → Put in integration/
Is it teaching debugging? → Put in debugging/
Is it an autograd edge case? → Put in 06_autograd/

Most Important Tests

🔥 Must pass before merging:

integration/test_gradient_flow.py - If this fails, training is broken

📚 Module validation:

Each module's inline tests (in modules/)
Module-specific tests in tests/XX_modulename/

Test Coverage Goals

✅ All tensor operations have gradient tests
✅ All layers compute gradients correctly
✅ All activations integrate with autograd
✅ All loss functions compute gradients
✅ All optimizers update parameters
⏳ End-to-end training converges (TODO)
⏳ Common pitfalls are detected (TODO)

README.md

TinyTorch Test Suite

Test Organization

📦 Module Tests (XX_modulename/)

🔗 Integration Tests (integration/)

🐛 Debugging Tests (debugging/)

⚡ Autograd Edge Cases (06_autograd/)

Running Tests

Standard Mode

🎓 Educational Mode (Recommended for Students)

Run without pytest

Test Philosophy

Educational Test Docstrings

Adding New Tests

Most Important Tests

Test Coverage Goals

📦 Module Tests (`XX_modulename/`)

🔗 Integration Tests (`integration/`)

🐛 Debugging Tests (`debugging/`)

⚡ Autograd Edge Cases (`06_autograd/`)