Files
Vijay Janapa Reddi 09de699545 fix(exports): add missing #| export directives across 10 modules
Systematic audit of all 20 modules against module-developer agent rules
found 9 standalone helper functions missing #| export — these are called
by exported code at runtime but were excluded from the generated package,
causing NameError/AttributeError in CI.

Modules fixed:
- 05_dataloader: _pad_image, _random_crop_region (used by RandomCrop)
- 06_autograd: _stable_softmax, _one_hot_encode (prior session)
- 07_optimizers: 5 mixin classes + monkey-patches (prior session)
- 08_training: 7 monkey-patched Trainer methods (prior session)
- 10_tokenization: _count_byte_pairs, _merge_pair (used by BPETokenizer)
- 11_embeddings: _compute_sinusoidal_table (prior session)
- 12_attention: _compute_attention_scores, _scale_scores, _apply_mask (prior)
- 15_quantization: _collect_layer_inputs, _quantize_single_layer (used by quantize_model)
- 18_memoization: _cached_generation_step, _create_cache_storage, _cached_attention_forward (used by enable_kv_cache)
- 19_benchmarking: rename TinyMLPerf→MLPerf, fix monkey-patch naming (prior)

Also includes: vscode-ext icon refactor (ThemeIcon migration).

All 789 tests pass (unit, integration, e2e, CLI).
2026-02-15 17:38:03 -05:00
..

TinyTorch Test Suite

Comprehensive testing organized by purpose and scope.

Test Organization

📦 Module Tests (XX_modulename/)

Purpose: Test individual module functionality Scope: Single module, isolated behavior Example: 01_tensor/test_progressive_integration.py

These tests validate that each module works correctly in isolation.

🔗 Integration Tests (integration/)

Purpose: Test cross-module interactions Scope: Multiple modules working together Files:

  • test_gradient_flow.py - CRITICAL: Validates gradients flow through entire training stack
  • test_end_to_end_training.py - Full training loops (TODO)
  • test_module_compatibility.py - Module interfaces (TODO)

Why this matters:

  • Catches bugs that unit tests miss
  • Validates the "seams" between modules
  • Ensures training actually works end-to-end

🐛 Debugging Tests (debugging/)

Purpose: Catch common student pitfalls Scope: Pedagogical - teaches debugging Files:

  • test_gradient_vanishing.py - Detect/diagnose vanishing gradients (TODO)
  • test_gradient_explosion.py - Detect/diagnose exploding gradients (TODO)
  • test_common_mistakes.py - "Did you forget backward()?" style tests (TODO)

Philosophy: When these tests fail, the error message should teach the student what went wrong and how to fix it.

Autograd Edge Cases (06_autograd/)

Purpose: Stress-test autograd system Scope: Autograd internals and edge cases Files:

  • test_broadcasting.py - Broadcasting gradient bugs (TODO)
  • test_computation_graph.py - Graph construction edge cases (TODO)
  • test_backward_edge_cases.py - Numerical stability, etc. (TODO)

Running Tests

Standard Mode

pytest tests/ -v                    # All tests
pytest tests/integration/ -v        # Integration tests only
pytest tests/01_tensor/ -v          # Specific module
pytest tests/ --tinytorch           # Rich output with WHAT/WHY context
pytest tests/01_tensor/ --tinytorch # Single module with education

Educational mode shows:

  • Module groupings before running
  • What each test does (WHAT)
  • Why it matters (WHY)
  • Learning tips on failure (STUDENT LEARNING)
  • Clear pass/fail indicators with Rich formatting

Run without pytest

python tests/integration/test_gradient_flow.py

Test Philosophy

  1. Integration tests catch real bugs: The gradient flow test caught the exact bugs that prevented training
  2. Descriptive names: Test names should explain what they test
  3. Good error messages: When tests fail, students should understand why
  4. Pedagogical value: Tests teach correct usage patterns

Educational Test Docstrings

All *_core.py test files use a structured docstring format:

def test_tensor_addition(self):
    """
    WHAT: Element-wise tensor addition.

    WHY: Addition is used everywhere in neural networks:
    - Adding bias to layer output: y = Wx + b
    - Residual connections: output = layer(x) + x

    STUDENT LEARNING: Operations return new Tensors (functional style).
    """

This format enables the --tinytorch flag to show educational context when tests run.

Adding New Tests

When adding a test, ask:

  • Is it testing one module? → Put in XX_modulename/
  • Is it testing modules working together? → Put in integration/
  • Is it teaching debugging? → Put in debugging/
  • Is it an autograd edge case? → Put in 06_autograd/

Most Important Tests

🔥 Must pass before merging:

  • integration/test_gradient_flow.py - If this fails, training is broken

📚 Module validation:

  • Each module's inline tests (in modules/)
  • Module-specific tests in tests/XX_modulename/

Test Coverage Goals

  • All tensor operations have gradient tests
  • All layers compute gradients correctly
  • All activations integrate with autograd
  • All loss functions compute gradients
  • All optimizers update parameters
  • End-to-end training converges (TODO)
  • Common pitfalls are detected (TODO)