TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-07-21 00:35:39 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	0378da462c	Add consistent Aha Moment demos to all 20 modules Each module now includes a self-contained demo function that: - Uses the 🎯 emoji for consistency with MODULE SUMMARY - Explains what was built and why it matters - Provides a quick, visual demonstration - Runs automatically after test_module() in __main__ Format: demo_[module_name]() with markdown explanation before it. All demos are self-contained with no cross-module imports.	2025-12-04 06:33:31 -08:00
Vijay Janapa Reddi	9aaa159fb6	Fix integration tests: update API usage to match current implementation - Replace Dense with Linear (API name change) - Fix PositionalEncoding parameter order (max_seq_len, embed_dim) - Replace Variable with Tensor (API consolidation) - Replace learning_rate with lr for optimizers - Remove Sequential (not in current API) - Replace BCELoss with BinaryCrossEntropyLoss - Remove LeakyReLU (not in current API) - Fix dropout eval test - Skip advanced NLP gradient tests (requires autograd integration) - Reduce loss improvement threshold for test stability - Fix tensor reshape error message to match tests	2025-12-03 09:04:14 -08:00
Vijay Janapa Reddi	ac7d6a9721	Fix integration tests: Dense -> Linear alias	2025-12-03 08:37:32 -08:00
Vijay Janapa Reddi	4aeb3c9c69	Merge main into dev, resolving conflicts with dev's version	2025-12-03 07:26:43 -08:00
Vijay Janapa Reddi	dde470a4e5	Fix all stale imports from models.transformer to core.transformer	2025-12-03 00:28:37 -08:00
Vijay Janapa Reddi	7f6dd19c10	Improve milestone 05 (Transformer) with letters for better visualization - Enhanced attention proof to use A-Z letters instead of numbers - Shows MCYWUH → HUWYCM instead of [1,2,3] → [3,2,1] - More intuitive and fun for students - Removed quickdemo, generation, dialogue scripts (too slow/gibberish)	2025-12-02 23:33:58 -08:00
Vijay Janapa Reddi	e11195c377	Fix test issues: remove misplaced file and fix learning rate - Removed tests/08_dataloader/test_autograd_core.py (duplicate of 05_autograd) - Fixed learning rate in training test to prevent gradient explosion	2025-12-02 23:08:23 -08:00
Vijay Janapa Reddi	47635d1550	Add three-phase testing to tito module test - Phase 1: Inline unit tests (quick sanity checks) - Phase 2: Module pytest with --tinytorch educational output - Phase 3: Integration tests for modules 01-N Added --unit-only and --no-integration flags for flexibility. Students can now run comprehensive tests with clear feedback about what each phase is checking and why it matters.	2025-12-02 23:06:17 -08:00
Vijay Janapa Reddi	e103f0dff7	Document educational test mode in tests/README.md - Add --tinytorch flag documentation for Rich educational output - Document WHAT/WHY/STUDENT LEARNING docstring format - Show example of the docstring structure	2025-12-02 22:53:30 -08:00
Vijay Janapa Reddi	8d77ea3cd1	Add educational WHAT/WHY/STUDENT LEARNING docstrings to all module tests All 20 modules now have *_core.py test files with: - Module-level context explaining WHY the component matters - WHAT each test does - WHY that behavior is important - STUDENT LEARNING tips for understanding Works with --tinytorch pytest flag for Rich CLI output.	2025-12-02 22:47:25 -08:00
Vijay Janapa Reddi	36dd05ef62	Add educational test output with Rich CLI - Create pytest_tinytorch.py plugin for educational test output - Update test_tensor_core.py with WHAT/WHY/STUDENT LEARNING docstrings - Show test purpose on pass, detailed context on failure - Use --tinytorch flag to enable educational mode Students can now understand what each test checks and why it matters.	2025-12-02 22:37:25 -08:00
Vijay Janapa Reddi	a622e2c200	Fix regression tests for current API - Update TransformerBlock to use mlp_ratio instead of hidden_dim - Update PositionalEncoding argument order - Fix MultiHeadAttention to use self-attention API - Add missing MultiHeadAttention import	2025-12-02 22:30:42 -08:00
Vijay Janapa Reddi	1e155fb4da	Remove legacy broken tests with outdated API imports - tests/performance/: Referenced non-existent modules/ directory - tests/system/: Required tinytorch.nn.functional which does not exist - tests/regression/test_conv_linear_dimensions.py: Same issue - These tests predated the API consolidation	2025-12-02 22:30:37 -08:00
Vijay Janapa Reddi	df6247d0eb	Add core tests for modules 06, 12, and 14-20 - Module 06: 7 tests for SGD/Adam optimizer weight updates - Module 12: 9 tests for attention computation and gradient flow - Modules 14-20: Educational tests with skip for unexported modules - All tests include docstrings explaining WHAT, WHY, and HOW	2025-12-02 22:30:29 -08:00
Vijay Janapa Reddi	7d41bb125e	Clean up naming conventions - Remove top-level SimpleModel from modules 15 & 16 (keep in test functions) - Rename QuantizationComplete → Quantizer (cleaner, matches Profiler pattern) - Rename CompressionComplete → Compressor (same pattern) - Rename benchmarking.benchmark → bench (shorter)	2025-12-02 22:05:50 -08:00
Vijay Janapa Reddi	1c1921bfba	Fix CLI help test for custom help format	2025-12-02 22:26:37 -05:00
Vijay Janapa Reddi	ea0dee1cc9	Update CLI registry test for current command structure - Added login.py as expected subcommand file - Removed references to deleted archived commands (leaderboard, olympics, etc.)	2025-12-02 22:25:18 -05:00
Vijay Janapa Reddi	c4f6454ee5	Remove legacy integration tests with outdated APIs Removed tests that used old APIs (Variable, learning_rate, etc.): - test_optimizers_integration.py - test_module_dependencies.py - integration_simple_test.py - integration_xor_test.py These are replaced by the new comprehensive tests: - test_training_flow.py (8 tests) - test_gradient_flow.py (6 tests) - test_nlp_pipeline_flow.py (8 tests) - test_cnn_integration.py (10 tests) All 67 integration tests now pass.	2025-12-02 22:23:54 -05:00
Vijay Janapa Reddi	6546b56c23	Fix gradient flow tests - use tensor operations for loss The tests were creating losses incorrectly by breaking the computation graph: WRONG: loss = Tensor(np.sum(output.data)) # Breaks graph! RIGHT: loss = output.sum() # Maintains graph Fixed: - test_cnn_integration.py: Conv2d and CNN gradient tests - test_nlp_pipeline_flow.py: Attention gradient tests - Removed xfail marker from attention test (now passing) The underlying Conv2d and Attention implementations were correct all along.	2025-12-02 22:22:51 -05:00
Vijay Janapa Reddi	d9633041e1	Fix test imports and nn module exports - Fix nn/__init__.py: alias Layer as Module for PyTorch compatibility - Fix test_layers_integration.py: use package imports instead of sys.exit - Fix test_optimizers_integration.py: use package imports instead of exec() - Add ReLU, Sigmoid, Softmax, Tanh exports to nn module	2025-12-02 22:18:42 -05:00
Vijay Janapa Reddi	fb20e255c9	Add integration tests for training flow and NLP pipeline New tests that catch module boundary bugs: test_training_flow.py: - Optimizer actually updates weights (SGD, Adam) - Training reduces loss over iterations - Gradient chain not broken through 5 layers - Input receives gradients - zero_grad works correctly - Batch gradients are averaged test_nlp_pipeline_flow.py: - Embedding receives gradients - Repeated tokens accumulate gradients - Attention projections receive gradients - Attention input receives gradients (xfail - known issue) - Transformer block gradient flow - Complete NLP pipeline end-to-end README.md: - Integration test philosophy - Good vs bad integration test examples - Coverage gaps and how to fill them	2025-12-02 22:14:08 -05:00
Vijay Janapa Reddi	3a885601f9	Clean up repository - Remove stale feature branches (kept debugging branch with unmerged work) - Move test_spatial_core.py to correct directory (tests/09_spatial) - Remove .tito user state from tracking (config.json, progress.json) - Delete archived CLI commands (tito/commands/_archived/) - Move standalone integration tests to tests/integration/ - Remove outdated audit/report markdown files - Remove old template and deprecated test files - Simplify .gitignore for .tito/ directory	2025-12-02 22:03:16 -05:00
Vijay Janapa Reddi	c23cc5d41b	Add CI and docs workflow status badges to README	2025-12-02 21:33:12 -05:00
Vijay Janapa Reddi	af0beb9408	Simplify top bar to match MLSysBook style - Clean navigation bar: brand left, links right - Icon+text buttons: MLSysBook, Subscribe, Star, Community - Responsive: icons only on mobile - Removed old WIP banner complexity	2025-12-02 21:24:22 -05:00
Vijay Janapa Reddi	2c5ef2e3cf	Fix tagline consistency: Don't import it. Build it. - Remove 'just' from all instances of the tagline - Update banner to lead with tagline - Consistent branding across docs, CLI, and demos	2025-12-02 21:17:41 -05:00
Vijay Janapa Reddi	660ba60d0f	Transform WIP banner into TinyTorch announcement bar - Change from yellow 'Under Construction' to blue professional banner - Add MLSysBook link, Community join CTA, GitHub link - Remove duplicate header from intro.md (banner is on all pages now) - Consistent branding across every page	2025-12-02 21:01:05 -05:00
Vijay Janapa Reddi	2b0a1db71e	Fix all tests to pass: clean up stale and misaligned test files Test cleanup: - Remove misaligned progressive integration tests (wrong module assignments) - Remove tests referencing non-existent classes (SimpleDataset, etc.) - Remove tests with wrong API signatures (learning_rate vs lr, etc.) - Fix parameter names: use_bias -> bias, weights -> weight - Fix dtype tests: expect float32 (TinyTorch standard) - Add MultiplyLayer definition where missing Results: - tito module test --all: 20/20 modules pass - pytest tests/modules: 223 tests pass Both inline tests and external tests now pass completely.	2025-12-02 14:30:31 -05:00
Vijay Janapa Reddi	bd7fcb2177	Release preparation: fix package exports, tests, and documentation Package exports: - Fix tinytorch/__init__.py to export all required components for milestones - Add Dense as alias for Linear for compatibility - Add loss functions (MSELoss, CrossEntropyLoss, BinaryCrossEntropyLoss) - Export spatial operations, data loaders, and transformer components Test infrastructure: - Create tests/conftest.py to handle path setup - Create tests/test_utils.py with shared test utilities - Rename test_progressive_integration.py files to include module number - Fix syntax errors in test files (spaces in class names) - Remove stale test file referencing non-existent modules Documentation: - Update README.md with correct milestone file names - Fix milestone requirements to match actual module dependencies Export system: - Run tito export --all to regenerate package from source modules - Ensure all 20 modules are properly exported	2025-12-02 14:19:56 -05:00
Vijay Janapa Reddi	7b93994252	Update tensor integration tests with progressive validation	2025-11-29 19:16:51 -05:00
Vijay Janapa Reddi	6fc474d61a	Remove __pycache__ files from tests/cli	2025-11-29 19:16:35 -05:00
Vijay Janapa ReddiandClaude	58fe9363f0	Simplify CLI welcome screen and remove redundant community commands Dramatically simplified the welcome screen to show only essential info: - Quick Start (3 commands) - Track Progress (2 commands) - Community (1 command) Removed redundant commands: - leaderboard -> merged into community - olympics -> merged into community These backend-dependent features are consolidated into a single community command that will handle all social features when the backend is ready. Changes: - Simplified welcome screen (10 lines vs 40+ lines) - Moved leaderboard.py and olympics.py to _archived/ - Updated all tests (45 passing) - Cleaner --help output - Updated archived README 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-29 11:41:51 -05:00
Vijay Janapa ReddiandClaude	63e6b282be	Remove demo and book commands from CLI Students can run demos directly with Python, and developers can run jupyter-book directly. The CLI wrappers don't add value. Changes: - Move demo.py and book.py to _archived/ - Remove from main.py command registry - Remove from __init__.py imports - Update test expectations (47 tests passing) - Update archived README with removal rationale 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-29 11:19:30 -05:00
Vijay Janapa ReddiandClaude	bb5e631214	Remove checkpoint command (superseded by milestones) The checkpoint command tracked 21 technical capability checkpoints, but this overlapped significantly with the milestones system which provides a more engaging, narrative-driven progress tracking experience. Changes: - Removed checkpoint command and test files - Updated milestone.py to remove checkpoint dependencies - Removed checkpoint integration from export.py, src.py, leaderboard.py - Updated CLI help text to reference milestones instead - Updated test suite (49/49 tests passing) - Archived checkpoint.py for reference Rationale: - Milestones is more engaging (historical ML achievements) - Module status already shows granular progress - Reduces duplication and confusion - Single clear progress tracking system 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-29 01:24:25 +01:00
Vijay Janapa ReddiandClaude	8d3025afc5	Refactor CLI commands into hierarchical folder structure Reorganize TinyTorch CLI from flat structure to hierarchical organization with subfolders for complex commands with subcommands. Changes: - Create subfolders: module/, system/, package/ - Move module commands: module_workflow.py → module/workflow.py - Move module_reset.py → module/reset.py - Move system commands: system.py → system/system.py - Move system subcommands: info.py, health.py, jupyter.py → system/ - Move package commands: package.py → package/package.py - Move package helpers: reset.py, nbdev.py → package/ - Archive deprecated files: clean.py, help.py, notebooks.py, status.py - Update all imports in moved files and main.py - Add __init__.py exports for each subfolder - Create comprehensive CLI test suite (52 tests) - test_cli_registry.py: Validate command registration - test_cli_execution.py: Smoke tests for all commands - test_cli_help_consistency.py: Help text validation - Update tests to match new structure Benefits: - Clear ownership: Easy to see which helpers belong to which commands - Better organization: Related files grouped together - Scales cleanly: Adding subcommands is straightforward - Zero user impact: All commands work exactly the same All 52 tests passing ✅ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-28 23:42:03 +01:00
Vijay Janapa Reddi	403d4c2f4c	Add .tito/backups and docs/_build to gitignore	2025-11-28 14:59:51 +01:00
Vijay Janapa Reddi	ba482bab71	Clean up repository by removing planning and status documents Removed 42 planning, brainstorming, and status tracking documents that served their purpose during development but are no longer needed for release. Changes: - Root: Removed 4 temporary/status files - binder/: Removed 20 planning documents (kept essential setup files) - docs/: Removed 16 planning/status documents (preserved all user-facing docs and website dependencies) - tests/: Removed 2 status documents (preserved all test docs and milestone system) Preserved files: - All user-facing documentation (README, guides, quickstarts) - All website dependencies (INSTRUCTOR_GUIDE, PRIVACY_DATA_RETENTION, TEAM_ONBOARDING) - All functional configuration files - All milestone system documentation (7 files in tests/milestones/) Updated .gitignore to prevent future accumulation of internal development files (.claude/, site/_build/, log files, progress.json)	2025-11-22 21:05:57 -05:00
Vijay Janapa Reddi	1517c6f83d	Clean up repository by removing planning and status documents Removed 42 planning, brainstorming, and status tracking documents that served their purpose during development but are no longer needed for release. Changes: - Root: Removed 4 temporary/status files - binder/: Removed 20 planning documents (kept essential setup files) - docs/: Removed 16 planning/status documents (preserved all user-facing docs and website dependencies) - tests/: Removed 2 status documents (preserved all test docs and milestone system) Preserved files: - All user-facing documentation (README, guides, quickstarts) - All website dependencies (INSTRUCTOR_GUIDE, PRIVACY_DATA_RETENTION, TEAM_ONBOARDING) - All functional configuration files - All milestone system documentation (7 files in tests/milestones/) Updated .gitignore to prevent future accumulation of internal development files (.claude/, site/_build/, log files, progress.json)	2025-11-22 21:05:57 -05:00
Vijay Janapa Reddi	c61f7ec7a6	Clean up milestone directories - Removed 30 debugging and development artifact files - Kept core system, documentation, and demo files - tests/milestones: 9 clean files (system + docs) - milestones/05_2017_transformer: 5 clean files (demos) - Clear, focused directory structure - Ready for students and developers	2025-11-22 20:30:58 -05:00
Vijay Janapa Reddi	0d6807cefb	Clean up milestone directories - Removed 30 debugging and development artifact files - Kept core system, documentation, and demo files - tests/milestones: 9 clean files (system + docs) - milestones/05_2017_transformer: 5 clean files (demos) - Clear, focused directory structure - Ready for students and developers	2025-11-22 20:30:58 -05:00
Vijay Janapa Reddi	223e5f53e1	Add milestone system with clean architecture - Single source of truth in milestone_tracker.py - Zero code duplication across codebase - Clean API: check_module_export(module_name, console) - Gamified learning experience through ML history - Progressive unlocking of 5 major milestones - Comprehensive documentation for students and developers - Integration with module workflow and CLI commands	2025-11-22 20:29:34 -05:00
Vijay Janapa Reddi	9767c78155	Add milestone system with clean architecture - Single source of truth in milestone_tracker.py - Zero code duplication across codebase - Clean API: check_module_export(module_name, console) - Gamified learning experience through ML history - Progressive unlocking of 5 major milestones - Comprehensive documentation for students and developers - Integration with module workflow and CLI commands	2025-11-22 20:29:34 -05:00
Vijay Janapa Reddi	10f27dabe0	Add comprehensive explanation of why sequence reversal is the canonical attention test Explains: - Why reversal cannot be solved without attention (no shortcuts!) - What other mechanisms fail (MLP, positional encoding, convolution) - How attention actually solves it (cross-position information flow) - Why it's better than copy/sorting/arithmetic for testing - The attention pattern visualization (anti-diagonal) - What passing this test proves about your implementation Key insight: Reversal is the simplest task that REQUIRES global attention	2025-11-22 18:01:56 -05:00
Vijay Janapa Reddi	71f58be27d	Add comprehensive explanation of why sequence reversal is the canonical attention test Explains: - Why reversal cannot be solved without attention (no shortcuts!) - What other mechanisms fail (MLP, positional encoding, convolution) - How attention actually solves it (cross-position information flow) - Why it's better than copy/sorting/arithmetic for testing - The attention pattern visualization (anti-diagonal) - What passing this test proves about your implementation Key insight: Reversal is the simplest task that REQUIRES global attention	2025-11-22 18:01:56 -05:00
Vijay Janapa Reddi	552046df92	Add Transformer capability tests with progressive difficulty - test_transformer_capabilities.py: 4 progressive tests (copy, reversal, sorting, modulus) - Sequence reversal is THE test that proves attention works - Tests train in 10s-2min each, provide clear pass/fail - Includes modulus arithmetic test as requested - Complete design document with test hierarchy and rationale - Quick start README for easy use Tests validate: - Basic forward pass (copy) - Attention mechanism (reversal) ⭐ - Multi-position reasoning (sorting) - Symbolic reasoning (modulus)	2025-11-22 17:57:34 -05:00
Vijay Janapa Reddi	7449db0944	Add Transformer capability tests with progressive difficulty - test_transformer_capabilities.py: 4 progressive tests (copy, reversal, sorting, modulus) - Sequence reversal is THE test that proves attention works - Tests train in 10s-2min each, provide clear pass/fail - Includes modulus arithmetic test as requested - Complete design document with test hierarchy and rationale - Quick start README for easy use Tests validate: - Basic forward pass (copy) - Attention mechanism (reversal) ⭐ - Multi-position reasoning (sorting) - Symbolic reasoning (modulus)	2025-11-22 17:57:34 -05:00
Vijay Janapa Reddi	5cd161f4af	Add regression prevention summary for gradient flow testing Answers the key question: Yes, we have comprehensive tests (29+) to prevent gradient flow issues in the future	2025-11-22 17:44:30 -05:00
Vijay Janapa Reddi	efea16b861	Add regression prevention summary for gradient flow testing Answers the key question: Yes, we have comprehensive tests (29+) to prevent gradient flow issues in the future	2025-11-22 17:44:30 -05:00
Vijay Janapa Reddi	24495b6ae4	Add comprehensive gradient flow testing guide Documents test hierarchy, common issues, and regression prevention strategies for maintaining gradient flow across TinyTorch modules	2025-11-22 17:43:53 -05:00
Vijay Janapa Reddi	013b1bd6a8	Add comprehensive gradient flow testing guide Documents test hierarchy, common issues, and regression prevention strategies for maintaining gradient flow across TinyTorch modules	2025-11-22 17:43:53 -05:00
Vijay Janapa Reddi	d2c20836dd	Add comprehensive unit tests for gradient flow regression prevention - test_spatial_gradient_flow.py: Tests Conv2d and MaxPool2d backward function attachment and gradient propagation - test_embedding_gradient_flow.py: Tests Embedding backward function attachment and gradient propagation - Tests verify _grad_fn attachment to prevent .data bypass issues - Tests validate gradient flow to all parameters (weight, bias) - Tests check end-to-end gradient chains - All tests pass (8/8 spatial, 6/6 embedding)	2025-11-22 17:43:02 -05:00

1 2 3 4 5