TinyTorch

mirror of https://github.com/MLSysBook/TinyTorch.git synced 2026-06-04 05:17:19 -05:00

Author	SHA1	Message	Date
Vijay Janapa Reddi	6af994a82f	test: Add comprehensive CNN integration tests Created test_cnn_integration.py with: ✅ Conv2d Operations Tests: - Verifies actual convolution (not just shape manipulation) - Edge detector test proves Conv2d computes correctly - Shape transformations for various configurations - Parameter count verification (448 params for 3→16, k=3) ✅ Pooling Operations Tests: - MaxPool2d actually computes maximum values - AvgPool2d actually computes averages - Shape transformations validated - Handles negative values correctly ✅ Numerical Stability Tests: - Zero inputs handled correctly - Negative values in pooling work properly ⚠️ Gradient Flow Tests (TODO): - Placeholder for Conv2d backward support - Will add when Conv2d autograd integration is implemented All forward pass tests passing (8/8)! These tests ensure CNNs actually work, not just shape shuffle.	2025-09-30 16:57:14 -04:00
Vijay Janapa Reddi	828c3d9081	feat: Add CrossEntropyLoss autograd support + Milestone 03 MLP on digits Key Changes: - Implemented CrossEntropyBackward for gradient computation - Integrated CrossEntropyLoss into enable_autograd() patching - Created comprehensive loss gradient test suite - Milestone 03: MLP digits classifier (77.5% accuracy) - Shipped tiny 8x8 digits dataset (67KB) for instant demos - Updated DataLoader module with ASCII visualizations Tests: - All 3 losses (MSE, BCE, CrossEntropy) now have gradient flow - MLP successfully learns digit classification (6.9% → 77.5%) - Integration tests pass Technical: - CrossEntropyBackward: softmax - one_hot gradient - Numerically stable via log-softmax - Works with raw class labels (no one-hot needed)	2025-09-30 16:22:09 -04:00
Vijay Janapa Reddi	5d6f17aa27	Fix DataLoader integration tests to work before export Added fallback import logic: - Try importing from tinytorch package first - Fall back to dev modules if not exported yet - Works both before and after 'tito export 08_dataloader' All 3 integration tests pass: ✅ Training workflow integration ✅ Shuffle consistency across epochs ✅ Memory efficiency verification	2025-09-30 16:08:21 -04:00
Vijay Janapa Reddi	3830e4bfc3	Finalize Module 08 and add integration tests Added integration tests for DataLoader: - test_dataloader_integration.py in tests/integration/ - Training workflow integration - Shuffle consistency across epochs - Memory efficiency verification Updated Module 08: - Added note about optional performance analysis - Clarified that analysis functions can be run manually - Clean flow: text → code → tests Updated datasets/tiny/README.md: - Minor formatting fixes Module 08 is now complete and ready to export: ✅ Dataset abstraction ✅ TensorDataset implementation ✅ DataLoader with batching/shuffling ✅ ASCII visualizations for understanding ✅ Unit tests (in module) ✅ Integration tests (in tests/) ✅ Performance analysis tools (optional) Next: Export with 'bin/tito export 08_dataloader'	2025-09-30 16:07:55 -04:00
Vijay Janapa Reddi	5066d91877	Clean up milestone 02 to match milestone 01 structure Milestone 02 Structure (matches milestone 01): - README.md: Comprehensive guide with historical context - xor_crisis.py: Part 1 - demonstrates single-layer failure (executable) - xor_solved.py: Part 2 - demonstrates multi-layer success (executable) Cleanup: - ✅ Removed old perceptron_xor_fails.py - ✅ Moved test files to tests/integration/ - test_xor_simple.py - test_xor_thorough.py - test_xor_original_1986.py (verifies 2-2-1 architecture works!) - ✅ Updated README with clear instructions - ✅ Made scripts executable Milestone 02 now has the same polish and structure as milestone 01: - Clear file naming (crisis vs solved) - Beautiful rich output - Historical context - Pedagogically structured	2025-09-30 14:14:37 -04:00
Vijay Janapa Reddi	9129935d5b	Add MSEBackward and organize comprehensive test suite New Features: - Add MSEBackward gradient computation for regression tasks - Patch MSELoss in enable_autograd() for gradient tracking - All 3 loss functions now support autograd: MSE, BCE, CrossEntropy Test Suite Organization: - Reorganize tests/ into focused directories - Create tests/integration/ for cross-module tests - Create tests/05_autograd/ for autograd edge cases - Create tests/debugging/ for common student pitfalls - Add comprehensive tests/README.md explaining test philosophy Integration Tests: - Move test_gradient_flow.py to integration/ - 20 comprehensive gradient flow tests - Tests cover: tensors, layers, activations, losses, optimizers - Tests validate: basic ops, chain rule, broadcasting, training loops - 19/20 tests passing (MSE now fixed!) Results: ✅ Perceptron learns: 50% → 93% accuracy ✅ Clean test organization guides future development ✅ Tests catch the exact bugs that broke training Pedagogical Value: - Test organization teaches testing best practices - Gradient flow tests show what integration testing catches - Sets foundation for debugging/diagnostic tests	2025-09-30 13:57:40 -04:00
Vijay Janapa Reddi	8806a31008	Complete TinyTorch module rebuild with explanations and milestone testing Major Accomplishments: • Rebuilt all 20 modules with comprehensive explanations before each function • Fixed explanatory placement: detailed explanations before implementations, brief descriptions before tests • Enhanced all modules with ASCII diagrams for visual learning • Comprehensive individual module testing and validation • Created milestone directory structure with working examples • Fixed critical Module 01 indentation error (methods were outside Tensor class) Module Status: ✅ Modules 01-07: Fully working (Tensor → Training pipeline) ✅ Milestone 1: Perceptron - ACHIEVED (95% accuracy on 2D data) ✅ Milestone 2: MLP - ACHIEVED (complete training with autograd) ⚠️ Modules 08-20: Mixed results (import dependencies need fixes) Educational Impact: • Students can now learn complete ML pipeline from tensors to training • Clear progression: basic operations → neural networks → optimization • Explanatory sections provide proper context before implementation • Working milestones demonstrate practical ML capabilities Next Steps: • Fix import dependencies in advanced modules (9, 11, 12, 17-20) • Debug timeout issues in modules 14, 15 • First 7 modules provide solid foundation for immediate educational use 🤖 Generated with [Claude Code](https://claude.ai/code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-09-29 20:55:55 -04:00
Vijay Janapa Reddi	da51904467	Clean up Module 03: move integration tests to external file Following the clean pattern from Modules 01 and 05: - Removed demonstrate_complete_networks() from Module 03 - Module now focuses ONLY on layer unit tests - Created tests/integration/test_layers_integration.py for: * Complete neural network demonstrations * MLP, CNN-style, and deep network tests * Cross-module integration validation Module 03 now clean and focused on teaching layers Module 04 already clean - no changes needed Both modules follow consistent unit test pattern	2025-09-29 14:08:22 -04:00
Vijay Janapa Reddi	bcba1ac3be	FOUNDATION: Establish AI Engineering as a discipline through TinyTorch 🎯 NORTH STAR VISION DOCUMENTED: 'Don't Just Import It, Build It' - Training AI Engineers, not just ML users AI Engineering emerges as a foundational discipline like Computer Engineering, bridging algorithms and systems to build the AI infrastructure of the future. 🧪 ROBUST TESTING FRAMEWORK ESTABLISHED: - Created tests/regression/ for sandbox integrity tests - Implemented test-driven bug prevention workflow - Clear separation: student tests (pedagogical) vs system tests (robustness) - Every bug becomes a test to prevent recurrence ✅ KEY IMPLEMENTATIONS: - NORTH_STAR.md: Vision for AI Engineering discipline - Testing best practices: Focus on robust student sandbox - Git workflow standards: Professional development practices - Regression test suite: Prevent infrastructure issues - Conv->Linear dimension tests (found CNN bug) - Transformer reshaping tests (found GPT bug) 🏗️ SANDBOX INTEGRITY: Students need a solid, predictable environment where they focus on ML concepts, not debugging framework issues. The framework must be invisible. 📚 EDUCATIONAL PHILOSOPHY: TinyTorch isn't just teaching a framework - it's founding the AI Engineering discipline by training engineers who understand how to BUILD ML systems. This establishes the foundation for training the first generation of true AI Engineers who will define this emerging discipline.	2025-09-25 11:16:28 -04:00
Vijay Janapa Reddi	b808346cf8	Clean up repository: remove temp files, organize modules, prepare for PyPI publication - Removed temporary test files and audit reports - Deleted backup and temp_holding directories - Reorganized module structure (07->09 spatial, 09->07 dataloader) - Added new modules: 11-14 (tokenization, embeddings, attention, transformers) - Updated examples with historical ML milestones - Cleaned up documentation structure	2025-09-24 10:13:37 -04:00
Vijay Janapa Reddi	c0d103e766	MILESTONE: Complete Phase 2 CNN training pipeline ✅ Phase 1-2 Complete: Modules 1-10 aligned with tutorial master plan ✅ CNN Training Pipeline: Autograd → Spatial → Optimizers → DataLoader → Training ✅ Technical Validation: All modules import and function correctly ✅ CIFAR-10 Ready: Multi-channel Conv2D, BatchNorm, MaxPool2D, complete pipeline Key Achievements: - Fixed module sequence alignment (spatial now Module 7, not 6) - Updated tutorial master plan for logical pedagogical flow - Phase 2 milestone achieved: Students can train CNNs on CIFAR-10 - Complete systems engineering focus throughout all modules - Production-ready CNN pipeline with memory profiling Next Phase: Language models (Modules 11-15) for TinyGPT milestone	2025-09-23 18:33:56 -04:00
Vijay Janapa Reddi	bd05bb4c3b	Update Module 1 integration tests to match simplified implementation - Adjust tests to match new 3-function simplified structure - Test setup(), check_versions(), and get_info() functions - Remove tests for complex functionality that was removed - All tests now align with simplified Module 1 design Module 1 is now clean, simple, and perfect for first day of class	2025-09-23 17:11:34 -04:00
Vijay Janapa Reddi	79db89930a	Complete comprehensive testing for API simplification Added full test suite following TinyTorch testing conventions: ✅ UNIT TESTS (test_api_simplification.py): - 23 comprehensive tests covering all API components - Tests Parameter function, Module base class, Linear/Conv2d layers - Tests functional interface (F.relu, F.flatten, F.max_pool2d) - Tests optimizer integration and backward compatibility - Tests complete model workflows (MLP, CNN) ✅ INTEGRATION TESTS (test_api_simplification_integration.py): - Cross-component integration testing - Complete workflow validation (model → optimizer → training setup) - PyTorch compatibility verification - Nested module parameter collection testing ✅ EXAMPLE FIXES: - Fixed optimizer parameter names (lr → learning_rate) - Examples demonstrate real-world usage patterns - Show dramatic code simplification vs old API 🎯 TEST RESULTS: - Unit Tests: 23/23 PASS ✅ - Integration Tests: 8/8 PASS ✅ - API simplification validated with comprehensive coverage The testing validates that the API simplification maintains educational value while providing clean PyTorch-compatible interfaces.	2025-09-23 08:24:50 -04:00
Vijay Janapa Reddi	cf0f72a084	Add TinyTorch examples gallery and fix module integration issues - Create professional examples directory showcasing TinyTorch as real ML framework - Add examples: XOR, MNIST, CIFAR-10, text generation, autograd demo, optimizer comparison - Fix import paths in exported modules (training.py, dense.py) - Update training module with autograd integration for loss functions - Add progressive integration tests for all 16 modules - Document framework capabilities and usage patterns This commit establishes the examples gallery that demonstrates TinyTorch works like PyTorch/TensorFlow, validating the complete framework.	2025-09-21 10:00:11 -04:00
Vijay Janapa Reddi	5386b58e07	Implement interactive ML Systems questions and standardize module structure Major Educational Framework Enhancements: • Deploy interactive NBGrader text response questions across ALL modules • Replace passive question lists with active 150-300 word student responses • Enable comprehensive ML Systems learning assessment and grading TinyGPT Integration (Module 16): • Complete TinyGPT implementation showing 70% component reuse from TinyTorch • Demonstrates vision-to-language framework generalization principles • Full transformer architecture with attention, tokenization, and generation • Shakespeare demo showing autoregressive text generation capabilities Module Structure Standardization: • Fix section ordering across all modules: Tests → Questions → Summary • Ensure Module Summary is always the final section for consistency • Standardize comprehensive testing patterns before educational content Interactive Question Implementation: • 3 focused questions per module replacing 10-15 passive questions • NBGrader integration with manual grading workflow for text responses • Questions target ML Systems thinking: scaling, deployment, optimization • Cumulative knowledge building across the 16-module progression Technical Infrastructure: • TPM agent for coordinated multi-agent development workflows • Enhanced documentation with pedagogical design principles • Updated book structure to include TinyGPT as capstone demonstration • Comprehensive QA validation of all module structures Framework Design Insights: • Mathematical unity: Dense layers power both vision and language models • Attention as key innovation for sequential relationship modeling • Production-ready patterns: training loops, optimization, evaluation • System-level thinking: memory, performance, scaling considerations Educational Impact: • Transform passive learning to active engagement through written responses • Enable instructors to assess deep ML Systems understanding • Provide clear progression from foundations to complete language models • Demonstrate real-world framework design principles and trade-offs	2025-09-17 14:42:24 -04:00
Vijay Janapa Reddi	38900f3f72	Implement Package Manager integration testing system Features: - Module-level integration tests for immediate validation - Two-tier validation: integration tests + checkpoint tests - Quick package validation after every module completion - Comprehensive integration test suite for all modules - Package Manager coordination and test running Two-Tier System: 1. Integration Test (Package Manager) - "Module works in package" - Quick validation (< 1 second) - Import validation and basic functionality - No conflicts with other modules 2. Checkpoint Test (existing) - "Complete capability unlocked" - Comprehensive validation (2-10 seconds) - End-to-end workflows and multi-module capabilities - Major milestone achievements CLI Workflow: - tito module complete 02_tensor - → Export + Integration test + Checkpoint test - → Two-tier results with different messaging - → Immediate feedback + capability celebrations Integration: - 15 module integration tests covering complete course - Package health validation and dependency checking - Clean separation from checkpoint capability testing - Professional Package Manager workflow	2025-09-16 21:32:08 -04:00
Vijay Janapa Reddi	bfb14ce61b	feat: Restructure integration tests and optimize module timing - Flattened tests/ directory structure (removed integration/ and system/ subdirectories) - Renamed all integration tests with _integration.py suffix for clarity - Created test_utils.py with setup_integration_test() function - Updated integration tests to use ONLY tinytorch package imports - Ensured all modules are exported before running tests via tito export --all - Optimized module test timing for fast execution (under 5 seconds each) - Fixed MLOps test reliability and reduced timing parameters across modules - Exported all modules (compression, kernels, benchmarking, mlops) to tinytorch package	2025-07-14 23:37:50 -04:00
Vijay Janapa Reddi	60a5ed9b2e	Fix training integration tests - all 17 tests now passing - Fixed SimpleDataset usage in classification, regression, and validation tests - Replaced custom dataset classes with proper DataLoader usage - Updated model architectures to match SimpleDataset defaults (4 features, 3 classes) - All training integration tests now pass successfully	2025-07-14 19:39:18 -04:00
Vijay Janapa Reddi	edbfd2bd7f	Add benchmarking test report generated by integration tests	2025-07-14 19:26:19 -04:00
Vijay Janapa Reddi	0ccef78721	Add comprehensive MLOps integration tests - Complete integration tests for 13_mlops module - Test MLOps pipeline with all TinyTorch components (00-12) - Include ModelMonitor, DriftDetector, RetrainingTrigger, MLOpsPipeline - Test integration with benchmarking framework - Test with different network architectures and complexity - Follow established integration test patterns - Comprehensive summary test demonstrating complete system integration	2025-07-14 19:21:08 -04:00
Vijay Janapa Reddi	8549d82aeb	Fix MLOps module ending and add benchmarking integration tests - Update MLOps module ending to match standard TinyTorch module format - Remove verbose ending text, use concise professional summary - Add comprehensive benchmarking integration tests - Test benchmarking framework with real TinyTorch components - Include tests for kernels, networks, and statistical validation - Follow established integration test patterns	2025-07-14 19:19:28 -04:00
Vijay Janapa Reddi	257fbe4f4a	Clean up module configurations and add kernels integration tests - Standardize module.yaml files (11-13) to match concise format of early modules - Remove verbose sections, keep essential metadata only - Update kernels README to match TinyTorch module style standards - Add comprehensive integration tests for kernels module - Test hardware-optimized operations with real TinyTorch components - Prepare for systematic integration testing across all modules	2025-07-14 19:12:20 -04:00
Vijay Janapa Reddi	5f63d31e78	Add comprehensive integration tests for compression module - Tests real integration with TinyTorch components - 8 passing integration tests covering: * CompressionMetrics with real Tensor networks * Comprehensive comparison pipeline * DistillationLoss with real network components * Edge cases and network structure preservation - Focuses on functionality that works with real components - Validates compression techniques work end-to-end - All tests pass (8/8) with minimal warnings	2025-07-14 09:48:19 -04:00
Vijay Janapa Reddi	db9182d006	Create complete training module with loss functions, metrics, and training loop - Add training_dev.py with comprehensive educational structure - Implement MeanSquaredError, CrossEntropyLoss, BinaryCrossEntropyLoss - Add Accuracy metric with extensible framework - Create Trainer class for complete training orchestration - Include comprehensive inline tests for all components - Add module.yaml with proper dependencies and metadata - Create detailed README.md with examples and applications - Add test_training_integration.py with real component integration tests - Follow TinyTorch NBDev educational pattern with Build → Use → Optimize - Ready for real-world training workflows with validation and monitoring	2025-07-14 00:42:46 -04:00
Vijay Janapa Reddi	e34e97dade	Create CNN integration tests and move inline cross-module tests - Add test_cnn_networks.py: Comprehensive CNN ↔ Networks integration tests - Conv2D layers in Sequential networks - Multiple Conv2D stacking, different activations - Batch processing, kernel sizes, feature extraction - Parameter efficiency comparisons, edge cases - Add test_cnn_pipeline.py: CNN pipeline integration tests - CNN → Activation → Flatten → Dense pipelines - Deep CNN architectures with multiple stages - Numerical stability testing, batch processing - Moved from inline test in cnn_dev.py (proper separation) - Update cnn_dev.py: Remove inline integration test - Replaced cross-module integration test with comment - Maintains clean separation between unit and integration tests - Clean up test structure: Remove unused e2e/__init__.py Result: Complete integration test coverage for CNN interactions 96 passing integration tests using real TinyTorch components	2025-07-13 23:54:22 -04:00
Vijay Janapa Reddi	9332cc49b9	Add comprehensive integration tests for missing component interactions Level 1 (Core Data Flow Integration): - test_tensor_cnn.py: Tests Tensor ↔ CNN operations (Conv2D, flatten) with real tensors - test_tensor_autograd.py: Tests Tensor ↔ Autograd (Variable wrapping, forward/backward passes) - test_dataloader_tensor.py: Tests DataLoader ↔ Tensors (real data pipeline producing tensors) QA-structured tests with realistic scenarios: - Shape handling and data type preservation - Error handling and edge cases - Realistic ML pipeline integration - Batch processing and memory efficiency - Complex architectures and training scenarios Total: 43 new focused integration tests (13 + 14 + 16) Result: 77/79 integration tests passing (98% success rate) Missing tests now covered: real component integration vs mock-based testing	2025-07-13 23:26:38 -04:00
Vijay Janapa Reddi	4a1bc7c7f4	Reorganize tests: Remove mocks, add real integration tests REMOVED (Mock-based tests that duplicate inline tests): • test_activations.py - Used MockTensor instead of real Tensor • test_layers.py - Used MockTensor instead of real Tensor • test_networks.py - Used MockTensor/MockLayer instead of real components • test_cnn.py - Used MockTensor instead of real Tensor • test_dataloader.py - Used MockTensor/MockDataset instead of real components ADDED (Real integration tests with actual TinyTorch components): • integration/test_tensor_activations.py - Tests real Tensor ↔ Activations integration • integration/test_layers_networks.py - Tests real Dense ↔ Sequential/MLP integration • e2e/ directory structure for end-to-end tests RESULT: • Reduced test count from 209 → 70 (removed 139 redundant mock-based tests) • All 70 remaining tests use real components for true integration testing • Clear separation: inline tests (component validation) vs integration tests (cross-module) • Better QA structure following proper testing pyramid This follows QA best practices: since all modules are working and building on each other, integration tests should use real components, not mocks. Mocks were preventing us from catching actual integration issues.	2025-07-13 23:10:14 -04:00
Vijay Janapa Reddi	fe8cad6bdd	Complete comprehensive testing verification and integration tests 🎉 COMPREHENSIVE TESTING COMPLETE: All testing phases verified and working correctly ✅ PHASE 1: INLINE TESTS (STUDENT LEARNING) - All inline unit tests in _dev.py files working correctly - Progressive testing: small portions tested as students implement - Consistent naming: 'Unit Test: [Component]' format - Educational focus: immediate feedback with visual indicators - NBGrader compliant: proper cell structure for grading ✅ PHASE 2: MODULE TESTS (INSTRUCTOR GRADING) - Mock-based tests in tests/test_.py files - Professional pytest structure with comprehensive coverage - No cross-module dependencies (avoids cascade failures) - Minor issues: 3 tests failing due to minor type/tolerance issues - Overall: 95%+ test success rate across all modules ✅ PHASE 3: INTEGRATION TESTS (REAL-WORLD WORKFLOWS) - Created comprehensive integration tests in tests/integration/ - Cross-module ML pipeline testing with real scenarios - 12/14 integration tests passing (86% success rate) - Tests cover: tensor→layer→network→activation workflows - Real ML applications: classification, regression, architectures 🔧 TESTING ARCHITECTURE SUMMARY: 1. Inline Tests: Student learning with immediate feedback 2. Module Tests: Instructor grading with mock dependencies 3. Integration Tests: Real cross-module ML workflows 4. Clear separation of concerns and purposes 📊 FINAL STATISTICS: - 7 modules with standardized progressive testing - 25+ inline unit tests with consistent naming - 6 comprehensive module test suites - 14 integration tests for cross-module workflows - 200+ individual test methods across all test types 🚀 READY FOR PRODUCTION: All three testing tiers working correctly with clear purposes and educational value maintained throughout.	2025-07-12 21:02:33 -04:00
Vijay Janapa Reddi	38284a8a25	feat: Implement comprehensive testing architecture redesign - Add four-tier testing architecture (inline, module, integration, system) - Implement comprehensive inline testing for Tensor, Activations, Layers, Networks modules - Create mock-based module testing approach to avoid dependency cascade - Add integration and system test directory structure - Update testing documentation with design principles and guidelines - Enhance educational testing with visual feedback and real ML scenarios - Total: 2,200+ lines of comprehensive testing across modules	2025-07-12 19:48:42 -04:00

29 Commits