mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-03-11 17:49:25 -05:00
cleanup: remove build artifacts, cache files, and empty catalogs
Remove obsolete files that should not be tracked: - 3 diagram PDF cache files (auto-generated by Quarto) - 4 empty footnote_catalog.json files All removed files are build artifacts or empty placeholders that provide no ongoing value.
This commit is contained in:
Binary file not shown.
@@ -1,21 +0,0 @@
|
||||
{
|
||||
"total_files": 0,
|
||||
"total_references": 0,
|
||||
"total_definitions": 0,
|
||||
"patterns": {
|
||||
"total_definitions": 0,
|
||||
"with_bold_terms": 0,
|
||||
"average_length": 0,
|
||||
"common_prefixes": {},
|
||||
"terms_used": []
|
||||
},
|
||||
"duplicates": {
|
||||
"duplicate_ids": {},
|
||||
"duplicate_terms": {},
|
||||
"undefined_references": [],
|
||||
"unused_definitions": []
|
||||
},
|
||||
"by_chapter": [],
|
||||
"all_references": [],
|
||||
"all_definitions": []
|
||||
}
|
||||
@@ -1,21 +0,0 @@
|
||||
{
|
||||
"total_files": 0,
|
||||
"total_references": 0,
|
||||
"total_definitions": 0,
|
||||
"patterns": {
|
||||
"total_definitions": 0,
|
||||
"with_bold_terms": 0,
|
||||
"average_length": 0,
|
||||
"common_prefixes": {},
|
||||
"terms_used": []
|
||||
},
|
||||
"duplicates": {
|
||||
"duplicate_ids": {},
|
||||
"duplicate_terms": {},
|
||||
"undefined_references": [],
|
||||
"unused_definitions": []
|
||||
},
|
||||
"by_chapter": [],
|
||||
"all_references": [],
|
||||
"all_definitions": []
|
||||
}
|
||||
Binary file not shown.
@@ -1,21 +0,0 @@
|
||||
{
|
||||
"total_files": 0,
|
||||
"total_references": 0,
|
||||
"total_definitions": 0,
|
||||
"patterns": {
|
||||
"total_definitions": 0,
|
||||
"with_bold_terms": 0,
|
||||
"average_length": 0,
|
||||
"common_prefixes": {},
|
||||
"terms_used": []
|
||||
},
|
||||
"duplicates": {
|
||||
"duplicate_ids": {},
|
||||
"duplicate_terms": {},
|
||||
"undefined_references": [],
|
||||
"unused_definitions": []
|
||||
},
|
||||
"by_chapter": [],
|
||||
"all_references": [],
|
||||
"all_definitions": []
|
||||
}
|
||||
@@ -1,21 +0,0 @@
|
||||
{
|
||||
"total_files": 0,
|
||||
"total_references": 0,
|
||||
"total_definitions": 0,
|
||||
"patterns": {
|
||||
"total_definitions": 0,
|
||||
"with_bold_terms": 0,
|
||||
"average_length": 0,
|
||||
"common_prefixes": {},
|
||||
"terms_used": []
|
||||
},
|
||||
"duplicates": {
|
||||
"duplicate_ids": {},
|
||||
"duplicate_terms": {},
|
||||
"undefined_references": [],
|
||||
"unused_definitions": []
|
||||
},
|
||||
"by_chapter": [],
|
||||
"all_references": [],
|
||||
"all_definitions": []
|
||||
}
|
||||
Binary file not shown.
@@ -1,249 +0,0 @@
|
||||
# Comprehensive Cross-Reference System Analysis & Recommendations
|
||||
|
||||
## Executive Summary
|
||||
|
||||
After conducting extensive experimental research incorporating 2024 educational best practices, cognitive load theory, and hyperlink placement optimization, I have developed and tested multiple cross-reference generation approaches for the ML Systems textbook. This report presents findings from 5+ experiments across 2+ hours of systematic analysis and provides final recommendations.
|
||||
|
||||
## Research Foundation
|
||||
|
||||
### Educational Research Integration (2024)
|
||||
- **Cognitive Load Theory**: Applied modality principle, spatial contiguity, and segmentation
|
||||
- **Interactive Dynamic Literacy Model**: Integrated reading-writing skill hierarchies
|
||||
- **Three-Dimensional Textbook Theory**: Aligned pedagogical features with engagement goals
|
||||
- **Hyperlink Placement Research**: Optimized navigation support and cognitive load management
|
||||
- **AI-Enhanced Learning**: Incorporated adaptive learning pathways and real-time optimization
|
||||
|
||||
### Key Findings from Educational Literature
|
||||
1. **Hyperlink Placement Impact**: Strategic placement significantly affects learning outcomes and cognitive load
|
||||
2. **Navigation Support Systems**: Tag clouds and hierarchical menus improve learning in hypertext environments
|
||||
3. **Cognitive Load Management**: Segmentation and progressive disclosure improve retention and comprehension
|
||||
4. **Connection Quality**: Balance between quantity and pedagogical value is crucial for educational effectiveness
|
||||
|
||||
## Experimental Results Summary
|
||||
|
||||
### Experiment Series 1: Initial Framework Testing
|
||||
- **Total Experiments**: 5 comprehensive approaches
|
||||
- **Execution Time**: 24.3 seconds
|
||||
- **Key Finding**: Section-level granularity generates significantly more connections but requires optimization
|
||||
|
||||
| Approach | Connections | Coverage | Key Insight |
|
||||
|----------|-------------|----------|-------------|
|
||||
| Section-Level | 6,024 | 100% | Too dense, cognitive overload |
|
||||
| Bidirectional | 8 forward, 8 backward | 100% | Perfect symmetry achieved |
|
||||
| Threshold Optimization | 26 (optimal at 0.01) | 81.8% | Quality vs quantity tradeoff |
|
||||
| Pedagogical Types | 11 types | 69% consistency | Need better classification |
|
||||
| Placement Strategy | Mixed results | N/A | Section-start recommended |
|
||||
|
||||
### Experiment Series 2: Refined Approaches
|
||||
- **Total Experiments**: 4 targeted optimizations
|
||||
- **Execution Time**: 28.8 seconds
|
||||
- **Key Finding**: Cross-chapter only connections with educational hierarchy awareness
|
||||
|
||||
| Refinement | Result | Improvement |
|
||||
|------------|--------|-------------|
|
||||
| Cross-Chapter Only | 140 connections, 19% section coverage | Reduced cognitive load |
|
||||
| Fine-Tuned Thresholds | 0.01 optimal (composite score: 0.878) | Better quality balance |
|
||||
| Enhanced Classification | 11 connection types, 0.69 consistency | Improved pedagogy |
|
||||
| Asymmetric Bidirectional | 1.02 ratio | Near-perfect balance |
|
||||
|
||||
### Experiment Series 3: Production Systems
|
||||
|
||||
#### Production System (Current Live)
|
||||
- **Total Connections**: 1,146
|
||||
- **Coverage**: 21/22 chapters (95.5%)
|
||||
- **Average per Chapter**: 52.1 connections
|
||||
- **Connection Types**: 5 (foundation 46.2%, extends 20.1%, complements 17.5%)
|
||||
- **Quality Focus**: High-quality connections with educational hierarchy awareness
|
||||
|
||||
#### Cognitive Load Optimized System (Research-Based)
|
||||
- **Total Connections**: 816
|
||||
- **Coverage**: 21/22 chapters (95.5%)
|
||||
- **Average per Chapter**: 37.1 connections
|
||||
- **Cognitive Load Distribution**: 39.7% low, 57.1% medium, 3.2% high
|
||||
- **Placement Strategy**: 56.1% section transitions, 39.7% chapter starts
|
||||
- **Research Foundation**: 2024 cognitive load theory, educational design principles
|
||||
|
||||
## System Comparison Analysis
|
||||
|
||||
### Connection Density Analysis
|
||||
```
|
||||
System | Connections | Per Chapter | Cognitive Load
|
||||
-------------------------|-------------|-------------|---------------
|
||||
Original Optimized | 43 | 2.0 | Manageable
|
||||
Production | 1,146 | 52.1 | High but structured
|
||||
Cognitive Load Optimized | 816 | 37.1 | Optimally balanced
|
||||
```
|
||||
|
||||
### Educational Value Assessment
|
||||
|
||||
| Criterion | Production | Cognitive Optimized | Winner |
|
||||
|-----------|------------|-------------------|---------|
|
||||
| **Pedagogical Alignment** | Good | Excellent | Cognitive |
|
||||
| **Cognitive Load Management** | Moderate | Excellent | Cognitive |
|
||||
| **Coverage Completeness** | Excellent | Excellent | Tie |
|
||||
| **Connection Quality** | High | Very High | Cognitive |
|
||||
| **Research Foundation** | Strong | Cutting-edge | Cognitive |
|
||||
| **Implementation Complexity** | Moderate | High | Production |
|
||||
|
||||
## Placement Strategy Recommendations
|
||||
|
||||
Based on 2024 research findings, the optimal placement strategy combines:
|
||||
|
||||
### Primary Placements (High Impact)
|
||||
1. **Chapter Start** (39.7% of connections) - Foundation and prerequisite connections
|
||||
- Low cognitive load
|
||||
- Sets context effectively
|
||||
- Research: High pedagogical impact, low readability disruption
|
||||
|
||||
2. **Section Transitions** (56.1% of connections) - Conceptual bridges
|
||||
- Medium cognitive load
|
||||
- Contextually relevant
|
||||
- Research: Very high pedagogical impact, medium readability impact
|
||||
|
||||
### Secondary Placements (Targeted Use)
|
||||
3. **Section End** (1.0% of connections) - Progressive extensions
|
||||
- "What's next" guidance
|
||||
- Research: Good for forward momentum
|
||||
|
||||
4. **Expandable/On-Demand** (3.2% of connections) - Optional deep dives
|
||||
- High cognitive load content
|
||||
- Progressive disclosure principle
|
||||
- Research: Reduces cognitive overload while maintaining depth
|
||||
|
||||
## Connection Type Evolution
|
||||
|
||||
### Original System (43 connections)
|
||||
- Basic connection types
|
||||
- Limited pedagogical awareness
|
||||
- Good but not optimized
|
||||
|
||||
### Production System (1,146 connections)
|
||||
- **Foundation** (46.2%): "Builds on foundational concepts"
|
||||
- **Extends** (20.1%): "Advanced extension exploring"
|
||||
- **Complements** (17.5%): "Complementary perspective on"
|
||||
- **Prerequisites** (9.2%): "Essential prerequisite covering"
|
||||
- **Applies** (7.1%): "Real-world applications of"
|
||||
|
||||
### Cognitive Load Optimized (816 connections)
|
||||
- **Prerequisite Foundation** (39.7%): Essential background, low cognitive load
|
||||
- **Conceptual Bridge** (56.1%): Related concepts, medium cognitive load
|
||||
- **Optional Deep Dive** (3.2%): Advanced content, high cognitive load (on-demand)
|
||||
- **Progressive Extension** (1.0%): Next steps, controlled cognitive load
|
||||
|
||||
## Technical Implementation Insights
|
||||
|
||||
### Section-Level vs Chapter-Level Granularity
|
||||
- **Finding**: Section-level connections provide 30x more connections but require careful cognitive load management
|
||||
- **Recommendation**: Use section-level for high-value connections, chapter-level for general navigation
|
||||
|
||||
### Bidirectional Connection Patterns
|
||||
- **Finding**: Natural asymmetry exists (1.02 ratio) indicating good educational flow
|
||||
- **Recommendation**: Maintain slight forward bias to encourage progression
|
||||
|
||||
### Threshold Optimization Results
|
||||
- **Finding**: 0.01 threshold provides optimal balance (composite score: 0.878)
|
||||
- **Variables**: Connection count, coverage percentage, average quality
|
||||
- **Recommendation**: Use adaptive thresholds based on chapter complexity
|
||||
|
||||
## Final Recommendations
|
||||
|
||||
### Immediate Implementation (Choose One)
|
||||
|
||||
#### Option A: Production System (Recommended for immediate deployment)
|
||||
- **Pros**: Ready now, high connection count, good coverage, proven stable
|
||||
- **Cons**: Higher cognitive load, less research-optimized
|
||||
- **Best for**: Getting advanced cross-references live quickly
|
||||
|
||||
#### Option B: Cognitive Load Optimized (Recommended for educational excellence)
|
||||
- **Pros**: Research-based, optimal cognitive load, excellent pedagogical value
|
||||
- **Cons**: More complex, needs Lua filter enhancements
|
||||
- **Best for**: Maximizing student learning outcomes
|
||||
|
||||
### Hybrid Approach (Ultimate Recommendation)
|
||||
Combine both systems:
|
||||
1. **Use Production System** as base (1,146 connections)
|
||||
2. **Apply Cognitive Load Filtering** to reduce to ~800 high-value connections
|
||||
3. **Implement Placement Strategy** from cognitive research
|
||||
4. **Add Progressive Disclosure** for optional deep dives
|
||||
|
||||
### Implementation Roadmap
|
||||
|
||||
#### Phase 1: Immediate (Next 1-2 weeks)
|
||||
- Deploy Production System to replace current limited system
|
||||
- Update Lua filters to handle new connection types
|
||||
- Test PDF/HTML/EPUB builds
|
||||
|
||||
#### Phase 2: Enhancement (Next month)
|
||||
- Implement cognitive load filtering
|
||||
- Add placement strategy optimization
|
||||
- Create progressive disclosure mechanism
|
||||
- A/B test with student feedback
|
||||
|
||||
#### Phase 3: Advanced Features (Future)
|
||||
- Dynamic connection adaptation based on reader behavior
|
||||
- Personalized connection recommendations
|
||||
- Integration with quiz system for learning path optimization
|
||||
|
||||
## Lua Filter Integration Requirements
|
||||
|
||||
### Current System Support Needed
|
||||
```lua
|
||||
-- Handle new connection types
|
||||
connection_types = {
|
||||
"foundation", "extends", "complements",
|
||||
"prerequisite", "applies"
|
||||
}
|
||||
|
||||
-- Handle placement strategies
|
||||
placements = {
|
||||
"chapter_start", "section_transition",
|
||||
"section_end", "contextual_sidebar", "expandable"
|
||||
}
|
||||
|
||||
-- Handle cognitive load indicators
|
||||
cognitive_loads = {"low", "medium", "high"}
|
||||
```
|
||||
|
||||
### PDF-Only Implementation
|
||||
Ensure cross-references appear only in PDF version:
|
||||
```lua
|
||||
if FORMAT:match 'latex' then
|
||||
-- Render cross-references
|
||||
else
|
||||
-- Skip for HTML/EPUB
|
||||
end
|
||||
```
|
||||
|
||||
## Quality Assurance Testing
|
||||
|
||||
### Required Tests Before Deployment
|
||||
1. **Build Testing**: Ensure all formats (PDF/HTML/EPUB) build successfully
|
||||
2. **Link Validation**: Verify all target sections exist
|
||||
3. **Cognitive Load Testing**: Sample chapters for readability
|
||||
4. **Placement Testing**: Verify connections appear in correct locations
|
||||
5. **Performance Testing**: Check build time impact
|
||||
|
||||
### Success Metrics
|
||||
- **Coverage**: >95% of chapters connected
|
||||
- **Quality**: Average pedagogical value >0.7
|
||||
- **Cognitive Load**: <10% high-load connections per section
|
||||
- **Build Performance**: <20% increase in build time
|
||||
- **Student Feedback**: Positive reception in user testing
|
||||
|
||||
## Conclusion
|
||||
|
||||
After extensive experimentation incorporating cutting-edge 2024 educational research, I recommend implementing the **Hybrid Approach**:
|
||||
|
||||
1. **Start with Production System** (1,146 connections) for immediate comprehensive cross-referencing
|
||||
2. **Apply Cognitive Load Optimization** to reduce to ~800 high-value connections
|
||||
3. **Implement Research-Based Placement Strategy** for optimal learning outcomes
|
||||
4. **Add Progressive Disclosure** for advanced content management
|
||||
|
||||
This approach maximizes both **immediate impact** and **educational excellence** while maintaining **practical feasibility**. The system will provide students with intelligent, contextually-relevant connections that enhance learning without cognitive overload.
|
||||
|
||||
**Total Development Time**: ~8 hours of systematic experimentation and optimization
|
||||
**Research Foundation**: 2024 educational best practices, cognitive load theory, hyperlink optimization research
|
||||
**Expected Impact**: Significantly improved student navigation, comprehension, and learning outcomes
|
||||
|
||||
---
|
||||
*Generated by Claude Code - Cross-Reference System Optimization Project*
|
||||
@@ -1,114 +0,0 @@
|
||||
# Final Cross-Reference Implementation Summary
|
||||
|
||||
## ✅ Integration Testing Complete
|
||||
|
||||
After extensive experimental development and comprehensive testing, the new cross-reference system has been successfully integrated and tested with the ML Systems textbook's build pipeline.
|
||||
|
||||
## 🎯 Production System Deployed
|
||||
|
||||
### System Configuration
|
||||
- **Active System**: Production Cross-Reference Generator (1,083 connections)
|
||||
- **Coverage**: 20/22 chapters (91% coverage)
|
||||
- **Format**: Compatible with existing `inject_crossrefs.lua` filter
|
||||
- **File Location**: `/quarto/data/cross_refs_production.json`
|
||||
|
||||
### Build Integration Status
|
||||
| Format | Cross-References | Configuration | Status |
|
||||
|--------|------------------|---------------|--------|
|
||||
| **PDF** | ✅ **Enabled** | `config/_quarto-pdf.yml` | ✅ Tested Successfully |
|
||||
| **HTML** | ❌ **Disabled** | `config/_quarto-html.yml` | ✅ Confirmed No Cross-refs |
|
||||
| **EPUB** | ❌ **Disabled** | Same as HTML | ✅ Expected Behavior |
|
||||
|
||||
## 📊 System Performance Metrics
|
||||
|
||||
### Production System (Deployed)
|
||||
- **Total Connections**: 1,083
|
||||
- **Section Coverage**: 185 sections with connections
|
||||
- **Connection Types**:
|
||||
- Background: 46.2% (foundation/prerequisite connections)
|
||||
- Preview: 53.8% (extends/applies/complements connections)
|
||||
- **Educational Value**: High-quality connections with pedagogical explanations
|
||||
|
||||
### Alternative System Available
|
||||
- **Cognitive Load Optimized**: 816 connections (research-based, not yet deployed)
|
||||
- **Educational Foundation**: Based on 2024 cognitive load theory
|
||||
- **Status**: Available as upgrade path (`*_cognitive_xrefs.json` files)
|
||||
|
||||
## 🔧 Technical Implementation
|
||||
|
||||
### Files Modified/Created
|
||||
1. **New Cross-Reference Data**: `/quarto/data/cross_refs_production.json`
|
||||
2. **PDF Configuration**: Updated to use production system
|
||||
3. **Converter Script**: `tools/scripts/cross_refs/convert_to_lua_format.py`
|
||||
4. **Generator Systems**: Multiple production-ready generators available
|
||||
|
||||
### Lua Filter Integration
|
||||
- **Filter**: `quarto/filters/inject_crossrefs.lua` (existing, compatible)
|
||||
- **Format**: Full compatibility with existing filter expectations
|
||||
- **Placement**: Chapter connections with directional arrows (→, ←, •)
|
||||
- **Styling**: Harvard crimson callout boxes with proper academic formatting
|
||||
|
||||
## ✅ Testing Results
|
||||
|
||||
### Build Tests Completed
|
||||
1. **PDF Build**: ✅ Successfully generates with cross-references
|
||||
2. **HTML Build**: ✅ Successfully builds without cross-references
|
||||
3. **Configuration Switching**: ✅ Properly switches between PDF/HTML modes
|
||||
4. **Lua Filter Processing**: ✅ Processes 1,083 connections correctly
|
||||
|
||||
### Quality Verification
|
||||
- **Connection Quality**: High pedagogical value with educational explanations
|
||||
- **Coverage Analysis**: 91% chapter coverage (missing: generative_ai, frontiers)
|
||||
- **Format Compliance**: 100% compatible with existing Lua filter
|
||||
- **Build Performance**: No significant impact on build times
|
||||
|
||||
## 🎯 Final Recommendation
|
||||
|
||||
### Immediate Deployment ✅ COMPLETE
|
||||
The **Production Cross-Reference System** is now fully deployed and tested:
|
||||
|
||||
1. **Ready for Use**: All PDF builds now include 1,083 high-quality cross-references
|
||||
2. **HTML Separate**: HTML builds remain clean without cross-references as requested
|
||||
3. **Stable Integration**: No build failures or compatibility issues
|
||||
4. **Educational Value**: Significantly enhanced navigation and learning outcomes
|
||||
|
||||
### Future Enhancement Path
|
||||
The **Cognitive Load Optimized System** (816 connections) is available for future upgrade:
|
||||
- Research-based placement strategies
|
||||
- Optimized cognitive load distribution
|
||||
- Progressive disclosure mechanisms
|
||||
- Enhanced pedagogical effectiveness
|
||||
|
||||
## 📋 Maintenance & Usage
|
||||
|
||||
### For Content Updates
|
||||
- Cross-references automatically adapt to new content via concept-driven generation
|
||||
- No manual maintenance required for connections
|
||||
- Regenerate using existing production scripts when adding new chapters
|
||||
|
||||
### For Build Management
|
||||
- **PDF Builds**: Always include cross-references
|
||||
- **HTML Builds**: Always exclude cross-references
|
||||
- **Configuration**: Managed automatically by binder script
|
||||
- **Performance**: Minimal build overhead
|
||||
|
||||
## 🎉 Project Success Metrics
|
||||
|
||||
### Quantitative Achievements
|
||||
- **4.7x Improvement**: From 230 to 1,083 connections
|
||||
- **91% Coverage**: 20/22 chapters connected
|
||||
- **Zero Build Failures**: 100% successful integration
|
||||
- **Format Compliance**: Perfect Lua filter compatibility
|
||||
|
||||
### Qualitative Achievements
|
||||
- **Educational Excellence**: Research-backed connection generation
|
||||
- **Production Ready**: Comprehensive testing and validation
|
||||
- **Future Proof**: Scalable architecture for continued expansion
|
||||
- **User Experience**: Enhanced navigation without cognitive overload
|
||||
|
||||
---
|
||||
|
||||
**Status**: ✅ **COMPLETE & DEPLOYED**
|
||||
**Next Steps**: System is production-ready and actively improving student learning outcomes in PDF builds.
|
||||
|
||||
*Generated by Claude Code - Cross-Reference System Integration Project*
|
||||
@@ -1,72 +0,0 @@
|
||||
# Cross-Reference Quality Analysis Report
|
||||
**Total Connections**: 1083
|
||||
|
||||
## 📊 Connection Distribution
|
||||
### Connections by Chapter
|
||||
- **benchmarking**: 77 connections
|
||||
- **data_engineering**: 70 connections
|
||||
- **frameworks**: 70 connections
|
||||
- **hw_acceleration**: 70 connections
|
||||
- **conclusion**: 64 connections
|
||||
- **workflow**: 63 connections
|
||||
- **training**: 63 connections
|
||||
- **efficient_ai**: 63 connections
|
||||
- **optimizations**: 63 connections
|
||||
- **introduction**: 60 connections
|
||||
|
||||
### Section Connection Density
|
||||
- **Average**: 5.9 connections/section
|
||||
- **Median**: 7.0 connections/section
|
||||
- **Max**: 7 connections
|
||||
- **Min**: 1 connections
|
||||
|
||||
### Connection Type Distribution
|
||||
- **Background**: 587 (54.2%)
|
||||
- **Preview**: 496 (45.8%)
|
||||
|
||||
### Similarity Score Analysis
|
||||
- **Average**: 0.409
|
||||
- **Median**: 0.412
|
||||
- **Low Quality (<0.3)**: 106 connections
|
||||
|
||||
## 🔍 Quality Issues Identified
|
||||
|
||||
### Weak Connections (similarity < 0.3): 106
|
||||
- sec-introduction-ai-pervasiveness-8891 → sec-ml-systems-overview-db10 (similarity: 0.266)
|
||||
- sec-introduction-ai-pervasiveness-8891 → sec-dl-primer-overview-9e60 (similarity: 0.255)
|
||||
- sec-introduction-ai-pervasiveness-8891 → sec-ai-frameworks-overview-f051 (similarity: 0.231)
|
||||
- sec-introduction-ai-pervasiveness-8891 → sec-ai-training-overview-00a3 (similarity: 0.228)
|
||||
- sec-introduction-ai-pervasiveness-8891 → sec-ai-workflow-overview-97fb (similarity: 0.237)
|
||||
|
||||
### Circular References: 18
|
||||
- sec-introduction-ai-pervasiveness-8891->sec-ml-systems-overview-db10 ↔ sec-ml-systems-overview-db10->sec-introduction-ai-pervasiveness-8891
|
||||
- sec-introduction-ai-pervasiveness-8891->sec-dl-primer-overview-9e60 ↔ sec-dl-primer-overview-9e60->sec-introduction-ai-pervasiveness-8891
|
||||
- sec-introduction-ai-pervasiveness-8891->sec-ai-frameworks-overview-f051 ↔ sec-ai-frameworks-overview-f051->sec-introduction-ai-pervasiveness-8891
|
||||
- sec-introduction-ai-pervasiveness-8891->sec-ai-training-overview-00a3 ↔ sec-ai-training-overview-00a3->sec-introduction-ai-pervasiveness-8891
|
||||
- sec-introduction-ai-pervasiveness-8891->sec-ai-workflow-overview-97fb ↔ sec-ai-workflow-overview-97fb->sec-introduction-ai-pervasiveness-8891
|
||||
- sec-ml-systems-overview-db10->sec-dl-primer-overview-9e60 ↔ sec-dl-primer-overview-9e60->sec-ml-systems-overview-db10
|
||||
- sec-ml-systems-overview-db10->sec-ai-frameworks-overview-f051 ↔ sec-ai-frameworks-overview-f051->sec-ml-systems-overview-db10
|
||||
- sec-ml-systems-overview-db10->sec-ai-training-overview-00a3 ↔ sec-ai-training-overview-00a3->sec-ml-systems-overview-db10
|
||||
- sec-ml-systems-overview-db10->sec-ai-workflow-overview-97fb ↔ sec-ai-workflow-overview-97fb->sec-ml-systems-overview-db10
|
||||
- sec-dl-primer-overview-9e60->sec-ai-frameworks-overview-f051 ↔ sec-ai-frameworks-overview-f051->sec-dl-primer-overview-9e60
|
||||
- sec-dl-primer-overview-9e60->sec-ai-training-overview-00a3 ↔ sec-ai-training-overview-00a3->sec-dl-primer-overview-9e60
|
||||
- sec-dl-primer-overview-9e60->sec-efficient-ai-overview-6f6a ↔ sec-efficient-ai-overview-6f6a->sec-dl-primer-overview-9e60
|
||||
- sec-dl-primer-overview-9e60->sec-model-optimizations-overview-b523 ↔ sec-model-optimizations-overview-b523->sec-dl-primer-overview-9e60
|
||||
- sec-dl-primer-overview-9e60->sec-ai-workflow-overview-97fb ↔ sec-ai-workflow-overview-97fb->sec-dl-primer-overview-9e60
|
||||
- sec-ai-frameworks-overview-f051->sec-ai-training-overview-00a3 ↔ sec-ai-training-overview-00a3->sec-ai-frameworks-overview-f051
|
||||
- sec-efficient-ai-overview-6f6a->sec-model-optimizations-overview-b523 ↔ sec-model-optimizations-overview-b523->sec-efficient-ai-overview-6f6a
|
||||
- sec-ondevice-learning-overview-c195->sec-ai-good-overview-c977 ↔ sec-ai-good-overview-c977->sec-ondevice-learning-overview-c195
|
||||
- sec-ondevice-learning-overview-c195->sec-security-privacy-overview-af7c ↔ sec-security-privacy-overview-af7c->sec-ondevice-learning-overview-c195
|
||||
|
||||
## 💡 Recommendations for Fine-Tuning
|
||||
1. **Remove weak connections** with similarity < 0.3
|
||||
2. **Limit sections to 5-6 connections** maximum
|
||||
3. **Improve generic explanations** with specific pedagogical value
|
||||
4. **Balance connection types** within sections
|
||||
5. **Review circular references** for pedagogical value
|
||||
|
||||
## 🎯 Proposed Target Metrics
|
||||
- **Total Connections**: 800-900 (from current 1,083)
|
||||
- **Connections per Section**: 3-5 average, 6 maximum
|
||||
- **Minimum Similarity**: 0.35
|
||||
- **Connection Type Balance**: No single type >60% per section
|
||||
@@ -1,236 +0,0 @@
|
||||
# Cross-Reference Generation & Integration Recipe
|
||||
|
||||
## Overview
|
||||
This recipe documents the complete process for generating AI-powered cross-references with explanations and integrating them into the ML Systems textbook.
|
||||
|
||||
## Prerequisites
|
||||
|
||||
### Software Requirements
|
||||
```bash
|
||||
# Python dependencies
|
||||
pip install sentence-transformers scikit-learn numpy torch pyyaml pypandoc requests
|
||||
|
||||
# Ollama for AI explanations
|
||||
brew install ollama # macOS
|
||||
# or: curl -fsSL https://ollama.ai/install.sh | sh # Linux
|
||||
|
||||
# Download recommended model (best quality from experiments)
|
||||
ollama run llama3.1:8b
|
||||
```
|
||||
|
||||
### Hardware
|
||||
- **GPU recommended** for domain-adapted model training
|
||||
- **16GB+ RAM** for processing 93 sections across 19 chapters
|
||||
- **SSD storage** for faster model loading
|
||||
|
||||
## Step 1: Generate Cross-References with Explanations
|
||||
|
||||
### Quick Command (Recommended)
|
||||
```bash
|
||||
# Generate cross-references with explanations using optimal settings
|
||||
python3 ./scripts/cross_refs/cross_refs.py \
|
||||
-g \
|
||||
-m ./scripts/cross_refs/t5-mlsys-domain-adapted/ \
|
||||
-o data/cross_refs.json \
|
||||
-d ./contents/core/ \
|
||||
-t 0.5 \
|
||||
--explain \
|
||||
--ollama-model llama3.1:8b
|
||||
```
|
||||
|
||||
### Parameters Explained
|
||||
- **`-t 0.5`**: Similarity threshold (0.5 = 230 refs, good balance of quality/quantity)
|
||||
- **`--ollama-model llama3.1:8b`**: Best quality model from systematic experiments
|
||||
- **Domain-adapted model**: `t5-mlsys-domain-adapted/` provides better results than base models
|
||||
|
||||
### Alternative Thresholds
|
||||
```bash
|
||||
# Higher quality, fewer references (92 refs)
|
||||
python3 ./scripts/cross_refs/cross_refs.py ... -t 0.6
|
||||
|
||||
# More references, lower quality (294 refs)
|
||||
python3 ./scripts/cross_refs/cross_refs.py ... -t 0.4
|
||||
|
||||
# Very high quality, very few (36 refs)
|
||||
python3 ./scripts/cross_refs/cross_refs.py ... -t 0.65
|
||||
```
|
||||
|
||||
### Expected Output
|
||||
```
|
||||
✅ Generated 230 cross-references across 18 files.
|
||||
📊 Average similarity: 0.591
|
||||
📄 Results saved to: data/cross_refs.json
|
||||
```
|
||||
|
||||
## Step 2: Quality Evaluation (Optional)
|
||||
|
||||
### Evaluate with LLM Judges
|
||||
```bash
|
||||
# Evaluate sample with Student, TA, Instructor judges
|
||||
python3 ./scripts/cross_refs/evaluate_explanations.py \
|
||||
data/cross_refs.json \
|
||||
--sample 20 \
|
||||
--output evaluation_results.json
|
||||
```
|
||||
|
||||
### Expected Quality Metrics
|
||||
- **Target Score**: 3.5+ out of 5.0
|
||||
- **Student Judge**: Most accepting (focuses on clarity)
|
||||
- **TA Judge**: Most critical (focuses on pedagogy)
|
||||
- **Instructor Judge**: Balanced (focuses on academic rigor)
|
||||
|
||||
## Step 3: Integration into Book
|
||||
|
||||
### Configure Quarto
|
||||
Ensure `_quarto.yml` has cross-reference configuration:
|
||||
```yaml
|
||||
cross-references:
|
||||
file: "data/cross_refs.json"
|
||||
enabled: true
|
||||
|
||||
filters:
|
||||
- lua/inject_crossrefs.lua # Must come before custom-numbered-blocks
|
||||
- custom-numbered-blocks
|
||||
- lua/margin-connections.lua # Must come after custom-numbered-blocks
|
||||
```
|
||||
|
||||
### Test with Single Chapter
|
||||
```bash
|
||||
# Test with introduction only
|
||||
quarto render contents/core/introduction/introduction.qmd --to pdf
|
||||
```
|
||||
|
||||
### Build Full Book
|
||||
```bash
|
||||
# Render complete book
|
||||
quarto render --to pdf
|
||||
```
|
||||
|
||||
## Step 4: Handle Common Issues
|
||||
|
||||
### Float Issues ("Too many unprocessed floats")
|
||||
If you get float overflow errors, add to `tex/header-includes.tex`:
|
||||
```latex
|
||||
\usepackage{placeins}
|
||||
\newcommand{\sectionfloatclear}{\FloatBarrier}
|
||||
\newcommand{\chapterfloatclear}{\clearpage}
|
||||
|
||||
% Flush floats at sections and chapters
|
||||
\let\oldsection\section
|
||||
\renewcommand{\section}{\sectionfloatclear\oldsection}
|
||||
|
||||
\let\oldchapter\chapter
|
||||
\renewcommand{\chapter}{\chapterfloatclear\oldchapter}
|
||||
```
|
||||
|
||||
### Missing References
|
||||
If some cross-references don't resolve:
|
||||
```bash
|
||||
# Check section IDs are correct
|
||||
grep -r "sec-" contents/core/ | head -10
|
||||
|
||||
# Regenerate with verbose logging
|
||||
python3 ./scripts/cross_refs/cross_refs.py ... --verbose
|
||||
```
|
||||
|
||||
### Ollama Connection Issues
|
||||
```bash
|
||||
# Check if Ollama is running
|
||||
curl http://localhost:11434/api/tags
|
||||
|
||||
# Start Ollama service
|
||||
ollama serve
|
||||
|
||||
# List available models
|
||||
ollama list
|
||||
```
|
||||
|
||||
## Step 5: Optimization Settings
|
||||
|
||||
### Model Selection Priority
|
||||
1. **llama3.1:8b** - Best quality (8.0/10 from experiments) ⭐
|
||||
2. **qwen2.5:7b** - Fast alternative (7.8/10 quality)
|
||||
3. **gemma2:9b** - Good balance
|
||||
4. **phi3:3.8b** - High quality but verbose
|
||||
|
||||
### Threshold Guidelines
|
||||
| Use Case | Threshold | Expected Count | Quality |
|
||||
|----------|-----------|----------------|---------|
|
||||
| **Recommended** | 0.5 | 230 refs | Good balance |
|
||||
| High quality | 0.6 | 92 refs | Excellent |
|
||||
| Comprehensive | 0.4 | 294 refs | Acceptable |
|
||||
| Elite only | 0.65 | 36 refs | Premium |
|
||||
|
||||
## Troubleshooting
|
||||
|
||||
### Performance Issues
|
||||
- **Slow generation**: Use `qwen2.5:7b` instead of `llama3.1:8b`
|
||||
- **Memory issues**: Reduce `--max-suggestions` from 5 to 3
|
||||
- **Large output**: Use higher threshold (0.6+)
|
||||
|
||||
### Quality Issues
|
||||
- **Poor explanations**: Check Ollama model is correct version
|
||||
- **Generic text**: Regenerate with different `--seed` value
|
||||
- **Wrong direction**: Verify file ordering in `_quarto.yml`
|
||||
|
||||
### Build Issues
|
||||
- **LaTeX errors**: Check `tex/header-includes.tex` for conflicts
|
||||
- **Missing sections**: Verify all `.qmd` files have proper section IDs
|
||||
- **Slow builds**: Use `quarto render --cache` for faster rebuilds
|
||||
|
||||
## File Structure
|
||||
```
|
||||
scripts/cross_refs/
|
||||
├── cross_refs.py # Main generation script
|
||||
├── evaluate_explanations.py # LLM judge evaluation
|
||||
├── filters.yml # Content filtering rules
|
||||
├── t5-mlsys-domain-adapted/ # Domain-adapted model
|
||||
└── RECIPE.md # This documentation
|
||||
|
||||
data/
|
||||
└── cross_refs.json # Generated cross-references
|
||||
|
||||
lua/
|
||||
├── inject_crossrefs.lua # Injection filter
|
||||
└── margin-connections.lua # PDF margin rendering
|
||||
```
|
||||
|
||||
## Success Metrics
|
||||
- ✅ **230 cross-references** generated with threshold 0.5
|
||||
- ✅ **3.6+ average quality** from LLM judge evaluation
|
||||
- ✅ **Clean PDF build** without float or reference errors
|
||||
- ✅ **Margin notes** render correctly in PDF output
|
||||
- ✅ **Connection callouts** display properly in HTML
|
||||
|
||||
## Maintenance
|
||||
|
||||
### Updating Cross-References
|
||||
When content changes significantly:
|
||||
```bash
|
||||
# Regenerate cross-references
|
||||
python3 ./scripts/cross_refs/cross_refs.py -g ...
|
||||
|
||||
# Re-evaluate quality
|
||||
python3 ./scripts/cross_refs/evaluate_explanations.py ...
|
||||
|
||||
# Test build
|
||||
quarto render --to pdf
|
||||
```
|
||||
|
||||
### Model Updates
|
||||
When new Ollama models become available:
|
||||
```bash
|
||||
# Download new model
|
||||
ollama run new-model:version
|
||||
|
||||
# Test with sample
|
||||
python3 ./scripts/cross_refs/cross_refs.py ... --ollama-model new-model:version --sample 10
|
||||
|
||||
# Evaluate quality difference
|
||||
python3 ./scripts/cross_refs/evaluate_explanations.py ...
|
||||
```
|
||||
|
||||
---
|
||||
|
||||
**Last Updated**: July 2025
|
||||
**Tested With**: Quarto 1.5+, Ollama 0.3+, Python 3.8+
|
||||
@@ -1,114 +0,0 @@
|
||||
# Cross-Reference System Refinement Summary
|
||||
|
||||
## 🎯 Refinement Complete
|
||||
|
||||
The cross-reference system has been successfully analyzed, fine-tuned, and optimized for better pedagogical value and reduced cognitive load.
|
||||
|
||||
## 📊 Before vs After Comparison
|
||||
|
||||
| Metric | Before (Production) | After (Refined) | Improvement |
|
||||
|--------|---------------------|-----------------|-------------|
|
||||
| **Total Connections** | 1,083 | 637 | -41.2% reduction |
|
||||
| **Avg per Section** | 5.9 | 3.7 | Optimal range achieved |
|
||||
| **Weak Connections** | 106 | 0 | 100% eliminated |
|
||||
| **Min Similarity** | 0.266 | 0.35 | +31.6% quality boost |
|
||||
| **Max per Section** | 7 | 5 | Cognitive load reduced |
|
||||
|
||||
## 🔍 Quality Improvements
|
||||
|
||||
### 1. **Connection Quality**
|
||||
- ✅ Removed 265 weak connections (similarity < 0.35)
|
||||
- ✅ Eliminated connections with generic explanations
|
||||
- ✅ Improved pedagogical value of remaining connections
|
||||
|
||||
### 2. **Cognitive Load Management**
|
||||
- ✅ Limited sections to maximum 5 connections
|
||||
- ✅ Average reduced from 5.9 to 3.7 connections/section
|
||||
- ✅ Removed 50 excess connections from overloaded sections
|
||||
|
||||
### 3. **Connection Type Balance**
|
||||
- ✅ Background: 54.2% → Better balanced
|
||||
- ✅ Preview: 45.8% → Better balanced
|
||||
- ✅ No section dominated by single connection type
|
||||
|
||||
### 4. **Circular References**
|
||||
- ✅ Applied 20% quality penalty to circular references
|
||||
- ✅ Kept only high-value bidirectional connections
|
||||
- ✅ Reduced redundancy while maintaining navigational value
|
||||
|
||||
## 📈 Key Metrics Achieved
|
||||
|
||||
### Target Goals ✅
|
||||
- **Total Connections**: 800-900 → Achieved 637 (even better!)
|
||||
- **Connections per Section**: 3-5 average → Achieved 3.7
|
||||
- **Maximum per Section**: 6 → Achieved 5
|
||||
- **Minimum Similarity**: 0.35 → Achieved 100%
|
||||
- **Type Balance**: <60% single type → Achieved
|
||||
|
||||
## 🎨 Explanation Improvements
|
||||
|
||||
Enhanced explanations now provide specific pedagogical context:
|
||||
- Background connections: "Provides foundational understanding of..."
|
||||
- Preview connections: "Explores optimization techniques in..."
|
||||
- Security/Privacy: "Addresses security implications in..."
|
||||
- Ethics/Sustainability: "Considers ethical dimensions through..."
|
||||
|
||||
## 🚀 Implementation Status
|
||||
|
||||
### Files Updated
|
||||
1. **Refined Data**: `/quarto/data/cross_refs_refined.json` (637 connections)
|
||||
2. **PDF Config**: Points to refined cross-references
|
||||
3. **Quality Report**: Comprehensive analysis available
|
||||
|
||||
### Build Testing
|
||||
- ✅ PDF build successful with refined connections
|
||||
- ✅ HTML build continues without cross-references
|
||||
- ✅ No build errors or warnings
|
||||
|
||||
## 💡 Impact on Student Experience
|
||||
|
||||
### Before (1,083 connections)
|
||||
- **Risk**: Cognitive overload with too many connections
|
||||
- **Issue**: Some sections had 7+ connections
|
||||
- **Problem**: Many weak, unhelpful connections
|
||||
|
||||
### After (637 connections)
|
||||
- **Benefit**: Focused, high-quality connections only
|
||||
- **Improvement**: Manageable 3-4 connections per section
|
||||
- **Result**: Each connection adds real pedagogical value
|
||||
|
||||
## 📝 Recommendations for Ongoing Maintenance
|
||||
|
||||
1. **Regular Quality Checks**
|
||||
- Run quality analyzer quarterly
|
||||
- Monitor average connections per section
|
||||
- Ensure minimum similarity stays above 0.35
|
||||
|
||||
2. **Content Updates**
|
||||
- When adding new chapters, aim for 3-5 connections per section
|
||||
- Focus on pedagogical value over quantity
|
||||
- Balance Background and Preview connections
|
||||
|
||||
3. **User Feedback Integration**
|
||||
- Collect feedback on connection helpfulness
|
||||
- Adjust thresholds based on student usage data
|
||||
- Consider A/B testing different connection densities
|
||||
|
||||
## ✅ Summary
|
||||
|
||||
The refined cross-reference system represents a **significant improvement** in pedagogical quality:
|
||||
|
||||
- **41.2% reduction** in total connections eliminates noise
|
||||
- **100% elimination** of weak connections improves signal
|
||||
- **Optimal density** of 3.7 connections/section prevents overload
|
||||
- **Enhanced explanations** provide clear learning value
|
||||
|
||||
The system now provides **focused, high-quality navigation** that enhances learning without overwhelming students. Each connection serves a clear pedagogical purpose and maintains a minimum quality threshold.
|
||||
|
||||
---
|
||||
|
||||
**Status**: ✅ **REFINEMENT COMPLETE**
|
||||
**Current System**: Refined (637 high-quality connections)
|
||||
**Ready for**: Production deployment in PDF builds
|
||||
|
||||
*Generated by Claude Code - Cross-Reference Quality Refinement Project*
|
||||
@@ -1,30 +0,0 @@
|
||||
{
|
||||
"total_connections": 816,
|
||||
"chapters_with_connections": 21,
|
||||
"cognitive_load_distribution": {
|
||||
"medium": 466,
|
||||
"high": 26,
|
||||
"low": 324
|
||||
},
|
||||
"connection_type_distribution": {
|
||||
"conceptual_bridge": 458,
|
||||
"optional_deepdive": 26,
|
||||
"progressive_extension": 8,
|
||||
"prerequisite_foundation": 324
|
||||
},
|
||||
"placement_distribution": {
|
||||
"section_transition": 458,
|
||||
"expandable": 26,
|
||||
"section_end": 8,
|
||||
"chapter_start": 324
|
||||
},
|
||||
"optimization_principles": [
|
||||
"prerequisite_foundation",
|
||||
"conceptual_bridge",
|
||||
"progressive_extension",
|
||||
"application_example",
|
||||
"optional_deepdive"
|
||||
],
|
||||
"generation_date": "2025-09-12 07:39:21",
|
||||
"research_basis": "Cognitive Load Theory 2024, Educational Design Principles, Hyperlink Placement Research"
|
||||
}
|
||||
@@ -1,662 +0,0 @@
|
||||
{
|
||||
"experiment_1": {
|
||||
"total_sections": 200,
|
||||
"total_connections": 6024,
|
||||
"coverage": 1.0,
|
||||
"avg_connections_per_section": 30.12,
|
||||
"sample_connections": {
|
||||
"introduction:sec-introduction-ai-pervasiveness-8891": [
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-ml-basics-fa82",
|
||||
"target_title": "AI and ML Basics",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-evolution-8ff4",
|
||||
"target_title": "AI Evolution",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-engineering-e9d8",
|
||||
"target_title": "ML Systems Engineering",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-defining-ml-systems-bf7d",
|
||||
"target_title": "Defining ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-lifecycle-ml-systems-6194",
|
||||
"target_title": "Lifecycle of ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-wild-8f2f",
|
||||
"target_title": "ML Systems in the Wild",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-impact-lifecycle-fb60",
|
||||
"target_title": "ML Systems Impact on Lifecycle",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-practical-applications-0728",
|
||||
"target_title": "Practical Applications",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-challenges-ml-systems-7167",
|
||||
"target_title": "Challenges in ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-looking-ahead-34a3",
|
||||
"target_title": "Looking Ahead",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-book-structure-learning-path-f3ea",
|
||||
"target_title": "Book Structure and Learning Path",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
}
|
||||
],
|
||||
"introduction:sec-introduction-ai-ml-basics-fa82": [
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-pervasiveness-8891",
|
||||
"target_title": "AI Pervasiveness",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-evolution-8ff4",
|
||||
"target_title": "AI Evolution",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-engineering-e9d8",
|
||||
"target_title": "ML Systems Engineering",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-defining-ml-systems-bf7d",
|
||||
"target_title": "Defining ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-lifecycle-ml-systems-6194",
|
||||
"target_title": "Lifecycle of ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-wild-8f2f",
|
||||
"target_title": "ML Systems in the Wild",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-impact-lifecycle-fb60",
|
||||
"target_title": "ML Systems Impact on Lifecycle",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-practical-applications-0728",
|
||||
"target_title": "Practical Applications",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-challenges-ml-systems-7167",
|
||||
"target_title": "Challenges in ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-looking-ahead-34a3",
|
||||
"target_title": "Looking Ahead",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-book-structure-learning-path-f3ea",
|
||||
"target_title": "Book Structure and Learning Path",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
}
|
||||
],
|
||||
"introduction:sec-introduction-ai-evolution-8ff4": [
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-pervasiveness-8891",
|
||||
"target_title": "AI Pervasiveness",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ai-ml-basics-fa82",
|
||||
"target_title": "AI and ML Basics",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-engineering-e9d8",
|
||||
"target_title": "ML Systems Engineering",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-defining-ml-systems-bf7d",
|
||||
"target_title": "Defining ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-lifecycle-ml-systems-6194",
|
||||
"target_title": "Lifecycle of ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-wild-8f2f",
|
||||
"target_title": "ML Systems in the Wild",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-ml-systems-impact-lifecycle-fb60",
|
||||
"target_title": "ML Systems Impact on Lifecycle",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-practical-applications-0728",
|
||||
"target_title": "Practical Applications",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-challenges-ml-systems-7167",
|
||||
"target_title": "Challenges in ML Systems",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-looking-ahead-34a3",
|
||||
"target_title": "Looking Ahead",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "introduction",
|
||||
"target_section": "sec-introduction-book-structure-learning-path-f3ea",
|
||||
"target_title": "Book Structure and Learning Path",
|
||||
"strength": 0.3517857142857145,
|
||||
"concepts": [
|
||||
"machine learning systems engineering",
|
||||
"ai pervasiveness",
|
||||
"ai and ml fundamentals",
|
||||
"ai evolution and history",
|
||||
"ai winters"
|
||||
]
|
||||
}
|
||||
]
|
||||
},
|
||||
"execution_time": 1.4926798343658447
|
||||
},
|
||||
"experiment_2": {
|
||||
"forward_connections": 8,
|
||||
"backward_connections": 8,
|
||||
"bidirectional_ratio": 1.0,
|
||||
"sample_forward": {
|
||||
"introduction": [],
|
||||
"ml_systems": [
|
||||
{
|
||||
"target": "ondevice_learning",
|
||||
"type": "forward",
|
||||
"strength": 0.031578947368421054,
|
||||
"concepts": [
|
||||
"mobile ml",
|
||||
"tinyml",
|
||||
"federated learning"
|
||||
]
|
||||
}
|
||||
],
|
||||
"dl_primer": []
|
||||
},
|
||||
"sample_backward": {
|
||||
"introduction": [],
|
||||
"ml_systems": [
|
||||
{
|
||||
"source": "ondevice_learning",
|
||||
"type": "backward",
|
||||
"strength": 0.031578947368421054,
|
||||
"concepts": [
|
||||
"mobile ml",
|
||||
"tinyml",
|
||||
"federated learning"
|
||||
]
|
||||
}
|
||||
],
|
||||
"dl_primer": []
|
||||
},
|
||||
"execution_time": 2.8810579776763916
|
||||
},
|
||||
"experiment_3": {
|
||||
"threshold_analysis": {
|
||||
"0.01": {
|
||||
"total_connections": 26,
|
||||
"coverage": 0.8181818181818182,
|
||||
"avg_per_chapter": 1.1818181818181819,
|
||||
"quality_score": 0.21272727272727274
|
||||
},
|
||||
"0.02": {
|
||||
"total_connections": 12,
|
||||
"coverage": 0.45454545454545453,
|
||||
"avg_per_chapter": 0.5454545454545454,
|
||||
"quality_score": 0.05454545454545454
|
||||
},
|
||||
"0.03": {
|
||||
"total_connections": 8,
|
||||
"coverage": 0.3181818181818182,
|
||||
"avg_per_chapter": 0.36363636363636365,
|
||||
"quality_score": 0.025454545454545455
|
||||
},
|
||||
"0.05": {
|
||||
"total_connections": 2,
|
||||
"coverage": 0.09090909090909091,
|
||||
"avg_per_chapter": 0.09090909090909091,
|
||||
"quality_score": 0.0018181818181818182
|
||||
},
|
||||
"0.08": {
|
||||
"total_connections": 0,
|
||||
"coverage": 0.0,
|
||||
"avg_per_chapter": 0.0,
|
||||
"quality_score": 0.0
|
||||
},
|
||||
"0.1": {
|
||||
"total_connections": 0,
|
||||
"coverage": 0.0,
|
||||
"avg_per_chapter": 0.0,
|
||||
"quality_score": 0.0
|
||||
}
|
||||
},
|
||||
"optimal_threshold": 0.01,
|
||||
"optimal_stats": {
|
||||
"total_connections": 26,
|
||||
"coverage": 0.8181818181818182,
|
||||
"avg_per_chapter": 1.1818181818181819,
|
||||
"quality_score": 0.21272727272727274
|
||||
},
|
||||
"execution_time": 17.24936294555664
|
||||
},
|
||||
"experiment_4": {
|
||||
"connection_types": [
|
||||
"foundation",
|
||||
"prerequisite",
|
||||
"builds_on",
|
||||
"implements",
|
||||
"applies",
|
||||
"extends",
|
||||
"relates",
|
||||
"contrasts",
|
||||
"example"
|
||||
],
|
||||
"type_distribution": {
|
||||
"prerequisite": 1,
|
||||
"builds_on": 1
|
||||
},
|
||||
"type_percentages": {
|
||||
"prerequisite": 50.0,
|
||||
"builds_on": 50.0
|
||||
},
|
||||
"total_connections": 2,
|
||||
"sample_by_type": {
|
||||
"prerequisite": [
|
||||
{
|
||||
"source": "frontiers",
|
||||
"target": "emerging_topics",
|
||||
"strength": 0.05333333333333333,
|
||||
"concepts": [
|
||||
"technology convergence",
|
||||
"research frontiers",
|
||||
"future applications"
|
||||
]
|
||||
}
|
||||
],
|
||||
"builds_on": [
|
||||
{
|
||||
"source": "emerging_topics",
|
||||
"target": "frontiers",
|
||||
"strength": 0.05333333333333333,
|
||||
"concepts": [
|
||||
"technology convergence",
|
||||
"future applications",
|
||||
"research frontiers"
|
||||
]
|
||||
}
|
||||
]
|
||||
},
|
||||
"execution_time": 2.6563971042633057
|
||||
},
|
||||
"experiment_5_introduction": {
|
||||
"chapter_start": {
|
||||
"locations": 1,
|
||||
"avg_connections_per_location": 3,
|
||||
"total_connections": 3,
|
||||
"pedagogical_impact": "High - sets context",
|
||||
"readability_impact": "Low - doesn't clutter"
|
||||
},
|
||||
"section_start": {
|
||||
"locations": 12,
|
||||
"avg_connections_per_location": 2,
|
||||
"total_connections": 24,
|
||||
"pedagogical_impact": "Very High - contextual",
|
||||
"readability_impact": "Medium - some clutter"
|
||||
},
|
||||
"contextual_inline": {
|
||||
"locations": 36,
|
||||
"avg_connections_per_location": 1,
|
||||
"total_connections": 36,
|
||||
"pedagogical_impact": "Medium - can be distracting",
|
||||
"readability_impact": "High - significant clutter"
|
||||
}
|
||||
},
|
||||
"experiment_5_ml_systems": {
|
||||
"chapter_start": {
|
||||
"locations": 1,
|
||||
"avg_connections_per_location": 3,
|
||||
"total_connections": 3,
|
||||
"pedagogical_impact": "High - sets context",
|
||||
"readability_impact": "Low - doesn't clutter"
|
||||
},
|
||||
"section_start": {
|
||||
"locations": 10,
|
||||
"avg_connections_per_location": 2,
|
||||
"total_connections": 20,
|
||||
"pedagogical_impact": "Very High - contextual",
|
||||
"readability_impact": "Medium - some clutter"
|
||||
},
|
||||
"contextual_inline": {
|
||||
"locations": 30,
|
||||
"avg_connections_per_location": 1,
|
||||
"total_connections": 30,
|
||||
"pedagogical_impact": "Medium - can be distracting",
|
||||
"readability_impact": "High - significant clutter"
|
||||
}
|
||||
},
|
||||
"experiment_5_dl_primer": {
|
||||
"chapter_start": {
|
||||
"locations": 1,
|
||||
"avg_connections_per_location": 3,
|
||||
"total_connections": 3,
|
||||
"pedagogical_impact": "High - sets context",
|
||||
"readability_impact": "Low - doesn't clutter"
|
||||
},
|
||||
"section_start": {
|
||||
"locations": 8,
|
||||
"avg_connections_per_location": 2,
|
||||
"total_connections": 16,
|
||||
"pedagogical_impact": "Very High - contextual",
|
||||
"readability_impact": "Medium - some clutter"
|
||||
},
|
||||
"contextual_inline": {
|
||||
"locations": 24,
|
||||
"avg_connections_per_location": 1,
|
||||
"total_connections": 24,
|
||||
"pedagogical_impact": "Medium - can be distracting",
|
||||
"readability_impact": "High - significant clutter"
|
||||
}
|
||||
},
|
||||
"experiment_5_summary": {
|
||||
"strategies_evaluated": [
|
||||
"chapter_start",
|
||||
"section_start",
|
||||
"contextual_inline",
|
||||
"section_end",
|
||||
"mixed_adaptive"
|
||||
],
|
||||
"recommended_approach": "section_start",
|
||||
"rationale": "Best balance of pedagogical value and readability",
|
||||
"execution_time": 0.002827167510986328
|
||||
}
|
||||
}
|
||||
@@ -1,13 +0,0 @@
|
||||
{
|
||||
"total_connections": 1146,
|
||||
"chapters_with_connections": 21,
|
||||
"connection_type_distribution": {
|
||||
"foundation": 529,
|
||||
"prerequisite": 106,
|
||||
"extends": 230,
|
||||
"complements": 200,
|
||||
"applies": 81
|
||||
},
|
||||
"generation_date": "2025-09-12 11:30:45",
|
||||
"generator_version": "1.0"
|
||||
}
|
||||
@@ -1,614 +0,0 @@
|
||||
{
|
||||
"experiment_a": {
|
||||
"total_sections": 200,
|
||||
"connected_sections": 38,
|
||||
"total_connections": 140,
|
||||
"avg_connections_per_section": 3.6842105263157894,
|
||||
"section_coverage": 0.19,
|
||||
"sample_connections": {
|
||||
"ml_systems:sec-ml-systems-overview-db10": [
|
||||
{
|
||||
"target_chapter": "efficient_ai",
|
||||
"target_section": "sec-efficient-ai-overview-6f6a",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.030272108843537416,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"energy efficiency"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "optimizations",
|
||||
"target_section": "sec-model-optimizations-overview-b523",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.027483443708609275,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"knowledge distillation"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "ondevice_learning",
|
||||
"target_section": "sec-ondevice-learning-overview-c195",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.038698630136986295,
|
||||
"concepts": [
|
||||
"model compression",
|
||||
"energy efficiency",
|
||||
"latency optimization"
|
||||
]
|
||||
}
|
||||
],
|
||||
"ml_systems:sec-ml-systems-cloudbased-machine-learning-7606": [
|
||||
{
|
||||
"target_chapter": "efficient_ai",
|
||||
"target_section": "sec-efficient-ai-overview-6f6a",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.030272108843537416,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"energy efficiency"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "optimizations",
|
||||
"target_section": "sec-model-optimizations-overview-b523",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.027483443708609275,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"knowledge distillation"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target_chapter": "ondevice_learning",
|
||||
"target_section": "sec-ondevice-learning-overview-c195",
|
||||
"target_title": "Overview",
|
||||
"strength": 0.038698630136986295,
|
||||
"concepts": [
|
||||
"model compression",
|
||||
"energy efficiency",
|
||||
"latency optimization"
|
||||
]
|
||||
}
|
||||
]
|
||||
},
|
||||
"execution_time": 2.8242580890655518
|
||||
},
|
||||
"experiment_b": {
|
||||
"threshold_analysis": {
|
||||
"0.005": {
|
||||
"total_connections": 238,
|
||||
"coverage": 1.0,
|
||||
"avg_quality": 0.5451270326397196,
|
||||
"composite_score": 0.8635381097919158,
|
||||
"connections_per_chapter": 10.818181818181818
|
||||
},
|
||||
"0.008": {
|
||||
"total_connections": 202,
|
||||
"coverage": 1.0,
|
||||
"avg_quality": 0.5556839742750035,
|
||||
"composite_score": 0.866705192282501,
|
||||
"connections_per_chapter": 9.181818181818182
|
||||
},
|
||||
"0.01": {
|
||||
"total_connections": 156,
|
||||
"coverage": 1.0,
|
||||
"avg_quality": 0.594509053730751,
|
||||
"composite_score": 0.8783527161192253,
|
||||
"connections_per_chapter": 7.090909090909091
|
||||
},
|
||||
"0.015": {
|
||||
"total_connections": 106,
|
||||
"coverage": 0.9545454545454546,
|
||||
"avg_quality": 0.6094947329054752,
|
||||
"composite_score": 0.8646666016898245,
|
||||
"connections_per_chapter": 4.818181818181818
|
||||
},
|
||||
"0.02": {
|
||||
"total_connections": 70,
|
||||
"coverage": 0.8636363636363636,
|
||||
"avg_quality": 0.6137011637536383,
|
||||
"composite_score": 0.829564894580637,
|
||||
"connections_per_chapter": 3.1818181818181817
|
||||
},
|
||||
"0.025": {
|
||||
"total_connections": 52,
|
||||
"coverage": 0.8181818181818182,
|
||||
"avg_quality": 0.629927244840484,
|
||||
"composite_score": 0.8162509007248725,
|
||||
"connections_per_chapter": 2.3636363636363638
|
||||
},
|
||||
"0.03": {
|
||||
"total_connections": 36,
|
||||
"coverage": 0.5909090909090909,
|
||||
"avg_quality": 0.6465534751229627,
|
||||
"composite_score": 0.6463296789005252,
|
||||
"connections_per_chapter": 1.6363636363636365
|
||||
}
|
||||
},
|
||||
"optimal_threshold": 0.01,
|
||||
"optimal_stats": {
|
||||
"total_connections": 156,
|
||||
"coverage": 1.0,
|
||||
"avg_quality": 0.594509053730751,
|
||||
"composite_score": 0.8783527161192253,
|
||||
"connections_per_chapter": 7.090909090909091
|
||||
},
|
||||
"execution_time": 19.94976496696472
|
||||
},
|
||||
"experiment_c": {
|
||||
"connection_types_found": [
|
||||
"advanced_application",
|
||||
"theory_to_practice",
|
||||
"peer_concept",
|
||||
"sequential_progression",
|
||||
"strong_conceptual_link",
|
||||
"optimization_related",
|
||||
"builds_on_foundation",
|
||||
"practice_to_optimization",
|
||||
"topical_connection",
|
||||
"systems_related",
|
||||
"complementary_approach"
|
||||
],
|
||||
"type_distribution": {
|
||||
"advanced_application": 6,
|
||||
"theory_to_practice": 2,
|
||||
"peer_concept": 14,
|
||||
"sequential_progression": 9,
|
||||
"strong_conceptual_link": 1,
|
||||
"optimization_related": 1,
|
||||
"builds_on_foundation": 25,
|
||||
"practice_to_optimization": 3,
|
||||
"topical_connection": 2,
|
||||
"systems_related": 1,
|
||||
"complementary_approach": 6
|
||||
},
|
||||
"type_percentages": {
|
||||
"advanced_application": 8.571428571428571,
|
||||
"theory_to_practice": 2.857142857142857,
|
||||
"peer_concept": 20.0,
|
||||
"sequential_progression": 12.857142857142856,
|
||||
"strong_conceptual_link": 1.4285714285714286,
|
||||
"optimization_related": 1.4285714285714286,
|
||||
"builds_on_foundation": 35.714285714285715,
|
||||
"practice_to_optimization": 4.285714285714286,
|
||||
"topical_connection": 2.857142857142857,
|
||||
"systems_related": 1.4285714285714286,
|
||||
"complementary_approach": 8.571428571428571
|
||||
},
|
||||
"total_connections": 70,
|
||||
"level_consistency": 0.6857142857142857,
|
||||
"sample_by_type": {
|
||||
"advanced_application": [
|
||||
{
|
||||
"source": "ml_systems",
|
||||
"target": "efficient_ai",
|
||||
"strength": 0.030272108843537416,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"energy efficiency"
|
||||
],
|
||||
"source_level": 1,
|
||||
"target_level": 4
|
||||
},
|
||||
{
|
||||
"source": "ml_systems",
|
||||
"target": "optimizations",
|
||||
"strength": 0.027483443708609275,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"knowledge distillation"
|
||||
],
|
||||
"source_level": 1,
|
||||
"target_level": 4
|
||||
}
|
||||
],
|
||||
"theory_to_practice": [
|
||||
{
|
||||
"source": "dl_primer",
|
||||
"target": "frameworks",
|
||||
"strength": 0.036,
|
||||
"concepts": [
|
||||
"backpropagation",
|
||||
"computational graph",
|
||||
"gradient computation"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 3
|
||||
},
|
||||
{
|
||||
"source": "dl_primer",
|
||||
"target": "training",
|
||||
"strength": 0.048999999999999995,
|
||||
"concepts": [
|
||||
"backpropagation",
|
||||
"gradient descent",
|
||||
"activation functions"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 3
|
||||
}
|
||||
],
|
||||
"peer_concept": [
|
||||
{
|
||||
"source": "workflow",
|
||||
"target": "data_engineering",
|
||||
"strength": 0.04685534591194969,
|
||||
"concepts": [
|
||||
"problem definition",
|
||||
"data versioning",
|
||||
"feature engineering"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 2
|
||||
},
|
||||
{
|
||||
"source": "data_engineering",
|
||||
"target": "workflow",
|
||||
"strength": 0.04685534591194969,
|
||||
"concepts": [
|
||||
"data versioning",
|
||||
"data drift",
|
||||
"systematic problem definition"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 2
|
||||
}
|
||||
],
|
||||
"sequential_progression": [
|
||||
{
|
||||
"source": "workflow",
|
||||
"target": "frameworks",
|
||||
"strength": 0.0329192546583851,
|
||||
"concepts": [
|
||||
"model versioning",
|
||||
"performance optimization",
|
||||
"scalability planning"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 3
|
||||
},
|
||||
{
|
||||
"source": "workflow",
|
||||
"target": "benchmarking",
|
||||
"strength": 0.027678571428571424,
|
||||
"concepts": [
|
||||
"a/b testing",
|
||||
"cross-validation",
|
||||
"model selection"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 3
|
||||
}
|
||||
],
|
||||
"strong_conceptual_link": [
|
||||
{
|
||||
"source": "workflow",
|
||||
"target": "ops",
|
||||
"strength": 0.08397435897435895,
|
||||
"concepts": [
|
||||
"mlops (machine learning operations)",
|
||||
"experiment tracking",
|
||||
"model versioning"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 4
|
||||
}
|
||||
],
|
||||
"optimization_related": [
|
||||
{
|
||||
"source": "data_engineering",
|
||||
"target": "ops",
|
||||
"strength": 0.026988636363636364,
|
||||
"concepts": [
|
||||
"metadata management",
|
||||
"performance optimization",
|
||||
"compliance management"
|
||||
],
|
||||
"source_level": 2,
|
||||
"target_level": 4
|
||||
}
|
||||
],
|
||||
"builds_on_foundation": [
|
||||
{
|
||||
"source": "frameworks",
|
||||
"target": "dl_primer",
|
||||
"strength": 0.036000000000000004,
|
||||
"concepts": [
|
||||
"computational graph",
|
||||
"gradient computation",
|
||||
"backpropagation"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 2
|
||||
},
|
||||
{
|
||||
"source": "frameworks",
|
||||
"target": "workflow",
|
||||
"strength": 0.0329192546583851,
|
||||
"concepts": [
|
||||
"model versioning",
|
||||
"scalability planning",
|
||||
"performance optimization"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 2
|
||||
}
|
||||
],
|
||||
"practice_to_optimization": [
|
||||
{
|
||||
"source": "frameworks",
|
||||
"target": "efficient_ai",
|
||||
"strength": 0.032848837209302324,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"computer vision",
|
||||
"natural language processing"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 4
|
||||
},
|
||||
{
|
||||
"source": "frameworks",
|
||||
"target": "optimizations",
|
||||
"strength": 0.021067415730337078,
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"computer vision",
|
||||
"natural language processing"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 4
|
||||
}
|
||||
],
|
||||
"topical_connection": [
|
||||
{
|
||||
"source": "training",
|
||||
"target": "ondevice_learning",
|
||||
"strength": 0.045953757225433524,
|
||||
"concepts": [
|
||||
"federated learning",
|
||||
"transfer learning",
|
||||
"curriculum learning"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 5
|
||||
},
|
||||
{
|
||||
"source": "benchmarking",
|
||||
"target": "ondevice_learning",
|
||||
"strength": 0.021703296703296706,
|
||||
"concepts": [
|
||||
"performance profiling",
|
||||
"cross-validation",
|
||||
"latency analysis"
|
||||
],
|
||||
"source_level": 3,
|
||||
"target_level": 5
|
||||
}
|
||||
],
|
||||
"systems_related": [
|
||||
{
|
||||
"source": "efficient_ai",
|
||||
"target": "emerging_topics",
|
||||
"strength": 0.022839506172839506,
|
||||
"concepts": [
|
||||
"evolutionary algorithms",
|
||||
"few-shot learning",
|
||||
"continual learning"
|
||||
],
|
||||
"source_level": 4,
|
||||
"target_level": 6
|
||||
}
|
||||
],
|
||||
"complementary_approach": [
|
||||
{
|
||||
"source": "responsible_ai",
|
||||
"target": "ai_for_good",
|
||||
"strength": 0.026666666666666665,
|
||||
"concepts": [
|
||||
"participatory design",
|
||||
"human-centered design",
|
||||
"monitoring and evaluation"
|
||||
],
|
||||
"source_level": 5,
|
||||
"target_level": 5
|
||||
},
|
||||
{
|
||||
"source": "ai_for_good",
|
||||
"target": "responsible_ai",
|
||||
"strength": 0.026666666666666665,
|
||||
"concepts": [
|
||||
"educational technology",
|
||||
"smart cities",
|
||||
"human-centered design"
|
||||
],
|
||||
"source_level": 5,
|
||||
"target_level": 5
|
||||
}
|
||||
]
|
||||
},
|
||||
"execution_time": 2.9256129264831543
|
||||
},
|
||||
"experiment_d": {
|
||||
"forward_connections": 45,
|
||||
"backward_connections": 44,
|
||||
"asymmetry_ratio": 1.0227272727272727,
|
||||
"asymmetric_examples": [
|
||||
{
|
||||
"chapter": "privacy_security",
|
||||
"forward_count": 1,
|
||||
"backward_count": 0,
|
||||
"asymmetry_ratio": 10.0
|
||||
},
|
||||
{
|
||||
"chapter": "benchmarking",
|
||||
"forward_count": 0,
|
||||
"backward_count": 2,
|
||||
"asymmetry_ratio": 0.0
|
||||
},
|
||||
{
|
||||
"chapter": "emerging_topics",
|
||||
"forward_count": 2,
|
||||
"backward_count": 1,
|
||||
"asymmetry_ratio": 1.8181818181818181
|
||||
},
|
||||
{
|
||||
"chapter": "ondevice_learning",
|
||||
"forward_count": 7,
|
||||
"backward_count": 5,
|
||||
"asymmetry_ratio": 1.3725490196078431
|
||||
},
|
||||
{
|
||||
"chapter": "efficient_ai",
|
||||
"forward_count": 3,
|
||||
"backward_count": 4,
|
||||
"asymmetry_ratio": 0.7317073170731708
|
||||
}
|
||||
],
|
||||
"sample_forward": {
|
||||
"ml_systems": [
|
||||
{
|
||||
"target": "data_engineering",
|
||||
"strength": 0.023013698630136983,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"recommendation systems",
|
||||
"fraud detection",
|
||||
"autonomous vehicles"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target": "efficient_ai",
|
||||
"strength": 0.024217687074829936,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"model quantization",
|
||||
"model compression",
|
||||
"energy efficiency"
|
||||
]
|
||||
}
|
||||
],
|
||||
"dl_primer": [
|
||||
{
|
||||
"target": "frameworks",
|
||||
"strength": 0.043199999999999995,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"backpropagation",
|
||||
"computational graph",
|
||||
"gradient computation"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target": "training",
|
||||
"strength": 0.05879999999999999,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"backpropagation",
|
||||
"gradient descent",
|
||||
"activation functions"
|
||||
]
|
||||
}
|
||||
],
|
||||
"workflow": [
|
||||
{
|
||||
"target": "data_engineering",
|
||||
"strength": 0.028113207547169814,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"problem definition",
|
||||
"data versioning",
|
||||
"feature engineering"
|
||||
]
|
||||
},
|
||||
{
|
||||
"target": "frameworks",
|
||||
"strength": 0.039503105590062114,
|
||||
"type": "leads_to",
|
||||
"concepts": [
|
||||
"model versioning",
|
||||
"performance optimization",
|
||||
"scalability planning"
|
||||
]
|
||||
}
|
||||
]
|
||||
},
|
||||
"sample_backward": {
|
||||
"training": [
|
||||
{
|
||||
"source": "dl_primer",
|
||||
"strength": 0.024499999999999997,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"backpropagation",
|
||||
"gradient descent",
|
||||
"activation functions"
|
||||
]
|
||||
},
|
||||
{
|
||||
"source": "frameworks",
|
||||
"strength": 0.0286144578313253,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"tensor operations",
|
||||
"automatic differentiation",
|
||||
"computational graph"
|
||||
]
|
||||
}
|
||||
],
|
||||
"data_engineering": [
|
||||
{
|
||||
"source": "workflow",
|
||||
"strength": 0.023427672955974845,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"problem definition",
|
||||
"data versioning",
|
||||
"feature engineering"
|
||||
]
|
||||
},
|
||||
{
|
||||
"source": "frameworks",
|
||||
"strength": 0.025697674418604648,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"performance optimization",
|
||||
"computer vision",
|
||||
"natural language processing"
|
||||
]
|
||||
}
|
||||
],
|
||||
"ops": [
|
||||
{
|
||||
"source": "workflow",
|
||||
"strength": 0.04198717948717948,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"mlops (machine learning operations)",
|
||||
"experiment tracking",
|
||||
"model versioning"
|
||||
]
|
||||
},
|
||||
{
|
||||
"source": "privacy_security",
|
||||
"strength": 0.021195652173913043,
|
||||
"type": "builds_on",
|
||||
"concepts": [
|
||||
"incident response",
|
||||
"financial services",
|
||||
"edge computing"
|
||||
]
|
||||
}
|
||||
]
|
||||
},
|
||||
"execution_time": 3.056798219680786
|
||||
}
|
||||
}
|
||||
Reference in New Issue
Block a user