mirror of
https://github.com/harvard-edge/cs249r_book.git
synced 2026-05-08 02:28:25 -05:00
[PR #1115] [MERGED] fix: miscellaneous fix for Tokenizer #1125
📋 Pull Request Information
Original PR: https://github.com/harvard-edge/cs249r_book/pull/1115
Author: @minhdang26403
Created: 1/19/2026
Status: ✅ Merged
Merged: 1/19/2026
Merged by: @profvjreddi
Base: dev ← Head: fix/tokenizer
📝 Commits (1)
7af2499 fix: miscellaneous fix for Tokenizer
📊 Changes
1 file changed (+7 additions, -9 deletions)
📝 tinytorch/src/10_tokenization/10_tokenization.py (+7 -9)
📄 Description
This pull request contains several miscellaneous fixes for the Tokenizer module implementation:
- Use `vocab` (which is a set) instead of `self.vocab` (which is a list) for O(1) lookup.
- The `sorted(...)` function accepts an iterable and returns a list, so we don't need to convert a Python set to a Python list before passing it to `sorted`.

🔄 This issue represents a GitHub Pull Request. It cannot be merged through Gitea due to API limitations.
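The two points in the description can be illustrated with a minimal sketch. This is not the code from `10_tokenization.py`: the names `corpus_tokens`, `vocab_list`, and `count_unknown` are hypothetical and only demonstrate the general Python behavior the fixes rely on.

```python
# Hypothetical stand-ins; NOT the actual tinytorch Tokenizer code.
corpus_tokens = ["the", "cat", "sat", "on", "the", "mat"]

# A set gives average O(1) membership tests (hash lookup).
vocab = set(corpus_tokens)

# sorted() accepts any iterable and always returns a new list,
# so sorted(list(vocab)) is redundant: sorted(vocab) is equivalent.
vocab_list = sorted(vocab)


def count_unknown(tokens, known):
    """Count tokens that are absent from `known` (a set or a list)."""
    return sum(1 for tok in tokens if tok not in known)


# Same answer either way; only the cost of each `in` check differs.
unknown_via_set = count_unknown(["cat", "dog"], vocab)        # hash lookup per token
unknown_via_list = count_unknown(["cat", "dog"], vocab_list)  # linear scan per token
```

Both calls return the same count. The difference is that each `in` test against the list scans it element by element, which becomes noticeable when the vocabulary is large and the check runs inside a loop.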