1
0

Refactor
Some checks failed
CI / Check typos (push) Failing after 9s
CI / Check links (push) Successful in 8s
CI / Clippy (push) Failing after 1m0s
CI / Build and test (push) Failing after 5m14s

This commit is contained in:
2025-12-15 20:31:47 -08:00
parent c59ca2164e
commit 0bf0aca1ab
19 changed files with 1584 additions and 1066 deletions

26
README.md Normal file
View File

@@ -0,0 +1,26 @@
# LLM from scratch
## Resources
- [Build a Large Language Model](https://www.manning.com/books/build-a-large-language-model-from-scratch)
- [Writing an LLM from scratch, part 28](https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch)
- [nanochat](https://github.com/karpathy/nanochat)
## TODO:
- chat cli, evaluate each epoch
- better arch (read nanochat)
- count tokens
- download more data (code, full fineweb)
- Notes
- comments
- TrainTestIterator
- total length
- deterministic shuffle
- prepare in parallel
- refactor new() into builder
- small texts (<|bos|>?)
- Training
- multi-device training
- model parameters in file