1
0

Refactor
Some checks failed
CI / Check links (push) Successful in 7s
CI / Check typos (push) Failing after 10s
CI / Clippy (push) Failing after 57s
CI / Build and test (push) Has been cancelled

This commit is contained in:
2025-12-15 20:27:34 -08:00
parent c59ca2164e
commit cd79814c91
19 changed files with 1572 additions and 1066 deletions

26
README.md Normal file
View File

@@ -0,0 +1,26 @@
# LLM from scratch
## Resources
- [Build a Large Language Model](https://www.manning.com/books/build-a-large-language-model-from-scratch)
- [Writing an LLM from scratch, part 28](https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch)
- [nanochat](https://github.com/karpathy/nanochat)
## TODO:
- chat cli, evaluate each epoch
- better arch (read nanochat)
- count tokens
- download more data (code, full fineweb)
- Notes
- comments
- TrainTestIterator
- total length
- deterministic shuffle
- prepare in parallel
- refactor new() into builder
- small texts (<|bos|>?)
- Training
- multi-device training
- model parameters in file