llmfs/README.md
rm-dr 0bf0aca1ab
2025-12-15 20:31:47 -08:00

# LLM from scratch
## Resources
- [Build a Large Language Model (From Scratch)](https://www.manning.com/books/build-a-large-language-model-from-scratch)
- [Writing an LLM from scratch, part 28](https://www.gilesthomas.com/2025/12/llm-from-scratch-28-training-a-base-model-from-scratch)
- [nanochat](https://github.com/karpathy/nanochat)
## TODO:
- chat CLI, evaluate each epoch
- better architecture (read nanochat)
- count tokens
- download more data (code, full FineWeb)
- Notes
- comments
- `TrainTestIterator`
  - total length
  - deterministic shuffle
  - prepare in parallel
  - refactor `new()` into builder
  - small texts (`<|bos|>`?)
- Training
  - multi-device training
  - model parameters in file
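
A minimal sketch of the "deterministic shuffle" item, assuming the train/test split works on sample indices and that a fixed seed should always reproduce the same order. `SplitMix64`, `deterministic_shuffle`, and `train_test_indices` are hypothetical names for illustration, not existing code in this repo. It uses only the standard library; a real implementation would more likely use a seeded RNG from the `rand` crate.

```rust
/// SplitMix64: a tiny seedable PRNG, used here only so the sketch needs no crates.
struct SplitMix64(u64);

impl SplitMix64 {
    fn next_u64(&mut self) -> u64 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

/// Fisher-Yates shuffle driven by a fixed seed: same seed, same order.
fn deterministic_shuffle<T>(items: &mut [T], seed: u64) {
    let mut rng = SplitMix64(seed);
    for i in (1..items.len()).rev() {
        // The modulo introduces a slight bias; acceptable for a sketch.
        let j = (rng.next_u64() % (i as u64 + 1)) as usize;
        items.swap(i, j);
    }
}

/// Hypothetical helper: shuffle the indices 0..len, then split off the test set.
/// Returns (train_indices, test_indices).
fn train_test_indices(len: usize, test_fraction: f64, seed: u64) -> (Vec<usize>, Vec<usize>) {
    let mut idx: Vec<usize> = (0..len).collect();
    deterministic_shuffle(&mut idx, seed);
    // Clamp so an out-of-range fraction cannot exceed the available samples.
    let n_test = ((len as f64 * test_fraction).round() as usize).min(len);
    // `split_off` keeps the first n_test entries in `idx` and returns the rest.
    let train = idx.split_off(n_test);
    (train, idx)
}
```

Calling `train_test_indices(100, 0.1, 42)` twice yields identical train and test sets, so the split stays stable across runs as long as the seed and dataset length do not change.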