No description
- Rust 97.8%
- Nix 2.2%
| .gitea/workflows | ||
| crates | ||
| .editorconfig | ||
| .gitignore | ||
| Cargo.lock | ||
| Cargo.toml | ||
| lychee.toml | ||
| README.md | ||
| rustfmt.toml | ||
| shell.nix | ||
| typos.toml | ||
LLM from scratch
Resources
TODO:
-
chat cli, evaluate each epoch
-
better arch (read nanochat)
-
count tokens
-
download more data (code, full fineweb)
-
better train progress bar
-
Notes
-
TrainTestIterator
- total length
- deterministic shuffle
- prepare in parallel
- refactor new() into builder
- small texts (<|bos|>?)
-
Training
- multi-device training
- model parameters in file