  • Rust 97.8%
  • Nix 2.2%
.gitea/workflows CI 2025-12-13 08:44:35 -08:00
crates Refactor 2025-12-15 21:59:14 -08:00
.editorconfig Refactor 2025-12-15 21:59:14 -08:00
.gitignore Tokenizer 2025-12-13 08:44:38 -08:00
Cargo.lock Refactor 2025-12-15 21:59:14 -08:00
Cargo.toml Refactor 2025-12-15 21:59:14 -08:00
lychee.toml CI 2025-12-13 08:44:35 -08:00
README.md Refactor 2025-12-15 21:59:14 -08:00
rustfmt.toml Initial commit 2025-12-11 17:37:12 -08:00
shell.nix nix 2025-12-16 22:11:43 -08:00
typos.toml CI 2025-12-13 08:44:35 -08:00

LLM from scratch

Resources

TODO:

  • chat CLI, evaluate each epoch

  • better arch (read nanochat)

  • count tokens

  • download more data (code, full fineweb)

  • better train progress bar

  • Notes

  • TrainTestIterator

    • total length
    • deterministic shuffle
    • prepare in parallel
    • refactor new() into builder
    • small texts (<|bos|>?)
  • Training

    • multi-device training
    • model parameters in file
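
A rough sketch of how the "deterministic shuffle" and "refactor new() into builder" items for TrainTestIterator might fit together. Everything here is an assumption made for illustration: `SplitBuilder`, its fields, and `shuffle_in_place` are hypothetical names, not the repo's actual API, and the small SplitMix64 generator is a std-only stand-in for a seeded RNG from the `rand` crate.

```rust
/// SplitMix64: a tiny seedable PRNG (std-only stand-in for the `rand` crate).
struct SplitMix64(u64);

impl SplitMix64 {
    fn next_u64(&mut self) -> u64 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

/// Fisher-Yates shuffle driven by the seeded PRNG: same seed, same order.
fn shuffle_in_place<T>(items: &mut [T], rng: &mut SplitMix64) {
    for i in (1..items.len()).rev() {
        // Modulo bias is negligible for a sketch like this one.
        let j = (rng.next_u64() % (i as u64 + 1)) as usize;
        items.swap(i, j);
    }
}

/// Hypothetical builder; field names are illustrative only.
#[derive(Debug, Clone)]
pub struct SplitBuilder {
    seed: u64,
    test_fraction: f64,
    shuffle: bool,
}

impl Default for SplitBuilder {
    fn default() -> Self {
        Self { seed: 0, test_fraction: 0.1, shuffle: true }
    }
}

impl SplitBuilder {
    pub fn seed(mut self, seed: u64) -> Self {
        self.seed = seed;
        self
    }

    pub fn test_fraction(mut self, fraction: f64) -> Self {
        self.test_fraction = fraction.clamp(0.0, 1.0);
        self
    }

    pub fn shuffle(mut self, enabled: bool) -> Self {
        self.shuffle = enabled;
        self
    }

    /// Returns `(train_indices, test_indices)` covering `0..len`.
    pub fn build(&self, len: usize) -> (Vec<usize>, Vec<usize>) {
        let mut indices: Vec<usize> = (0..len).collect();
        if self.shuffle {
            shuffle_in_place(&mut indices, &mut SplitMix64(self.seed));
        }
        let n_test = (len as f64 * self.test_fraction).round() as usize;
        // `indices` keeps the first `n_test` entries (test); the rest is train.
        let train = indices.split_off(n_test);
        (train, indices)
    }
}
```

With this shape, `SplitBuilder::default().seed(42).test_fraction(0.2).build(n)` yields the same train/test index split on every run, and optional settings can be added later without changing a `new()` signature.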