llmfs/README.md
rm-dr 656704b281
Refactor
2025-12-15 21:11:29 -08:00


LLM from scratch

Resources

TODO:

  • chat CLI; evaluate after each epoch

  • better architecture (read nanochat)

  • count tokens

  • download more data (code, full fineweb)

  • better train progress bar

  • Notes

  • TrainTestIterator

    • total length
    • deterministic shuffle
    • prepare in parallel
    • refactor new() into builder
    • small texts (<|bos|>?)
  • Training

    • multi-device training
    • model parameters in file
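
A minimal sketch of two of the TrainTestIterator items above (deterministic shuffle, and a builder in place of `new()`), using only the standard library. All names here (`SplitMix64`, `shuffled_indices`, `SplitBuilder`, `test_fraction`, `seed`) are hypothetical and not taken from this repository; the real iterator may look quite different.

```rust
/// Tiny SplitMix64 generator, used only so this sketch needs no external crates.
struct SplitMix64(u64);

impl SplitMix64 {
    fn next_u64(&mut self) -> u64 {
        self.0 = self.0.wrapping_add(0x9E37_79B9_7F4A_7C15);
        let mut z = self.0;
        z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
        z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
        z ^ (z >> 31)
    }
}

/// Fisher-Yates shuffle of `0..len` driven by a fixed seed:
/// the same seed always yields the same order.
fn shuffled_indices(len: usize, seed: u64) -> Vec<usize> {
    let mut rng = SplitMix64(seed);
    let mut idx: Vec<usize> = (0..len).collect();
    for i in (1..len).rev() {
        // The modulo adds a slight bias, which is acceptable for a sketch.
        let j = (rng.next_u64() % (i as u64 + 1)) as usize;
        idx.swap(i, j);
    }
    idx
}

/// Hypothetical builder: optional settings get defaults and are
/// overridden one call at a time instead of through a long `new()`.
#[derive(Debug, Clone)]
struct SplitBuilder {
    len: usize,
    test_fraction: f64,
    seed: u64,
}

impl SplitBuilder {
    fn new(len: usize) -> Self {
        Self { len, test_fraction: 0.1, seed: 0 }
    }

    fn test_fraction(mut self, fraction: f64) -> Self {
        self.test_fraction = fraction;
        self
    }

    fn seed(mut self, seed: u64) -> Self {
        self.seed = seed;
        self
    }

    /// Returns `(train_indices, test_indices)`; together they cover `0..len`.
    fn build(self) -> (Vec<usize>, Vec<usize>) {
        let mut idx = shuffled_indices(self.len, self.seed);
        let n_test = ((self.len as f64) * self.test_fraction).round() as usize;
        let n_test = n_test.min(self.len);
        let test = idx.split_off(self.len - n_test);
        (idx, test)
    }
}
```

Usage would be along the lines of `SplitBuilder::new(n).test_fraction(0.2).seed(42).build()`. Since the total length is known up front, the train and test lengths are known before any text is prepared.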