nanogpt
https://github.com/karpathy/nanogpt
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported
1 Subscriber
Help out
- Issues
- loss increase when finetuning on shakespeare datasets with gpt-xl and mps device
- does the seed has to be the same in sample.py as in train.py?
- Can I train the model on Intel GPU like A730m?
- Is it possible to try using mse instead of cross entropy as the loss function?
- Use own, custom tokens
- High Loss Value When Training NanoGPT on a Single Small GPU
- Why the shapes of Q/K/V weight matrices are all (C//n_heads, C)?
- Setting RNG state while looping through model generate, Reproducibility.
- Which Python version can be used
- AssertionError when trying to run sample.py
- Docs
- not yet supported