nanogpt
https://github.com/karpathy/nanogpt
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported
1 Subscriber
Help out
- Issues
  - nothing has been written into???
  - Why don't we crop attn.weight as well?
  - PyTorch nn.LayerNorm now takes bias arg - removed custom class
  - Why do we need further pretrain given the loss is already converged
  - no cuda training does not work.
  - Training loss converges much earlier compared to max_iters
  - How to Set "vocab_size" and "block_size" for Word Embedding?
  - could nanoGPT be the AI assistant for the development of CAX software?
  - Recommendation for something smaller
  - nanoGPT/model.py where `manual implementation of attention`,Is it correct to modify it like I did?
- Docs
  - not yet supported