nanogpt
https://github.com/karpathy/nanogpt
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported
1 Subscriber
Help out
- Issues
- High validation loss when fine-tuning Shakespeare on gpt-xl?
- Triton Error [CUDA]: invalid argument
- gpt2-xl
- Prepending previous K,V output of attn to current KV
- Configurator will work correctly with default None values
- How do I load trained models?
- Any luck on image generation using min/nano GPT?
- Integrate Aim - an open-source experiment tracker
- Nan's training with 'MPS'
- adding IPEX, and autocast for CPU
- Docs
- not yet supported