nanogpt
https://github.com/karpathy/nanogpt
Triage Issues!
When you volunteer to triage issues, you'll receive an email each day with a link to an open issue that needs help in this project. You'll also receive instructions on how to triage issues.
Triage Docs!
Receive a documented method or class from your favorite GitHub repos in your inbox every day. If you're really pro, receive undocumented methods or classes and supercharge your commit history.
not yet supported1 Subscribers
Add a CodeTriage badge to nanogpt
Help out
- Issues
- Make wandb training logs public
- How to load the GPT-2 model
- Just a question
- GPU specs for finetuning gpt2-xl
- Signal: Segmentation fault
- Never succeed for downloading and splitting openwebtext
- The GPU utilization very low?
- Multi GPUs training is very slow
- int8 training/inference
- Should `loss` be divided by `gradient_accumulation_steps`?
- Docs
- not yet supported